Cargando…

Disease named entity recognition by combining conditional random fields and bidirectional recurrent neural networks

The recognition of disease and chemical named entities in scientific articles is a very important subtask in information extraction in the biomedical domain. Due to the diversity and complexity of disease names, the recognition of named entities of diseases is rather tougher than those of chemical n...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Qikang, Chen, Tao, Xu, Ruifeng, He, Yulan, Gui, Lin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5088735/
https://www.ncbi.nlm.nih.gov/pubmed/27777244
http://dx.doi.org/10.1093/database/baw140
_version_ 1782464154920026112
author Wei, Qikang
Chen, Tao
Xu, Ruifeng
He, Yulan
Gui, Lin
author_facet Wei, Qikang
Chen, Tao
Xu, Ruifeng
He, Yulan
Gui, Lin
author_sort Wei, Qikang
collection PubMed
description The recognition of disease and chemical named entities in scientific articles is a very important subtask in information extraction in the biomedical domain. Due to the diversity and complexity of disease names, the recognition of named entities of diseases is rather tougher than those of chemical names. Although there are some remarkable chemical named entity recognition systems available online such as ChemSpot and tmChem, the publicly available recognition systems of disease named entities are rare. This article presents a system for disease named entity recognition (DNER) and normalization. First, two separate DNER models are developed. One is based on conditional random fields model with a rule-based post-processing module. The other one is based on the bidirectional recurrent neural networks. Then the named entities recognized by each of the DNER model are fed into a support vector machine classifier for combining results. Finally, each recognized disease named entity is normalized to a medical subject heading disease name by using a vector space model based method. Experimental results show that using 1000 PubMed abstracts for training, our proposed system achieves an F1-measure of 0.8428 at the mention level and 0.7804 at the concept level, respectively, on the testing data of the chemical-disease relation task in BioCreative V. Database URL: http://219.223.252.210:8080/SS/cdr.html
format Online
Article
Text
id pubmed-5088735
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-50887352016-11-02 Disease named entity recognition by combining conditional random fields and bidirectional recurrent neural networks Wei, Qikang Chen, Tao Xu, Ruifeng He, Yulan Gui, Lin Database (Oxford) Original Article The recognition of disease and chemical named entities in scientific articles is a very important subtask in information extraction in the biomedical domain. Due to the diversity and complexity of disease names, the recognition of named entities of diseases is rather tougher than those of chemical names. Although there are some remarkable chemical named entity recognition systems available online such as ChemSpot and tmChem, the publicly available recognition systems of disease named entities are rare. This article presents a system for disease named entity recognition (DNER) and normalization. First, two separate DNER models are developed. One is based on conditional random fields model with a rule-based post-processing module. The other one is based on the bidirectional recurrent neural networks. Then the named entities recognized by each of the DNER model are fed into a support vector machine classifier for combining results. Finally, each recognized disease named entity is normalized to a medical subject heading disease name by using a vector space model based method. Experimental results show that using 1000 PubMed abstracts for training, our proposed system achieves an F1-measure of 0.8428 at the mention level and 0.7804 at the concept level, respectively, on the testing data of the chemical-disease relation task in BioCreative V. Database URL: http://219.223.252.210:8080/SS/cdr.html Oxford University Press 2016-10-24 /pmc/articles/PMC5088735/ /pubmed/27777244 http://dx.doi.org/10.1093/database/baw140 Text en © The Author(s) 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Wei, Qikang
Chen, Tao
Xu, Ruifeng
He, Yulan
Gui, Lin
Disease named entity recognition by combining conditional random fields and bidirectional recurrent neural networks
title Disease named entity recognition by combining conditional random fields and bidirectional recurrent neural networks
title_full Disease named entity recognition by combining conditional random fields and bidirectional recurrent neural networks
title_fullStr Disease named entity recognition by combining conditional random fields and bidirectional recurrent neural networks
title_full_unstemmed Disease named entity recognition by combining conditional random fields and bidirectional recurrent neural networks
title_short Disease named entity recognition by combining conditional random fields and bidirectional recurrent neural networks
title_sort disease named entity recognition by combining conditional random fields and bidirectional recurrent neural networks
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5088735/
https://www.ncbi.nlm.nih.gov/pubmed/27777244
http://dx.doi.org/10.1093/database/baw140
work_keys_str_mv AT weiqikang diseasenamedentityrecognitionbycombiningconditionalrandomfieldsandbidirectionalrecurrentneuralnetworks
AT chentao diseasenamedentityrecognitionbycombiningconditionalrandomfieldsandbidirectionalrecurrentneuralnetworks
AT xuruifeng diseasenamedentityrecognitionbycombiningconditionalrandomfieldsandbidirectionalrecurrentneuralnetworks
AT heyulan diseasenamedentityrecognitionbycombiningconditionalrandomfieldsandbidirectionalrecurrentneuralnetworks
AT guilin diseasenamedentityrecognitionbycombiningconditionalrandomfieldsandbidirectionalrecurrentneuralnetworks