Cargando…

Improving the dictionary lookup approach for disease normalization using enhanced dictionary and query expansion

The rapidly increasing biomedical literature calls for the need of an automatic approach in the recognition and normalization of disease mentions in order to increase the precision and effectivity of disease based information retrieval. A variety of methods have been proposed to deal with the proble...

Descripción completa

Detalles Bibliográficos
Autores principales: Jonnagaddala, Jitendra, Jue, Toni Rose, Chang, Nai-Wen, Dai, Hong-Jie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4976299/
https://www.ncbi.nlm.nih.gov/pubmed/27504009
http://dx.doi.org/10.1093/database/baw112
_version_ 1782446847196921856
author Jonnagaddala, Jitendra
Jue, Toni Rose
Chang, Nai-Wen
Dai, Hong-Jie
author_facet Jonnagaddala, Jitendra
Jue, Toni Rose
Chang, Nai-Wen
Dai, Hong-Jie
author_sort Jonnagaddala, Jitendra
collection PubMed
description The rapidly increasing biomedical literature calls for the need of an automatic approach in the recognition and normalization of disease mentions in order to increase the precision and effectivity of disease based information retrieval. A variety of methods have been proposed to deal with the problem of disease named entity recognition and normalization. Among all the proposed methods, conditional random fields (CRFs) and dictionary lookup method are widely used for named entity recognition and normalization respectively. We herein developed a CRF-based model to allow automated recognition of disease mentions, and studied the effect of various techniques in improving the normalization results based on the dictionary lookup approach. The dataset from the BioCreative V CDR track was used to report the performance of the developed normalization methods and compare with other existing dictionary lookup based normalization methods. The best configuration achieved an F-measure of 0.77 for the disease normalization, which outperformed the best dictionary lookup based baseline method studied in this work by an F-measure of 0.13. Database URL: https://github.com/TCRNBioinformatics/DiseaseExtract
format Online
Article
Text
id pubmed-4976299
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-49762992016-08-09 Improving the dictionary lookup approach for disease normalization using enhanced dictionary and query expansion Jonnagaddala, Jitendra Jue, Toni Rose Chang, Nai-Wen Dai, Hong-Jie Database (Oxford) Original Article The rapidly increasing biomedical literature calls for the need of an automatic approach in the recognition and normalization of disease mentions in order to increase the precision and effectivity of disease based information retrieval. A variety of methods have been proposed to deal with the problem of disease named entity recognition and normalization. Among all the proposed methods, conditional random fields (CRFs) and dictionary lookup method are widely used for named entity recognition and normalization respectively. We herein developed a CRF-based model to allow automated recognition of disease mentions, and studied the effect of various techniques in improving the normalization results based on the dictionary lookup approach. The dataset from the BioCreative V CDR track was used to report the performance of the developed normalization methods and compare with other existing dictionary lookup based normalization methods. The best configuration achieved an F-measure of 0.77 for the disease normalization, which outperformed the best dictionary lookup based baseline method studied in this work by an F-measure of 0.13. Database URL: https://github.com/TCRNBioinformatics/DiseaseExtract Oxford University Press 2016-08-08 /pmc/articles/PMC4976299/ /pubmed/27504009 http://dx.doi.org/10.1093/database/baw112 Text en © The Author(s) 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Jonnagaddala, Jitendra
Jue, Toni Rose
Chang, Nai-Wen
Dai, Hong-Jie
Improving the dictionary lookup approach for disease normalization using enhanced dictionary and query expansion
title Improving the dictionary lookup approach for disease normalization using enhanced dictionary and query expansion
title_full Improving the dictionary lookup approach for disease normalization using enhanced dictionary and query expansion
title_fullStr Improving the dictionary lookup approach for disease normalization using enhanced dictionary and query expansion
title_full_unstemmed Improving the dictionary lookup approach for disease normalization using enhanced dictionary and query expansion
title_short Improving the dictionary lookup approach for disease normalization using enhanced dictionary and query expansion
title_sort improving the dictionary lookup approach for disease normalization using enhanced dictionary and query expansion
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4976299/
https://www.ncbi.nlm.nih.gov/pubmed/27504009
http://dx.doi.org/10.1093/database/baw112
work_keys_str_mv AT jonnagaddalajitendra improvingthedictionarylookupapproachfordiseasenormalizationusingenhanceddictionaryandqueryexpansion
AT juetonirose improvingthedictionarylookupapproachfordiseasenormalizationusingenhanceddictionaryandqueryexpansion
AT changnaiwen improvingthedictionarylookupapproachfordiseasenormalizationusingenhanceddictionaryandqueryexpansion
AT daihongjie improvingthedictionarylookupapproachfordiseasenormalizationusingenhanceddictionaryandqueryexpansion