Cargando…

CNN-based ranking for biomedical entity normalization

BACKGROUND: Most state-of-the-art biomedical entity normalization systems, such as rule-based systems, merely rely on morphological information of entity mentions, but rarely consider their semantic information. In this paper, we introduce a novel convolutional neural network (CNN) architecture that...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Haodi, Chen, Qingcai, Tang, Buzhou, Wang, Xiaolong, Xu, Hua, Wang, Baohua, Huang, Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5629610/
https://www.ncbi.nlm.nih.gov/pubmed/28984180
http://dx.doi.org/10.1186/s12859-017-1805-7
_version_ 1783269079083122688
author Li, Haodi
Chen, Qingcai
Tang, Buzhou
Wang, Xiaolong
Xu, Hua
Wang, Baohua
Huang, Dong
author_facet Li, Haodi
Chen, Qingcai
Tang, Buzhou
Wang, Xiaolong
Xu, Hua
Wang, Baohua
Huang, Dong
author_sort Li, Haodi
collection PubMed
description BACKGROUND: Most state-of-the-art biomedical entity normalization systems, such as rule-based systems, merely rely on morphological information of entity mentions, but rarely consider their semantic information. In this paper, we introduce a novel convolutional neural network (CNN) architecture that regards biomedical entity normalization as a ranking problem and benefits from semantic information of biomedical entities. RESULTS: The CNN-based ranking method first generates candidates using handcrafted rules, and then ranks the candidates according to their semantic information modeled by CNN as well as their morphological information. Experiments on two benchmark datasets for biomedical entity normalization show that our proposed CNN-based ranking method outperforms traditional rule-based method with state-of-the-art performance. CONCLUSIONS: We propose a CNN architecture that regards biomedical entity normalization as a ranking problem. Comparison results show that semantic information is beneficial to biomedical entity normalization and can be well combined with morphological information in our CNN architecture for further improvement.
format Online
Article
Text
id pubmed-5629610
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-56296102017-10-13 CNN-based ranking for biomedical entity normalization Li, Haodi Chen, Qingcai Tang, Buzhou Wang, Xiaolong Xu, Hua Wang, Baohua Huang, Dong BMC Bioinformatics Research BACKGROUND: Most state-of-the-art biomedical entity normalization systems, such as rule-based systems, merely rely on morphological information of entity mentions, but rarely consider their semantic information. In this paper, we introduce a novel convolutional neural network (CNN) architecture that regards biomedical entity normalization as a ranking problem and benefits from semantic information of biomedical entities. RESULTS: The CNN-based ranking method first generates candidates using handcrafted rules, and then ranks the candidates according to their semantic information modeled by CNN as well as their morphological information. Experiments on two benchmark datasets for biomedical entity normalization show that our proposed CNN-based ranking method outperforms traditional rule-based method with state-of-the-art performance. CONCLUSIONS: We propose a CNN architecture that regards biomedical entity normalization as a ranking problem. Comparison results show that semantic information is beneficial to biomedical entity normalization and can be well combined with morphological information in our CNN architecture for further improvement. BioMed Central 2017-10-03 /pmc/articles/PMC5629610/ /pubmed/28984180 http://dx.doi.org/10.1186/s12859-017-1805-7 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Li, Haodi
Chen, Qingcai
Tang, Buzhou
Wang, Xiaolong
Xu, Hua
Wang, Baohua
Huang, Dong
CNN-based ranking for biomedical entity normalization
title CNN-based ranking for biomedical entity normalization
title_full CNN-based ranking for biomedical entity normalization
title_fullStr CNN-based ranking for biomedical entity normalization
title_full_unstemmed CNN-based ranking for biomedical entity normalization
title_short CNN-based ranking for biomedical entity normalization
title_sort cnn-based ranking for biomedical entity normalization
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5629610/
https://www.ncbi.nlm.nih.gov/pubmed/28984180
http://dx.doi.org/10.1186/s12859-017-1805-7
work_keys_str_mv AT lihaodi cnnbasedrankingforbiomedicalentitynormalization
AT chenqingcai cnnbasedrankingforbiomedicalentitynormalization
AT tangbuzhou cnnbasedrankingforbiomedicalentitynormalization
AT wangxiaolong cnnbasedrankingforbiomedicalentitynormalization
AT xuhua cnnbasedrankingforbiomedicalentitynormalization
AT wangbaohua cnnbasedrankingforbiomedicalentitynormalization
AT huangdong cnnbasedrankingforbiomedicalentitynormalization