Cargando…
CNN-based ranking for biomedical entity normalization
BACKGROUND: Most state-of-the-art biomedical entity normalization systems, such as rule-based systems, merely rely on morphological information of entity mentions, but rarely consider their semantic information. In this paper, we introduce a novel convolutional neural network (CNN) architecture that...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5629610/ https://www.ncbi.nlm.nih.gov/pubmed/28984180 http://dx.doi.org/10.1186/s12859-017-1805-7 |
_version_ | 1783269079083122688 |
---|---|
author | Li, Haodi Chen, Qingcai Tang, Buzhou Wang, Xiaolong Xu, Hua Wang, Baohua Huang, Dong |
author_facet | Li, Haodi Chen, Qingcai Tang, Buzhou Wang, Xiaolong Xu, Hua Wang, Baohua Huang, Dong |
author_sort | Li, Haodi |
collection | PubMed |
description | BACKGROUND: Most state-of-the-art biomedical entity normalization systems, such as rule-based systems, merely rely on morphological information of entity mentions, but rarely consider their semantic information. In this paper, we introduce a novel convolutional neural network (CNN) architecture that regards biomedical entity normalization as a ranking problem and benefits from semantic information of biomedical entities. RESULTS: The CNN-based ranking method first generates candidates using handcrafted rules, and then ranks the candidates according to their semantic information modeled by CNN as well as their morphological information. Experiments on two benchmark datasets for biomedical entity normalization show that our proposed CNN-based ranking method outperforms traditional rule-based method with state-of-the-art performance. CONCLUSIONS: We propose a CNN architecture that regards biomedical entity normalization as a ranking problem. Comparison results show that semantic information is beneficial to biomedical entity normalization and can be well combined with morphological information in our CNN architecture for further improvement. |
format | Online Article Text |
id | pubmed-5629610 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-56296102017-10-13 CNN-based ranking for biomedical entity normalization Li, Haodi Chen, Qingcai Tang, Buzhou Wang, Xiaolong Xu, Hua Wang, Baohua Huang, Dong BMC Bioinformatics Research BACKGROUND: Most state-of-the-art biomedical entity normalization systems, such as rule-based systems, merely rely on morphological information of entity mentions, but rarely consider their semantic information. In this paper, we introduce a novel convolutional neural network (CNN) architecture that regards biomedical entity normalization as a ranking problem and benefits from semantic information of biomedical entities. RESULTS: The CNN-based ranking method first generates candidates using handcrafted rules, and then ranks the candidates according to their semantic information modeled by CNN as well as their morphological information. Experiments on two benchmark datasets for biomedical entity normalization show that our proposed CNN-based ranking method outperforms traditional rule-based method with state-of-the-art performance. CONCLUSIONS: We propose a CNN architecture that regards biomedical entity normalization as a ranking problem. Comparison results show that semantic information is beneficial to biomedical entity normalization and can be well combined with morphological information in our CNN architecture for further improvement. BioMed Central 2017-10-03 /pmc/articles/PMC5629610/ /pubmed/28984180 http://dx.doi.org/10.1186/s12859-017-1805-7 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Li, Haodi Chen, Qingcai Tang, Buzhou Wang, Xiaolong Xu, Hua Wang, Baohua Huang, Dong CNN-based ranking for biomedical entity normalization |
title | CNN-based ranking for biomedical entity normalization |
title_full | CNN-based ranking for biomedical entity normalization |
title_fullStr | CNN-based ranking for biomedical entity normalization |
title_full_unstemmed | CNN-based ranking for biomedical entity normalization |
title_short | CNN-based ranking for biomedical entity normalization |
title_sort | cnn-based ranking for biomedical entity normalization |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5629610/ https://www.ncbi.nlm.nih.gov/pubmed/28984180 http://dx.doi.org/10.1186/s12859-017-1805-7 |
work_keys_str_mv | AT lihaodi cnnbasedrankingforbiomedicalentitynormalization AT chenqingcai cnnbasedrankingforbiomedicalentitynormalization AT tangbuzhou cnnbasedrankingforbiomedicalentitynormalization AT wangxiaolong cnnbasedrankingforbiomedicalentitynormalization AT xuhua cnnbasedrankingforbiomedicalentitynormalization AT wangbaohua cnnbasedrankingforbiomedicalentitynormalization AT huangdong cnnbasedrankingforbiomedicalentitynormalization |