Cargando…
Distinguished representation of identical mentions in bio-entity coreference resolution
BACKGROUND: Bio-entity Coreference Resolution (CR) is a vital task in biomedical text mining. An important issue in CR is the differential representation of identical mentions as their similar representations may make the coreference more puzzling. However, when extracting features, existing neural...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9063119/ https://www.ncbi.nlm.nih.gov/pubmed/35501781 http://dx.doi.org/10.1186/s12911-022-01862-1 |
_version_ | 1784699099171258368 |
---|---|
author | Li, Yufei Zhou, Xiangyu Ma, Jie Ma, Xiaoyong Cheng, Pengzhen Gong, Tieliang Li, Chen |
author_facet | Li, Yufei Zhou, Xiangyu Ma, Jie Ma, Xiaoyong Cheng, Pengzhen Gong, Tieliang Li, Chen |
author_sort | Li, Yufei |
collection | PubMed |
description | BACKGROUND: Bio-entity Coreference Resolution (CR) is a vital task in biomedical text mining. An important issue in CR is the differential representation of identical mentions as their similar representations may make the coreference more puzzling. However, when extracting features, existing neural network-based models may bring additional noise to the distinction of identical mentions since they tend to get similar or even identical feature representations. METHODS: We propose a context-aware feature attention model to distinguish similar or identical text units effectively for better resolving coreference. The new model can represent the identical mentions based on different contexts by adaptively exploiting features, which enables the model reduce the text noise and capture the semantic information effectively. RESULTS: The experimental results show that the proposed model brings significant improvements on most of the baseline for coreference resolution and mention detection on the BioNLP dataset and CRAFT-CR dataset. The empirical studies further demonstrate its superior performance on the differential representation and coreferential link of identical mentions. CONCLUSIONS: Identical mentions impose difficulties on the current methods of Bio-entity coreference resolution. Thus, we propose the context-aware feature attention model to better distinguish identical mentions and achieve superior performance on both coreference resolution and mention detection, which will further improve the performance of the downstream tasks. |
format | Online Article Text |
id | pubmed-9063119 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-90631192022-05-04 Distinguished representation of identical mentions in bio-entity coreference resolution Li, Yufei Zhou, Xiangyu Ma, Jie Ma, Xiaoyong Cheng, Pengzhen Gong, Tieliang Li, Chen BMC Med Inform Decis Mak Research BACKGROUND: Bio-entity Coreference Resolution (CR) is a vital task in biomedical text mining. An important issue in CR is the differential representation of identical mentions as their similar representations may make the coreference more puzzling. However, when extracting features, existing neural network-based models may bring additional noise to the distinction of identical mentions since they tend to get similar or even identical feature representations. METHODS: We propose a context-aware feature attention model to distinguish similar or identical text units effectively for better resolving coreference. The new model can represent the identical mentions based on different contexts by adaptively exploiting features, which enables the model reduce the text noise and capture the semantic information effectively. RESULTS: The experimental results show that the proposed model brings significant improvements on most of the baseline for coreference resolution and mention detection on the BioNLP dataset and CRAFT-CR dataset. The empirical studies further demonstrate its superior performance on the differential representation and coreferential link of identical mentions. CONCLUSIONS: Identical mentions impose difficulties on the current methods of Bio-entity coreference resolution. Thus, we propose the context-aware feature attention model to better distinguish identical mentions and achieve superior performance on both coreference resolution and mention detection, which will further improve the performance of the downstream tasks. BioMed Central 2022-04-30 /pmc/articles/PMC9063119/ /pubmed/35501781 http://dx.doi.org/10.1186/s12911-022-01862-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Li, Yufei Zhou, Xiangyu Ma, Jie Ma, Xiaoyong Cheng, Pengzhen Gong, Tieliang Li, Chen Distinguished representation of identical mentions in bio-entity coreference resolution |
title | Distinguished representation of identical mentions in bio-entity coreference resolution |
title_full | Distinguished representation of identical mentions in bio-entity coreference resolution |
title_fullStr | Distinguished representation of identical mentions in bio-entity coreference resolution |
title_full_unstemmed | Distinguished representation of identical mentions in bio-entity coreference resolution |
title_short | Distinguished representation of identical mentions in bio-entity coreference resolution |
title_sort | distinguished representation of identical mentions in bio-entity coreference resolution |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9063119/ https://www.ncbi.nlm.nih.gov/pubmed/35501781 http://dx.doi.org/10.1186/s12911-022-01862-1 |
work_keys_str_mv | AT liyufei distinguishedrepresentationofidenticalmentionsinbioentitycoreferenceresolution AT zhouxiangyu distinguishedrepresentationofidenticalmentionsinbioentitycoreferenceresolution AT majie distinguishedrepresentationofidenticalmentionsinbioentitycoreferenceresolution AT maxiaoyong distinguishedrepresentationofidenticalmentionsinbioentitycoreferenceresolution AT chengpengzhen distinguishedrepresentationofidenticalmentionsinbioentitycoreferenceresolution AT gongtieliang distinguishedrepresentationofidenticalmentionsinbioentitycoreferenceresolution AT lichen distinguishedrepresentationofidenticalmentionsinbioentitycoreferenceresolution |