Cargando…

Distinguished representation of identical mentions in bio-entity coreference resolution

BACKGROUND: Bio-entity Coreference Resolution (CR) is a vital task in biomedical text mining. An important issue in CR is the differential representation of identical mentions as their similar representations may make the coreference more puzzling. However, when extracting features, existing neural...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yufei, Zhou, Xiangyu, Ma, Jie, Ma, Xiaoyong, Cheng, Pengzhen, Gong, Tieliang, Li, Chen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9063119/
https://www.ncbi.nlm.nih.gov/pubmed/35501781
http://dx.doi.org/10.1186/s12911-022-01862-1
_version_ 1784699099171258368
author Li, Yufei
Zhou, Xiangyu
Ma, Jie
Ma, Xiaoyong
Cheng, Pengzhen
Gong, Tieliang
Li, Chen
author_facet Li, Yufei
Zhou, Xiangyu
Ma, Jie
Ma, Xiaoyong
Cheng, Pengzhen
Gong, Tieliang
Li, Chen
author_sort Li, Yufei
collection PubMed
description BACKGROUND: Bio-entity Coreference Resolution (CR) is a vital task in biomedical text mining. An important issue in CR is the differential representation of identical mentions as their similar representations may make the coreference more puzzling. However, when extracting features, existing neural network-based models may bring additional noise to the distinction of identical mentions since they tend to get similar or even identical feature representations. METHODS: We propose a context-aware feature attention model to distinguish similar or identical text units effectively for better resolving coreference. The new model can represent the identical mentions based on different contexts by adaptively exploiting features, which enables the model reduce the text noise and capture the semantic information effectively. RESULTS: The experimental results show that the proposed model brings significant improvements on most of the baseline for coreference resolution and mention detection on the BioNLP dataset and CRAFT-CR dataset. The empirical studies further demonstrate its superior performance on the differential representation and coreferential link of identical mentions. CONCLUSIONS: Identical mentions impose difficulties on the current methods of Bio-entity coreference resolution. Thus, we propose the context-aware feature attention model to better distinguish identical mentions and achieve superior performance on both coreference resolution and mention detection, which will further improve the performance of the downstream tasks.
format Online
Article
Text
id pubmed-9063119
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-90631192022-05-04 Distinguished representation of identical mentions in bio-entity coreference resolution Li, Yufei Zhou, Xiangyu Ma, Jie Ma, Xiaoyong Cheng, Pengzhen Gong, Tieliang Li, Chen BMC Med Inform Decis Mak Research BACKGROUND: Bio-entity Coreference Resolution (CR) is a vital task in biomedical text mining. An important issue in CR is the differential representation of identical mentions as their similar representations may make the coreference more puzzling. However, when extracting features, existing neural network-based models may bring additional noise to the distinction of identical mentions since they tend to get similar or even identical feature representations. METHODS: We propose a context-aware feature attention model to distinguish similar or identical text units effectively for better resolving coreference. The new model can represent the identical mentions based on different contexts by adaptively exploiting features, which enables the model reduce the text noise and capture the semantic information effectively. RESULTS: The experimental results show that the proposed model brings significant improvements on most of the baseline for coreference resolution and mention detection on the BioNLP dataset and CRAFT-CR dataset. The empirical studies further demonstrate its superior performance on the differential representation and coreferential link of identical mentions. CONCLUSIONS: Identical mentions impose difficulties on the current methods of Bio-entity coreference resolution. Thus, we propose the context-aware feature attention model to better distinguish identical mentions and achieve superior performance on both coreference resolution and mention detection, which will further improve the performance of the downstream tasks. BioMed Central 2022-04-30 /pmc/articles/PMC9063119/ /pubmed/35501781 http://dx.doi.org/10.1186/s12911-022-01862-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Li, Yufei
Zhou, Xiangyu
Ma, Jie
Ma, Xiaoyong
Cheng, Pengzhen
Gong, Tieliang
Li, Chen
Distinguished representation of identical mentions in bio-entity coreference resolution
title Distinguished representation of identical mentions in bio-entity coreference resolution
title_full Distinguished representation of identical mentions in bio-entity coreference resolution
title_fullStr Distinguished representation of identical mentions in bio-entity coreference resolution
title_full_unstemmed Distinguished representation of identical mentions in bio-entity coreference resolution
title_short Distinguished representation of identical mentions in bio-entity coreference resolution
title_sort distinguished representation of identical mentions in bio-entity coreference resolution
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9063119/
https://www.ncbi.nlm.nih.gov/pubmed/35501781
http://dx.doi.org/10.1186/s12911-022-01862-1
work_keys_str_mv AT liyufei distinguishedrepresentationofidenticalmentionsinbioentitycoreferenceresolution
AT zhouxiangyu distinguishedrepresentationofidenticalmentionsinbioentitycoreferenceresolution
AT majie distinguishedrepresentationofidenticalmentionsinbioentitycoreferenceresolution
AT maxiaoyong distinguishedrepresentationofidenticalmentionsinbioentitycoreferenceresolution
AT chengpengzhen distinguishedrepresentationofidenticalmentionsinbioentitycoreferenceresolution
AT gongtieliang distinguishedrepresentationofidenticalmentionsinbioentitycoreferenceresolution
AT lichen distinguishedrepresentationofidenticalmentionsinbioentitycoreferenceresolution