Cargando…

Consistency enhancement of model prediction on document-level named entity recognition

SUMMARY: Biomedical named entity recognition (NER) plays a crucial role in extracting information from documents in biomedical applications. However, many of these applications require NER models to operate at a document level, rather than just a sentence level. This presents a challenge, as the ext...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jeong, Minbyul, Kang, Jaewoo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2023
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10272703/ https://www.ncbi.nlm.nih.gov/pubmed/37261870 http://dx.doi.org/10.1093/bioinformatics/btad361

_version_	1785059554581544960
author	Jeong, Minbyul Kang, Jaewoo
author_facet	Jeong, Minbyul Kang, Jaewoo
author_sort	Jeong, Minbyul
collection	PubMed
description	SUMMARY: Biomedical named entity recognition (NER) plays a crucial role in extracting information from documents in biomedical applications. However, many of these applications require NER models to operate at a document level, rather than just a sentence level. This presents a challenge, as the extension from a sentence model to a document model is not always straightforward. Despite the existence of document NER models that are able to make consistent predictions, they still fall short of meeting the expectations of researchers and practitioners in the field. To address this issue, we have undertaken an investigation into the underlying causes of inconsistent predictions. Our research has led us to believe that the use of adjectives and prepositions within entities may be contributing to low label consistency. In this article, we present our method, ConNER, to enhance a label consistency of modifiers such as adjectives and prepositions. By refining the labels of these modifiers, ConNER is able to improve representations of biomedical entities. The effectiveness of our method is demonstrated on four popular biomedical NER datasets. On three datasets, we achieve a higher F1 score than the previous state-of-the-art model. Our method shows its efficacy on two datasets, resulting in 7.5%–8.6% absolute improvements in the F1 score. Our findings suggest that our ConNER method is effective on datasets with intrinsically low label consistency. Through qualitative analysis, we demonstrate how our approach helps the NER model generate more consistent predictions. AVAILABILITY AND IMPLEMENTATION: Our code and resources are available at https://github.com/dmis-lab/ConNER/.
format	Online Article Text
id	pubmed-10272703
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-102727032023-06-17 Consistency enhancement of model prediction on document-level named entity recognition Jeong, Minbyul Kang, Jaewoo Bioinformatics Original Paper SUMMARY: Biomedical named entity recognition (NER) plays a crucial role in extracting information from documents in biomedical applications. However, many of these applications require NER models to operate at a document level, rather than just a sentence level. This presents a challenge, as the extension from a sentence model to a document model is not always straightforward. Despite the existence of document NER models that are able to make consistent predictions, they still fall short of meeting the expectations of researchers and practitioners in the field. To address this issue, we have undertaken an investigation into the underlying causes of inconsistent predictions. Our research has led us to believe that the use of adjectives and prepositions within entities may be contributing to low label consistency. In this article, we present our method, ConNER, to enhance a label consistency of modifiers such as adjectives and prepositions. By refining the labels of these modifiers, ConNER is able to improve representations of biomedical entities. The effectiveness of our method is demonstrated on four popular biomedical NER datasets. On three datasets, we achieve a higher F1 score than the previous state-of-the-art model. Our method shows its efficacy on two datasets, resulting in 7.5%–8.6% absolute improvements in the F1 score. Our findings suggest that our ConNER method is effective on datasets with intrinsically low label consistency. Through qualitative analysis, we demonstrate how our approach helps the NER model generate more consistent predictions. AVAILABILITY AND IMPLEMENTATION: Our code and resources are available at https://github.com/dmis-lab/ConNER/. Oxford University Press 2023-06-01 /pmc/articles/PMC10272703/ /pubmed/37261870 http://dx.doi.org/10.1093/bioinformatics/btad361 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Paper Jeong, Minbyul Kang, Jaewoo Consistency enhancement of model prediction on document-level named entity recognition
title	Consistency enhancement of model prediction on document-level named entity recognition
title_full	Consistency enhancement of model prediction on document-level named entity recognition
title_fullStr	Consistency enhancement of model prediction on document-level named entity recognition
title_full_unstemmed	Consistency enhancement of model prediction on document-level named entity recognition
title_short	Consistency enhancement of model prediction on document-level named entity recognition
title_sort	consistency enhancement of model prediction on document-level named entity recognition
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10272703/ https://www.ncbi.nlm.nih.gov/pubmed/37261870 http://dx.doi.org/10.1093/bioinformatics/btad361
work_keys_str_mv	AT jeongminbyul consistencyenhancementofmodelpredictionondocumentlevelnamedentityrecognition AT kangjaewoo consistencyenhancementofmodelpredictionondocumentlevelnamedentityrecognition

Consistency enhancement of model prediction on document-level named entity recognition

Ejemplares similares