Cargando…

Leveraging network analysis to evaluate biomedical named entity recognition tools

The ever-growing availability of biomedical text sources has resulted in a boost in clinical studies based on their exploitation. Biomedical named-entity recognition (bio-NER) techniques have evolved remarkably in recent years and their application in research is increasingly successful. Still, the...

Descripción completa

Detalles Bibliográficos
Autores principales: García del Valle, Eduardo P., Lagunes García, Gerardo, Prieto Santamaría, Lucía, Zanin, Massimiliano, Menasalvas Ruiz, Ernestina, Rodríguez-González, Alejandro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8242017/
https://www.ncbi.nlm.nih.gov/pubmed/34188248
http://dx.doi.org/10.1038/s41598-021-93018-w
_version_ 1783715537461706752
author García del Valle, Eduardo P.
Lagunes García, Gerardo
Prieto Santamaría, Lucía
Zanin, Massimiliano
Menasalvas Ruiz, Ernestina
Rodríguez-González, Alejandro
author_facet García del Valle, Eduardo P.
Lagunes García, Gerardo
Prieto Santamaría, Lucía
Zanin, Massimiliano
Menasalvas Ruiz, Ernestina
Rodríguez-González, Alejandro
author_sort García del Valle, Eduardo P.
collection PubMed
description The ever-growing availability of biomedical text sources has resulted in a boost in clinical studies based on their exploitation. Biomedical named-entity recognition (bio-NER) techniques have evolved remarkably in recent years and their application in research is increasingly successful. Still, the disparity of tools and the limited available validation resources are barriers preventing a wider diffusion, especially within clinical practice. We here propose the use of omics data and network analysis as an alternative for the assessment of bio-NER tools. Specifically, our method introduces quality criteria based on edge overlap and community detection. The application of these criteria to four bio-NER solutions yielded comparable results to strategies based on annotated corpora, without suffering from their limitations. Our approach can constitute a guide both for the selection of the best bio-NER tool given a specific task, and for the creation and validation of novel approaches.
format Online
Article
Text
id pubmed-8242017
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-82420172021-07-06 Leveraging network analysis to evaluate biomedical named entity recognition tools García del Valle, Eduardo P. Lagunes García, Gerardo Prieto Santamaría, Lucía Zanin, Massimiliano Menasalvas Ruiz, Ernestina Rodríguez-González, Alejandro Sci Rep Article The ever-growing availability of biomedical text sources has resulted in a boost in clinical studies based on their exploitation. Biomedical named-entity recognition (bio-NER) techniques have evolved remarkably in recent years and their application in research is increasingly successful. Still, the disparity of tools and the limited available validation resources are barriers preventing a wider diffusion, especially within clinical practice. We here propose the use of omics data and network analysis as an alternative for the assessment of bio-NER tools. Specifically, our method introduces quality criteria based on edge overlap and community detection. The application of these criteria to four bio-NER solutions yielded comparable results to strategies based on annotated corpora, without suffering from their limitations. Our approach can constitute a guide both for the selection of the best bio-NER tool given a specific task, and for the creation and validation of novel approaches. Nature Publishing Group UK 2021-06-29 /pmc/articles/PMC8242017/ /pubmed/34188248 http://dx.doi.org/10.1038/s41598-021-93018-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
García del Valle, Eduardo P.
Lagunes García, Gerardo
Prieto Santamaría, Lucía
Zanin, Massimiliano
Menasalvas Ruiz, Ernestina
Rodríguez-González, Alejandro
Leveraging network analysis to evaluate biomedical named entity recognition tools
title Leveraging network analysis to evaluate biomedical named entity recognition tools
title_full Leveraging network analysis to evaluate biomedical named entity recognition tools
title_fullStr Leveraging network analysis to evaluate biomedical named entity recognition tools
title_full_unstemmed Leveraging network analysis to evaluate biomedical named entity recognition tools
title_short Leveraging network analysis to evaluate biomedical named entity recognition tools
title_sort leveraging network analysis to evaluate biomedical named entity recognition tools
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8242017/
https://www.ncbi.nlm.nih.gov/pubmed/34188248
http://dx.doi.org/10.1038/s41598-021-93018-w
work_keys_str_mv AT garciadelvalleeduardop leveragingnetworkanalysistoevaluatebiomedicalnamedentityrecognitiontools
AT lagunesgarciagerardo leveragingnetworkanalysistoevaluatebiomedicalnamedentityrecognitiontools
AT prietosantamarialucia leveragingnetworkanalysistoevaluatebiomedicalnamedentityrecognitiontools
AT zaninmassimiliano leveragingnetworkanalysistoevaluatebiomedicalnamedentityrecognitiontools
AT menasalvasruizernestina leveragingnetworkanalysistoevaluatebiomedicalnamedentityrecognitiontools
AT rodriguezgonzalezalejandro leveragingnetworkanalysistoevaluatebiomedicalnamedentityrecognitiontools