Cargando…

Concept recognition as a machine translation problem

BACKGROUND: Automated assignment of specific ontology concepts to mentions in text is a critical task in biomedical natural language processing, and the subject of many open shared tasks. Although the current state of the art involves the use of neural network language models as a post-processing st...

Descripción completa

Detalles Bibliográficos
Autores principales:	Boguslav, Mayla R., Hailu, Negacy D., Bada, Michael, Baumgartner, William A., Hunter, Lawrence E.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2021
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8678974/ https://www.ncbi.nlm.nih.gov/pubmed/34920707 http://dx.doi.org/10.1186/s12859-021-04141-4

_version_	1784616418654814208
author	Boguslav, Mayla R. Hailu, Negacy D. Bada, Michael Baumgartner, William A. Hunter, Lawrence E.
author_facet	Boguslav, Mayla R. Hailu, Negacy D. Bada, Michael Baumgartner, William A. Hunter, Lawrence E.
author_sort	Boguslav, Mayla R.
collection	PubMed
description	BACKGROUND: Automated assignment of specific ontology concepts to mentions in text is a critical task in biomedical natural language processing, and the subject of many open shared tasks. Although the current state of the art involves the use of neural network language models as a post-processing step, the very large number of ontology classes to be recognized and the limited amount of gold-standard training data has impeded the creation of end-to-end systems based entirely on machine learning. Recently, Hailu et al. recast the concept recognition problem as a type of machine translation and demonstrated that sequence-to-sequence machine learning models have the potential to outperform multi-class classification approaches. METHODS: We systematically characterize the factors that contribute to the accuracy and efficiency of several approaches to sequence-to-sequence machine learning through extensive studies of alternative methods and hyperparameter selections. We not only identify the best-performing systems and parameters across a wide variety of ontologies but also provide insights into the widely varying resource requirements and hyperparameter robustness of alternative approaches. Analysis of the strengths and weaknesses of such systems suggest promising avenues for future improvements as well as design choices that can increase computational efficiency with small costs in performance. RESULTS: Bidirectional encoder representations from transformers for biomedical text mining (BioBERT) for span detection along with the open-source toolkit for neural machine translation (OpenNMT) for concept normalization achieve state-of-the-art performance for most ontologies annotated in the CRAFT Corpus. This approach uses substantially fewer computational resources, including hardware, memory, and time than several alternative approaches. CONCLUSIONS: Machine translation is a promising avenue for fully machine-learning-based concept recognition that achieves state-of-the-art results on the CRAFT Corpus, evaluated via a direct comparison to previous results from the 2019 CRAFT shared task. Experiments illuminating the reasons for the surprisingly good performance of sequence-to-sequence methods targeting ontology identifiers suggest that further progress may be possible by mapping to alternative target concept representations. All code and models can be found at: https://github.com/UCDenver-ccp/Concept-Recognition-as-Translation.
format	Online Article Text
id	pubmed-8678974
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-86789742021-12-17 Concept recognition as a machine translation problem Boguslav, Mayla R. Hailu, Negacy D. Bada, Michael Baumgartner, William A. Hunter, Lawrence E. BMC Bioinformatics Research BACKGROUND: Automated assignment of specific ontology concepts to mentions in text is a critical task in biomedical natural language processing, and the subject of many open shared tasks. Although the current state of the art involves the use of neural network language models as a post-processing step, the very large number of ontology classes to be recognized and the limited amount of gold-standard training data has impeded the creation of end-to-end systems based entirely on machine learning. Recently, Hailu et al. recast the concept recognition problem as a type of machine translation and demonstrated that sequence-to-sequence machine learning models have the potential to outperform multi-class classification approaches. METHODS: We systematically characterize the factors that contribute to the accuracy and efficiency of several approaches to sequence-to-sequence machine learning through extensive studies of alternative methods and hyperparameter selections. We not only identify the best-performing systems and parameters across a wide variety of ontologies but also provide insights into the widely varying resource requirements and hyperparameter robustness of alternative approaches. Analysis of the strengths and weaknesses of such systems suggest promising avenues for future improvements as well as design choices that can increase computational efficiency with small costs in performance. RESULTS: Bidirectional encoder representations from transformers for biomedical text mining (BioBERT) for span detection along with the open-source toolkit for neural machine translation (OpenNMT) for concept normalization achieve state-of-the-art performance for most ontologies annotated in the CRAFT Corpus. This approach uses substantially fewer computational resources, including hardware, memory, and time than several alternative approaches. CONCLUSIONS: Machine translation is a promising avenue for fully machine-learning-based concept recognition that achieves state-of-the-art results on the CRAFT Corpus, evaluated via a direct comparison to previous results from the 2019 CRAFT shared task. Experiments illuminating the reasons for the surprisingly good performance of sequence-to-sequence methods targeting ontology identifiers suggest that further progress may be possible by mapping to alternative target concept representations. All code and models can be found at: https://github.com/UCDenver-ccp/Concept-Recognition-as-Translation. BioMed Central 2021-12-17 /pmc/articles/PMC8678974/ /pubmed/34920707 http://dx.doi.org/10.1186/s12859-021-04141-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Boguslav, Mayla R. Hailu, Negacy D. Bada, Michael Baumgartner, William A. Hunter, Lawrence E. Concept recognition as a machine translation problem
title	Concept recognition as a machine translation problem
title_full	Concept recognition as a machine translation problem
title_fullStr	Concept recognition as a machine translation problem
title_full_unstemmed	Concept recognition as a machine translation problem
title_short	Concept recognition as a machine translation problem
title_sort	concept recognition as a machine translation problem
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8678974/ https://www.ncbi.nlm.nih.gov/pubmed/34920707 http://dx.doi.org/10.1186/s12859-021-04141-4
work_keys_str_mv	AT boguslavmaylar conceptrecognitionasamachinetranslationproblem AT hailunegacyd conceptrecognitionasamachinetranslationproblem AT badamichael conceptrecognitionasamachinetranslationproblem AT baumgartnerwilliama conceptrecognitionasamachinetranslationproblem AT hunterlawrencee conceptrecognitionasamachinetranslationproblem

Concept recognition as a machine translation problem

Ejemplares similares