Cargando…

Hierarchical shared transfer learning for biomedical named entity recognition

BACKGROUND: Biomedical named entity recognition (BioNER) is a basic and important medical information extraction task to extract medical entities with special meaning from medical texts. In recent years, deep learning has become the main research direction of BioNER due to its excellent data-driven...

Descripción completa

Detalles Bibliográficos
Autores principales: Chai, Zhaoying, Jin, Han, Shi, Shenghui, Zhan, Siyan, Zhuo, Lin, Yang, Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8729142/
https://www.ncbi.nlm.nih.gov/pubmed/34983362
http://dx.doi.org/10.1186/s12859-021-04551-4
_version_ 1784626876700950528
author Chai, Zhaoying
Jin, Han
Shi, Shenghui
Zhan, Siyan
Zhuo, Lin
Yang, Yu
author_facet Chai, Zhaoying
Jin, Han
Shi, Shenghui
Zhan, Siyan
Zhuo, Lin
Yang, Yu
author_sort Chai, Zhaoying
collection PubMed
description BACKGROUND: Biomedical named entity recognition (BioNER) is a basic and important medical information extraction task to extract medical entities with special meaning from medical texts. In recent years, deep learning has become the main research direction of BioNER due to its excellent data-driven context coding ability. However, in BioNER task, deep learning has the problem of poor generalization and instability. RESULTS: we propose the hierarchical shared transfer learning, which combines multi-task learning and fine-tuning, and realizes the multi-level information fusion between the underlying entity features and the upper data features. We select 14 datasets containing 4 types of entities for training and evaluate the model. The experimental results showed that the F1-scores of the five gold standard datasets BC5CDR-chemical, BC5CDR-disease, BC2GM, BC4CHEMD, NCBI-disease and LINNAEUS were increased by 0.57, 0.90, 0.42, 0.77, 0.98 and − 2.16 compared to the single-task XLNet-CRF model. BC5CDR-chemical, BC5CDR-disease and BC4CHEMD achieved state-of-the-art results.The reasons why LINNAEUS’s multi-task results are lower than single-task results are discussed at the dataset level. CONCLUSION: Compared with using multi-task learning and fine-tuning alone, the model has more accurate recognition ability of medical entities, and has higher generalization and stability.
format Online
Article
Text
id pubmed-8729142
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-87291422022-01-07 Hierarchical shared transfer learning for biomedical named entity recognition Chai, Zhaoying Jin, Han Shi, Shenghui Zhan, Siyan Zhuo, Lin Yang, Yu BMC Bioinformatics Research BACKGROUND: Biomedical named entity recognition (BioNER) is a basic and important medical information extraction task to extract medical entities with special meaning from medical texts. In recent years, deep learning has become the main research direction of BioNER due to its excellent data-driven context coding ability. However, in BioNER task, deep learning has the problem of poor generalization and instability. RESULTS: we propose the hierarchical shared transfer learning, which combines multi-task learning and fine-tuning, and realizes the multi-level information fusion between the underlying entity features and the upper data features. We select 14 datasets containing 4 types of entities for training and evaluate the model. The experimental results showed that the F1-scores of the five gold standard datasets BC5CDR-chemical, BC5CDR-disease, BC2GM, BC4CHEMD, NCBI-disease and LINNAEUS were increased by 0.57, 0.90, 0.42, 0.77, 0.98 and − 2.16 compared to the single-task XLNet-CRF model. BC5CDR-chemical, BC5CDR-disease and BC4CHEMD achieved state-of-the-art results.The reasons why LINNAEUS’s multi-task results are lower than single-task results are discussed at the dataset level. CONCLUSION: Compared with using multi-task learning and fine-tuning alone, the model has more accurate recognition ability of medical entities, and has higher generalization and stability. BioMed Central 2022-01-04 /pmc/articles/PMC8729142/ /pubmed/34983362 http://dx.doi.org/10.1186/s12859-021-04551-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Chai, Zhaoying
Jin, Han
Shi, Shenghui
Zhan, Siyan
Zhuo, Lin
Yang, Yu
Hierarchical shared transfer learning for biomedical named entity recognition
title Hierarchical shared transfer learning for biomedical named entity recognition
title_full Hierarchical shared transfer learning for biomedical named entity recognition
title_fullStr Hierarchical shared transfer learning for biomedical named entity recognition
title_full_unstemmed Hierarchical shared transfer learning for biomedical named entity recognition
title_short Hierarchical shared transfer learning for biomedical named entity recognition
title_sort hierarchical shared transfer learning for biomedical named entity recognition
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8729142/
https://www.ncbi.nlm.nih.gov/pubmed/34983362
http://dx.doi.org/10.1186/s12859-021-04551-4
work_keys_str_mv AT chaizhaoying hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition
AT jinhan hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition
AT shishenghui hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition
AT zhansiyan hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition
AT zhuolin hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition
AT yangyu hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition