Cargando…
Hierarchical shared transfer learning for biomedical named entity recognition
BACKGROUND: Biomedical named entity recognition (BioNER) is a basic and important medical information extraction task to extract medical entities with special meaning from medical texts. In recent years, deep learning has become the main research direction of BioNER due to its excellent data-driven...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8729142/ https://www.ncbi.nlm.nih.gov/pubmed/34983362 http://dx.doi.org/10.1186/s12859-021-04551-4 |
_version_ | 1784626876700950528 |
---|---|
author | Chai, Zhaoying Jin, Han Shi, Shenghui Zhan, Siyan Zhuo, Lin Yang, Yu |
author_facet | Chai, Zhaoying Jin, Han Shi, Shenghui Zhan, Siyan Zhuo, Lin Yang, Yu |
author_sort | Chai, Zhaoying |
collection | PubMed |
description | BACKGROUND: Biomedical named entity recognition (BioNER) is a basic and important medical information extraction task to extract medical entities with special meaning from medical texts. In recent years, deep learning has become the main research direction of BioNER due to its excellent data-driven context coding ability. However, in BioNER task, deep learning has the problem of poor generalization and instability. RESULTS: we propose the hierarchical shared transfer learning, which combines multi-task learning and fine-tuning, and realizes the multi-level information fusion between the underlying entity features and the upper data features. We select 14 datasets containing 4 types of entities for training and evaluate the model. The experimental results showed that the F1-scores of the five gold standard datasets BC5CDR-chemical, BC5CDR-disease, BC2GM, BC4CHEMD, NCBI-disease and LINNAEUS were increased by 0.57, 0.90, 0.42, 0.77, 0.98 and − 2.16 compared to the single-task XLNet-CRF model. BC5CDR-chemical, BC5CDR-disease and BC4CHEMD achieved state-of-the-art results.The reasons why LINNAEUS’s multi-task results are lower than single-task results are discussed at the dataset level. CONCLUSION: Compared with using multi-task learning and fine-tuning alone, the model has more accurate recognition ability of medical entities, and has higher generalization and stability. |
format | Online Article Text |
id | pubmed-8729142 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-87291422022-01-07 Hierarchical shared transfer learning for biomedical named entity recognition Chai, Zhaoying Jin, Han Shi, Shenghui Zhan, Siyan Zhuo, Lin Yang, Yu BMC Bioinformatics Research BACKGROUND: Biomedical named entity recognition (BioNER) is a basic and important medical information extraction task to extract medical entities with special meaning from medical texts. In recent years, deep learning has become the main research direction of BioNER due to its excellent data-driven context coding ability. However, in BioNER task, deep learning has the problem of poor generalization and instability. RESULTS: we propose the hierarchical shared transfer learning, which combines multi-task learning and fine-tuning, and realizes the multi-level information fusion between the underlying entity features and the upper data features. We select 14 datasets containing 4 types of entities for training and evaluate the model. The experimental results showed that the F1-scores of the five gold standard datasets BC5CDR-chemical, BC5CDR-disease, BC2GM, BC4CHEMD, NCBI-disease and LINNAEUS were increased by 0.57, 0.90, 0.42, 0.77, 0.98 and − 2.16 compared to the single-task XLNet-CRF model. BC5CDR-chemical, BC5CDR-disease and BC4CHEMD achieved state-of-the-art results.The reasons why LINNAEUS’s multi-task results are lower than single-task results are discussed at the dataset level. CONCLUSION: Compared with using multi-task learning and fine-tuning alone, the model has more accurate recognition ability of medical entities, and has higher generalization and stability. BioMed Central 2022-01-04 /pmc/articles/PMC8729142/ /pubmed/34983362 http://dx.doi.org/10.1186/s12859-021-04551-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Chai, Zhaoying Jin, Han Shi, Shenghui Zhan, Siyan Zhuo, Lin Yang, Yu Hierarchical shared transfer learning for biomedical named entity recognition |
title | Hierarchical shared transfer learning for biomedical named entity recognition |
title_full | Hierarchical shared transfer learning for biomedical named entity recognition |
title_fullStr | Hierarchical shared transfer learning for biomedical named entity recognition |
title_full_unstemmed | Hierarchical shared transfer learning for biomedical named entity recognition |
title_short | Hierarchical shared transfer learning for biomedical named entity recognition |
title_sort | hierarchical shared transfer learning for biomedical named entity recognition |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8729142/ https://www.ncbi.nlm.nih.gov/pubmed/34983362 http://dx.doi.org/10.1186/s12859-021-04551-4 |
work_keys_str_mv | AT chaizhaoying hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition AT jinhan hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition AT shishenghui hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition AT zhansiyan hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition AT zhuolin hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition AT yangyu hierarchicalsharedtransferlearningforbiomedicalnamedentityrecognition |