Cargando…
JCBIE: a joint continual learning neural network for biomedical information extraction
Extracting knowledge from heterogeneous data sources is fundamental for the construction of structured biomedical knowledge graphs (BKGs), where entities and relations are represented as nodes and edges in the graphs, respectively. Previous biomedical knowledge extraction methods simply considered l...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9761970/ https://www.ncbi.nlm.nih.gov/pubmed/36536280 http://dx.doi.org/10.1186/s12859-022-05096-w |
_version_ | 1784852776367423488 |
---|---|
author | He, Kai Mao, Rui Gong, Tieliang Cambria, Erik Li, Chen |
author_facet | He, Kai Mao, Rui Gong, Tieliang Cambria, Erik Li, Chen |
author_sort | He, Kai |
collection | PubMed |
description | Extracting knowledge from heterogeneous data sources is fundamental for the construction of structured biomedical knowledge graphs (BKGs), where entities and relations are represented as nodes and edges in the graphs, respectively. Previous biomedical knowledge extraction methods simply considered limited entity types and relations by using a task-specific training set, which is insufficient for large-scale BKGs development and downstream task applications in different scenarios. To alleviate this issue, we propose a joint continual learning biomedical information extraction (JCBIE) network to extract entities and relations from different biomedical information datasets. By empirically studying different joint learning and continual learning strategies, the proposed JCBIE can learn and expand different types of entities and relations from different datasets. JCBIE uses two separated encoders in joint-feature extraction, hence can effectively avoid the feature confusion problem comparing with using one hard-parameter sharing encoder. Specifically, it allows us to adopt entity augmented inputs to establish the interaction between named entity recognition and relation extraction. Finally, a novel evaluation mechanism is proposed for measuring cross-corpus generalization errors, which was ignored by traditional evaluation methods. Our empirical studies show that JCBIE achieves promising performance when continual learning strategy is adopted with multiple corpora. |
format | Online Article Text |
id | pubmed-9761970 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-97619702022-12-20 JCBIE: a joint continual learning neural network for biomedical information extraction He, Kai Mao, Rui Gong, Tieliang Cambria, Erik Li, Chen BMC Bioinformatics Research Extracting knowledge from heterogeneous data sources is fundamental for the construction of structured biomedical knowledge graphs (BKGs), where entities and relations are represented as nodes and edges in the graphs, respectively. Previous biomedical knowledge extraction methods simply considered limited entity types and relations by using a task-specific training set, which is insufficient for large-scale BKGs development and downstream task applications in different scenarios. To alleviate this issue, we propose a joint continual learning biomedical information extraction (JCBIE) network to extract entities and relations from different biomedical information datasets. By empirically studying different joint learning and continual learning strategies, the proposed JCBIE can learn and expand different types of entities and relations from different datasets. JCBIE uses two separated encoders in joint-feature extraction, hence can effectively avoid the feature confusion problem comparing with using one hard-parameter sharing encoder. Specifically, it allows us to adopt entity augmented inputs to establish the interaction between named entity recognition and relation extraction. Finally, a novel evaluation mechanism is proposed for measuring cross-corpus generalization errors, which was ignored by traditional evaluation methods. Our empirical studies show that JCBIE achieves promising performance when continual learning strategy is adopted with multiple corpora. BioMed Central 2022-12-19 /pmc/articles/PMC9761970/ /pubmed/36536280 http://dx.doi.org/10.1186/s12859-022-05096-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research He, Kai Mao, Rui Gong, Tieliang Cambria, Erik Li, Chen JCBIE: a joint continual learning neural network for biomedical information extraction |
title | JCBIE: a joint continual learning neural network for biomedical information extraction |
title_full | JCBIE: a joint continual learning neural network for biomedical information extraction |
title_fullStr | JCBIE: a joint continual learning neural network for biomedical information extraction |
title_full_unstemmed | JCBIE: a joint continual learning neural network for biomedical information extraction |
title_short | JCBIE: a joint continual learning neural network for biomedical information extraction |
title_sort | jcbie: a joint continual learning neural network for biomedical information extraction |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9761970/ https://www.ncbi.nlm.nih.gov/pubmed/36536280 http://dx.doi.org/10.1186/s12859-022-05096-w |
work_keys_str_mv | AT hekai jcbieajointcontinuallearningneuralnetworkforbiomedicalinformationextraction AT maorui jcbieajointcontinuallearningneuralnetworkforbiomedicalinformationextraction AT gongtieliang jcbieajointcontinuallearningneuralnetworkforbiomedicalinformationextraction AT cambriaerik jcbieajointcontinuallearningneuralnetworkforbiomedicalinformationextraction AT lichen jcbieajointcontinuallearningneuralnetworkforbiomedicalinformationextraction |