Cargando…

JCBIE: a joint continual learning neural network for biomedical information extraction

Extracting knowledge from heterogeneous data sources is fundamental for the construction of structured biomedical knowledge graphs (BKGs), where entities and relations are represented as nodes and edges in the graphs, respectively. Previous biomedical knowledge extraction methods simply considered l...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Kai, Mao, Rui, Gong, Tieliang, Cambria, Erik, Li, Chen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9761970/
https://www.ncbi.nlm.nih.gov/pubmed/36536280
http://dx.doi.org/10.1186/s12859-022-05096-w
_version_ 1784852776367423488
author He, Kai
Mao, Rui
Gong, Tieliang
Cambria, Erik
Li, Chen
author_facet He, Kai
Mao, Rui
Gong, Tieliang
Cambria, Erik
Li, Chen
author_sort He, Kai
collection PubMed
description Extracting knowledge from heterogeneous data sources is fundamental for the construction of structured biomedical knowledge graphs (BKGs), where entities and relations are represented as nodes and edges in the graphs, respectively. Previous biomedical knowledge extraction methods simply considered limited entity types and relations by using a task-specific training set, which is insufficient for large-scale BKGs development and downstream task applications in different scenarios. To alleviate this issue, we propose a joint continual learning biomedical information extraction (JCBIE) network to extract entities and relations from different biomedical information datasets. By empirically studying different joint learning and continual learning strategies, the proposed JCBIE can learn and expand different types of entities and relations from different datasets. JCBIE uses two separated encoders in joint-feature extraction, hence can effectively avoid the feature confusion problem comparing with using one hard-parameter sharing encoder. Specifically, it allows us to adopt entity augmented inputs to establish the interaction between named entity recognition and relation extraction. Finally, a novel evaluation mechanism is proposed for measuring cross-corpus generalization errors, which was ignored by traditional evaluation methods. Our empirical studies show that JCBIE achieves promising performance when continual learning strategy is adopted with multiple corpora.
format Online
Article
Text
id pubmed-9761970
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-97619702022-12-20 JCBIE: a joint continual learning neural network for biomedical information extraction He, Kai Mao, Rui Gong, Tieliang Cambria, Erik Li, Chen BMC Bioinformatics Research Extracting knowledge from heterogeneous data sources is fundamental for the construction of structured biomedical knowledge graphs (BKGs), where entities and relations are represented as nodes and edges in the graphs, respectively. Previous biomedical knowledge extraction methods simply considered limited entity types and relations by using a task-specific training set, which is insufficient for large-scale BKGs development and downstream task applications in different scenarios. To alleviate this issue, we propose a joint continual learning biomedical information extraction (JCBIE) network to extract entities and relations from different biomedical information datasets. By empirically studying different joint learning and continual learning strategies, the proposed JCBIE can learn and expand different types of entities and relations from different datasets. JCBIE uses two separated encoders in joint-feature extraction, hence can effectively avoid the feature confusion problem comparing with using one hard-parameter sharing encoder. Specifically, it allows us to adopt entity augmented inputs to establish the interaction between named entity recognition and relation extraction. Finally, a novel evaluation mechanism is proposed for measuring cross-corpus generalization errors, which was ignored by traditional evaluation methods. Our empirical studies show that JCBIE achieves promising performance when continual learning strategy is adopted with multiple corpora. BioMed Central 2022-12-19 /pmc/articles/PMC9761970/ /pubmed/36536280 http://dx.doi.org/10.1186/s12859-022-05096-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
He, Kai
Mao, Rui
Gong, Tieliang
Cambria, Erik
Li, Chen
JCBIE: a joint continual learning neural network for biomedical information extraction
title JCBIE: a joint continual learning neural network for biomedical information extraction
title_full JCBIE: a joint continual learning neural network for biomedical information extraction
title_fullStr JCBIE: a joint continual learning neural network for biomedical information extraction
title_full_unstemmed JCBIE: a joint continual learning neural network for biomedical information extraction
title_short JCBIE: a joint continual learning neural network for biomedical information extraction
title_sort jcbie: a joint continual learning neural network for biomedical information extraction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9761970/
https://www.ncbi.nlm.nih.gov/pubmed/36536280
http://dx.doi.org/10.1186/s12859-022-05096-w
work_keys_str_mv AT hekai jcbieajointcontinuallearningneuralnetworkforbiomedicalinformationextraction
AT maorui jcbieajointcontinuallearningneuralnetworkforbiomedicalinformationextraction
AT gongtieliang jcbieajointcontinuallearningneuralnetworkforbiomedicalinformationextraction
AT cambriaerik jcbieajointcontinuallearningneuralnetworkforbiomedicalinformationextraction
AT lichen jcbieajointcontinuallearningneuralnetworkforbiomedicalinformationextraction