Cargando…

Multitask learning for biomedical named entity recognition with cross-sharing structure

BACKGROUND: Biomedical named entity recognition (BioNER) is a fundamental and essential task for biomedical literature mining, which affects the performance of downstream tasks. Most BioNER models rely on domain-specific features or hand-crafted rules, but extracting features from massive data requi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Xi, Lyu, Jiagao, Dong, Li, Xu, Ke
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2019
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6697996/ https://www.ncbi.nlm.nih.gov/pubmed/31419937 http://dx.doi.org/10.1186/s12859-019-3000-5

_version_	1783444470483648512
author	Wang, Xi Lyu, Jiagao Dong, Li Xu, Ke
author_facet	Wang, Xi Lyu, Jiagao Dong, Li Xu, Ke
author_sort	Wang, Xi
collection	PubMed
description	BACKGROUND: Biomedical named entity recognition (BioNER) is a fundamental and essential task for biomedical literature mining, which affects the performance of downstream tasks. Most BioNER models rely on domain-specific features or hand-crafted rules, but extracting features from massive data requires much time and human efforts. To solve this, neural network models are used to automatically learn features. Recently, multi-task learning has been applied successfully to neural network models of biomedical literature mining. For BioNER models, using multi-task learning makes use of features from multiple datasets and improves the performance of models. RESULTS: In experiments, we compared our proposed model with other multi-task models and found our model outperformed the others on datasets of gene, protein, disease categories. We also tested the performance of different dataset pairs to find out the best partners of datasets. Besides, we explored and analyzed the influence of different entity types by using sub-datasets. When dataset size was reduced, our model still produced positive results. CONCLUSION: We propose a novel multi-task model for BioNER with the cross-sharing structure to improve the performance of multi-task models. The cross-sharing structure in our model makes use of features from both datasets in the training procedure. Detailed analysis about best partners of datasets and influence between entity categories can provide guidance of choosing proper dataset pairs for multi-task training. Our implementation is available at https://github.com/JogleLew/bioner-cross-sharing.
format	Online Article Text
id	pubmed-6697996
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-66979962019-08-19 Multitask learning for biomedical named entity recognition with cross-sharing structure Wang, Xi Lyu, Jiagao Dong, Li Xu, Ke BMC Bioinformatics Research Article BACKGROUND: Biomedical named entity recognition (BioNER) is a fundamental and essential task for biomedical literature mining, which affects the performance of downstream tasks. Most BioNER models rely on domain-specific features or hand-crafted rules, but extracting features from massive data requires much time and human efforts. To solve this, neural network models are used to automatically learn features. Recently, multi-task learning has been applied successfully to neural network models of biomedical literature mining. For BioNER models, using multi-task learning makes use of features from multiple datasets and improves the performance of models. RESULTS: In experiments, we compared our proposed model with other multi-task models and found our model outperformed the others on datasets of gene, protein, disease categories. We also tested the performance of different dataset pairs to find out the best partners of datasets. Besides, we explored and analyzed the influence of different entity types by using sub-datasets. When dataset size was reduced, our model still produced positive results. CONCLUSION: We propose a novel multi-task model for BioNER with the cross-sharing structure to improve the performance of multi-task models. The cross-sharing structure in our model makes use of features from both datasets in the training procedure. Detailed analysis about best partners of datasets and influence between entity categories can provide guidance of choosing proper dataset pairs for multi-task training. Our implementation is available at https://github.com/JogleLew/bioner-cross-sharing. BioMed Central 2019-08-16 /pmc/articles/PMC6697996/ /pubmed/31419937 http://dx.doi.org/10.1186/s12859-019-3000-5 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Wang, Xi Lyu, Jiagao Dong, Li Xu, Ke Multitask learning for biomedical named entity recognition with cross-sharing structure
title	Multitask learning for biomedical named entity recognition with cross-sharing structure
title_full	Multitask learning for biomedical named entity recognition with cross-sharing structure
title_fullStr	Multitask learning for biomedical named entity recognition with cross-sharing structure
title_full_unstemmed	Multitask learning for biomedical named entity recognition with cross-sharing structure
title_short	Multitask learning for biomedical named entity recognition with cross-sharing structure
title_sort	multitask learning for biomedical named entity recognition with cross-sharing structure
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6697996/ https://www.ncbi.nlm.nih.gov/pubmed/31419937 http://dx.doi.org/10.1186/s12859-019-3000-5
work_keys_str_mv	AT wangxi multitasklearningforbiomedicalnamedentityrecognitionwithcrosssharingstructure AT lyujiagao multitasklearningforbiomedicalnamedentityrecognitionwithcrosssharingstructure AT dongli multitasklearningforbiomedicalnamedentityrecognitionwithcrosssharingstructure AT xuke multitasklearningforbiomedicalnamedentityrecognitionwithcrosssharingstructure

Multitask learning for biomedical named entity recognition with cross-sharing structure

Ejemplares similares