Cargando…

A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning

The automatic extraction of meaningful relations from biomedical literature or clinical records is crucial in various biomedical applications. Most of the current deep learning approaches for medical relation extraction require large-scale training data to prevent overfitting of the training model....

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Tao, Wu, Mingfen, Li, Hexi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6892305/
https://www.ncbi.nlm.nih.gov/pubmed/31800044
http://dx.doi.org/10.1093/database/baz116
_version_ 1783476004875927552
author Chen, Tao
Wu, Mingfen
Li, Hexi
author_facet Chen, Tao
Wu, Mingfen
Li, Hexi
author_sort Chen, Tao
collection PubMed
description The automatic extraction of meaningful relations from biomedical literature or clinical records is crucial in various biomedical applications. Most of the current deep learning approaches for medical relation extraction require large-scale training data to prevent overfitting of the training model. We propose using a pre-trained model and a fine-tuning technique to improve these approaches without additional time-consuming human labeling. Firstly, we show the architecture of Bidirectional Encoder Representations from Transformers (BERT), an approach for pre-training a model on large-scale unstructured text. We then combine BERT with a one-dimensional convolutional neural network (1d-CNN) to fine-tune the pre-trained model for relation extraction. Extensive experiments on three datasets, namely the BioCreative V chemical disease relation corpus, traditional Chinese medicine literature corpus and i2b2 2012 temporal relation challenge corpus, show that the proposed approach achieves state-of-the-art results (giving a relative improvement of 22.2, 7.77, and 38.5% in F1 score, respectively, compared with a traditional 1d-CNN classifier). The source code is available at https://github.com/chentao1999/MedicalRelationExtraction.
format Online
Article
Text
id pubmed-6892305
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-68923052019-12-10 A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning Chen, Tao Wu, Mingfen Li, Hexi Database (Oxford) Original Article The automatic extraction of meaningful relations from biomedical literature or clinical records is crucial in various biomedical applications. Most of the current deep learning approaches for medical relation extraction require large-scale training data to prevent overfitting of the training model. We propose using a pre-trained model and a fine-tuning technique to improve these approaches without additional time-consuming human labeling. Firstly, we show the architecture of Bidirectional Encoder Representations from Transformers (BERT), an approach for pre-training a model on large-scale unstructured text. We then combine BERT with a one-dimensional convolutional neural network (1d-CNN) to fine-tune the pre-trained model for relation extraction. Extensive experiments on three datasets, namely the BioCreative V chemical disease relation corpus, traditional Chinese medicine literature corpus and i2b2 2012 temporal relation challenge corpus, show that the proposed approach achieves state-of-the-art results (giving a relative improvement of 22.2, 7.77, and 38.5% in F1 score, respectively, compared with a traditional 1d-CNN classifier). The source code is available at https://github.com/chentao1999/MedicalRelationExtraction. Oxford University Press 2019-12-04 /pmc/articles/PMC6892305/ /pubmed/31800044 http://dx.doi.org/10.1093/database/baz116 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Chen, Tao
Wu, Mingfen
Li, Hexi
A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning
title A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning
title_full A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning
title_fullStr A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning
title_full_unstemmed A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning
title_short A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning
title_sort general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6892305/
https://www.ncbi.nlm.nih.gov/pubmed/31800044
http://dx.doi.org/10.1093/database/baz116
work_keys_str_mv AT chentao ageneralapproachforimprovingdeeplearningbasedmedicalrelationextractionusingapretrainedmodelandfinetuning
AT wumingfen ageneralapproachforimprovingdeeplearningbasedmedicalrelationextractionusingapretrainedmodelandfinetuning
AT lihexi ageneralapproachforimprovingdeeplearningbasedmedicalrelationextractionusingapretrainedmodelandfinetuning
AT chentao generalapproachforimprovingdeeplearningbasedmedicalrelationextractionusingapretrainedmodelandfinetuning
AT wumingfen generalapproachforimprovingdeeplearningbasedmedicalrelationextractionusingapretrainedmodelandfinetuning
AT lihexi generalapproachforimprovingdeeplearningbasedmedicalrelationextractionusingapretrainedmodelandfinetuning