Cargando…
A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning
The automatic extraction of meaningful relations from biomedical literature or clinical records is crucial in various biomedical applications. Most of the current deep learning approaches for medical relation extraction require large-scale training data to prevent overfitting of the training model....
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6892305/ https://www.ncbi.nlm.nih.gov/pubmed/31800044 http://dx.doi.org/10.1093/database/baz116 |
_version_ | 1783476004875927552 |
---|---|
author | Chen, Tao Wu, Mingfen Li, Hexi |
author_facet | Chen, Tao Wu, Mingfen Li, Hexi |
author_sort | Chen, Tao |
collection | PubMed |
description | The automatic extraction of meaningful relations from biomedical literature or clinical records is crucial in various biomedical applications. Most of the current deep learning approaches for medical relation extraction require large-scale training data to prevent overfitting of the training model. We propose using a pre-trained model and a fine-tuning technique to improve these approaches without additional time-consuming human labeling. Firstly, we show the architecture of Bidirectional Encoder Representations from Transformers (BERT), an approach for pre-training a model on large-scale unstructured text. We then combine BERT with a one-dimensional convolutional neural network (1d-CNN) to fine-tune the pre-trained model for relation extraction. Extensive experiments on three datasets, namely the BioCreative V chemical disease relation corpus, traditional Chinese medicine literature corpus and i2b2 2012 temporal relation challenge corpus, show that the proposed approach achieves state-of-the-art results (giving a relative improvement of 22.2, 7.77, and 38.5% in F1 score, respectively, compared with a traditional 1d-CNN classifier). The source code is available at https://github.com/chentao1999/MedicalRelationExtraction. |
format | Online Article Text |
id | pubmed-6892305 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-68923052019-12-10 A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning Chen, Tao Wu, Mingfen Li, Hexi Database (Oxford) Original Article The automatic extraction of meaningful relations from biomedical literature or clinical records is crucial in various biomedical applications. Most of the current deep learning approaches for medical relation extraction require large-scale training data to prevent overfitting of the training model. We propose using a pre-trained model and a fine-tuning technique to improve these approaches without additional time-consuming human labeling. Firstly, we show the architecture of Bidirectional Encoder Representations from Transformers (BERT), an approach for pre-training a model on large-scale unstructured text. We then combine BERT with a one-dimensional convolutional neural network (1d-CNN) to fine-tune the pre-trained model for relation extraction. Extensive experiments on three datasets, namely the BioCreative V chemical disease relation corpus, traditional Chinese medicine literature corpus and i2b2 2012 temporal relation challenge corpus, show that the proposed approach achieves state-of-the-art results (giving a relative improvement of 22.2, 7.77, and 38.5% in F1 score, respectively, compared with a traditional 1d-CNN classifier). The source code is available at https://github.com/chentao1999/MedicalRelationExtraction. Oxford University Press 2019-12-04 /pmc/articles/PMC6892305/ /pubmed/31800044 http://dx.doi.org/10.1093/database/baz116 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Chen, Tao Wu, Mingfen Li, Hexi A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning |
title | A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning |
title_full | A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning |
title_fullStr | A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning |
title_full_unstemmed | A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning |
title_short | A general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning |
title_sort | general approach for improving deep learning-based medical relation extraction using a pre-trained model and fine-tuning |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6892305/ https://www.ncbi.nlm.nih.gov/pubmed/31800044 http://dx.doi.org/10.1093/database/baz116 |
work_keys_str_mv | AT chentao ageneralapproachforimprovingdeeplearningbasedmedicalrelationextractionusingapretrainedmodelandfinetuning AT wumingfen ageneralapproachforimprovingdeeplearningbasedmedicalrelationextractionusingapretrainedmodelandfinetuning AT lihexi ageneralapproachforimprovingdeeplearningbasedmedicalrelationextractionusingapretrainedmodelandfinetuning AT chentao generalapproachforimprovingdeeplearningbasedmedicalrelationextractionusingapretrainedmodelandfinetuning AT wumingfen generalapproachforimprovingdeeplearningbasedmedicalrelationextractionusingapretrainedmodelandfinetuning AT lihexi generalapproachforimprovingdeeplearningbasedmedicalrelationextractionusingapretrainedmodelandfinetuning |