Cargando…

Chemical-induced disease relation extraction via attention-based distant supervision

BACKGROUND: Automatically understanding chemical-disease relations (CDRs) is crucial in various areas of biomedical research and health care. Supervised machine learning provides a feasible solution to automatically extract relations between biomedical entities from scientific literature, its succes...

Descripción completa

Detalles Bibliográficos
Autores principales: Gu, Jinghang, Sun, Fuqing, Qian, Longhua, Zhou, Guodong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6647285/
https://www.ncbi.nlm.nih.gov/pubmed/31331263
http://dx.doi.org/10.1186/s12859-019-2884-4
_version_ 1783437698414936064
author Gu, Jinghang
Sun, Fuqing
Qian, Longhua
Zhou, Guodong
author_facet Gu, Jinghang
Sun, Fuqing
Qian, Longhua
Zhou, Guodong
author_sort Gu, Jinghang
collection PubMed
description BACKGROUND: Automatically understanding chemical-disease relations (CDRs) is crucial in various areas of biomedical research and health care. Supervised machine learning provides a feasible solution to automatically extract relations between biomedical entities from scientific literature, its success, however, heavily depends on large-scale biomedical corpora manually annotated with intensive labor and tremendous investment. RESULTS: We present an attention-based distant supervision paradigm for the BioCreative-V CDR extraction task. Training examples at both intra- and inter-sentence levels are generated automatically from the Comparative Toxicogenomics Database (CTD) without any human intervention. An attention-based neural network and a stacked auto-encoder network are applied respectively to induce learning models and extract relations at both levels. After merging the results of both levels, the document-level CDRs can be finally extracted. It achieves the precision/recall/F1-score of 60.3%/73.8%/66.4%, outperforming the state-of-the-art supervised learning systems without using any annotated corpus. CONCLUSION: Our experiments demonstrate that distant supervision is promising for extracting chemical disease relations from biomedical literature, and capturing both local and global attention features simultaneously is effective in attention-based distantly supervised learning.
format Online
Article
Text
id pubmed-6647285
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-66472852019-07-31 Chemical-induced disease relation extraction via attention-based distant supervision Gu, Jinghang Sun, Fuqing Qian, Longhua Zhou, Guodong BMC Bioinformatics Research Article BACKGROUND: Automatically understanding chemical-disease relations (CDRs) is crucial in various areas of biomedical research and health care. Supervised machine learning provides a feasible solution to automatically extract relations between biomedical entities from scientific literature, its success, however, heavily depends on large-scale biomedical corpora manually annotated with intensive labor and tremendous investment. RESULTS: We present an attention-based distant supervision paradigm for the BioCreative-V CDR extraction task. Training examples at both intra- and inter-sentence levels are generated automatically from the Comparative Toxicogenomics Database (CTD) without any human intervention. An attention-based neural network and a stacked auto-encoder network are applied respectively to induce learning models and extract relations at both levels. After merging the results of both levels, the document-level CDRs can be finally extracted. It achieves the precision/recall/F1-score of 60.3%/73.8%/66.4%, outperforming the state-of-the-art supervised learning systems without using any annotated corpus. CONCLUSION: Our experiments demonstrate that distant supervision is promising for extracting chemical disease relations from biomedical literature, and capturing both local and global attention features simultaneously is effective in attention-based distantly supervised learning. BioMed Central 2019-07-22 /pmc/articles/PMC6647285/ /pubmed/31331263 http://dx.doi.org/10.1186/s12859-019-2884-4 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Gu, Jinghang
Sun, Fuqing
Qian, Longhua
Zhou, Guodong
Chemical-induced disease relation extraction via attention-based distant supervision
title Chemical-induced disease relation extraction via attention-based distant supervision
title_full Chemical-induced disease relation extraction via attention-based distant supervision
title_fullStr Chemical-induced disease relation extraction via attention-based distant supervision
title_full_unstemmed Chemical-induced disease relation extraction via attention-based distant supervision
title_short Chemical-induced disease relation extraction via attention-based distant supervision
title_sort chemical-induced disease relation extraction via attention-based distant supervision
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6647285/
https://www.ncbi.nlm.nih.gov/pubmed/31331263
http://dx.doi.org/10.1186/s12859-019-2884-4
work_keys_str_mv AT gujinghang chemicalinduceddiseaserelationextractionviaattentionbaseddistantsupervision
AT sunfuqing chemicalinduceddiseaserelationextractionviaattentionbaseddistantsupervision
AT qianlonghua chemicalinduceddiseaserelationextractionviaattentionbaseddistantsupervision
AT zhouguodong chemicalinduceddiseaserelationextractionviaattentionbaseddistantsupervision