Cargando…

Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning

The increasing amount of scientific literature in biological and biomedical science research has created a challenge in continuous and reliable curation of the latest knowledge discovered, and automatic biomedical text-mining has been one of the answers to this challenge. In this paper, we aim to fu...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Haohan, Liu, Xiang, Tao, Yifeng, Ye, Wenting, Jin, Qiao, Cohen, William W., Xing, Eric P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6417822/
https://www.ncbi.nlm.nih.gov/pubmed/30864315
_version_ 1783403628609929216
author Wang, Haohan
Liu, Xiang
Tao, Yifeng
Ye, Wenting
Jin, Qiao
Cohen, William W.
Xing, Eric P.
author_facet Wang, Haohan
Liu, Xiang
Tao, Yifeng
Ye, Wenting
Jin, Qiao
Cohen, William W.
Xing, Eric P.
author_sort Wang, Haohan
collection PubMed
description The increasing amount of scientific literature in biological and biomedical science research has created a challenge in continuous and reliable curation of the latest knowledge discovered, and automatic biomedical text-mining has been one of the answers to this challenge. In this paper, we aim to further improve the reliability of biomedical text-mining by training the system to directly simulate the human behaviors such as querying the PubMed, selecting articles from queried results, and reading selected articles for knowledge. We take advantage of the efficiency of biomedical text-mining, the flexibility of deep reinforcement learning, and the massive amount of knowledge collected in UMLS into an integrative artificial intelligent reader that can automatically identify the authentic articles and effectively acquire the knowledge conveyed in the articles. We construct a system, whose current primary task is to build the genetic association database between genes and complex traits of human. Our contributions in this paper are three-fold: 1) We propose to improve the reliability of text-mining by building a system that can directly simulate the behavior of a researcher, and we develop corresponding methods, such as Bi-directional LSTM for text mining and Deep Q-Network for organizing behaviors. 2) We demonstrate the effectiveness of our system with an example in constructing a genetic association database. 3) We release our implementation as a generic framework for researchers in the community to conveniently construct other databases.
format Online
Article
Text
id pubmed-6417822
institution National Center for Biotechnology Information
language English
publishDate 2019
record_format MEDLINE/PubMed
spelling pubmed-64178222019-03-14 Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning Wang, Haohan Liu, Xiang Tao, Yifeng Ye, Wenting Jin, Qiao Cohen, William W. Xing, Eric P. Pac Symp Biocomput Article The increasing amount of scientific literature in biological and biomedical science research has created a challenge in continuous and reliable curation of the latest knowledge discovered, and automatic biomedical text-mining has been one of the answers to this challenge. In this paper, we aim to further improve the reliability of biomedical text-mining by training the system to directly simulate the human behaviors such as querying the PubMed, selecting articles from queried results, and reading selected articles for knowledge. We take advantage of the efficiency of biomedical text-mining, the flexibility of deep reinforcement learning, and the massive amount of knowledge collected in UMLS into an integrative artificial intelligent reader that can automatically identify the authentic articles and effectively acquire the knowledge conveyed in the articles. We construct a system, whose current primary task is to build the genetic association database between genes and complex traits of human. Our contributions in this paper are three-fold: 1) We propose to improve the reliability of text-mining by building a system that can directly simulate the behavior of a researcher, and we develop corresponding methods, such as Bi-directional LSTM for text mining and Deep Q-Network for organizing behaviors. 2) We demonstrate the effectiveness of our system with an example in constructing a genetic association database. 3) We release our implementation as a generic framework for researchers in the community to conveniently construct other databases. 2019 /pmc/articles/PMC6417822/ /pubmed/30864315 Text en http://creativecommons.org/licenses/by/4.0/ Open Access chapter published by World Scientific Publishing Company and distributed under the terms of the Creative Commons Attribution Non-Commercial (CC BY-NC) 4.0 License.
spellingShingle Article
Wang, Haohan
Liu, Xiang
Tao, Yifeng
Ye, Wenting
Jin, Qiao
Cohen, William W.
Xing, Eric P.
Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning
title Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning
title_full Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning
title_fullStr Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning
title_full_unstemmed Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning
title_short Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning
title_sort automatic human-like mining and constructing reliable genetic association database with deep reinforcement learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6417822/
https://www.ncbi.nlm.nih.gov/pubmed/30864315
work_keys_str_mv AT wanghaohan automatichumanlikeminingandconstructingreliablegeneticassociationdatabasewithdeepreinforcementlearning
AT liuxiang automatichumanlikeminingandconstructingreliablegeneticassociationdatabasewithdeepreinforcementlearning
AT taoyifeng automatichumanlikeminingandconstructingreliablegeneticassociationdatabasewithdeepreinforcementlearning
AT yewenting automatichumanlikeminingandconstructingreliablegeneticassociationdatabasewithdeepreinforcementlearning
AT jinqiao automatichumanlikeminingandconstructingreliablegeneticassociationdatabasewithdeepreinforcementlearning
AT cohenwilliamw automatichumanlikeminingandconstructingreliablegeneticassociationdatabasewithdeepreinforcementlearning
AT xingericp automatichumanlikeminingandconstructingreliablegeneticassociationdatabasewithdeepreinforcementlearning