Cargando…

Network-based analysis reveals distinct association patterns in a semantic MEDLINE-based drug-disease-gene network

BACKGROUND: A huge amount of associations among different biological entities (e.g., disease, drug, and gene) are scattered in millions of biomedical articles. Systematic analysis of such heterogeneous data can infer novel associations among different biological entities in the context of personaliz...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yuji, Tao, Cui, Jiang, Guoqian, Nair, Asha A, Su, Jian, Chute, Christopher G, Liu, Hongfang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4137727/
https://www.ncbi.nlm.nih.gov/pubmed/25170419
http://dx.doi.org/10.1186/2041-1480-5-33
_version_ 1782331153121804288
author Zhang, Yuji
Tao, Cui
Jiang, Guoqian
Nair, Asha A
Su, Jian
Chute, Christopher G
Liu, Hongfang
author_facet Zhang, Yuji
Tao, Cui
Jiang, Guoqian
Nair, Asha A
Su, Jian
Chute, Christopher G
Liu, Hongfang
author_sort Zhang, Yuji
collection PubMed
description BACKGROUND: A huge amount of associations among different biological entities (e.g., disease, drug, and gene) are scattered in millions of biomedical articles. Systematic analysis of such heterogeneous data can infer novel associations among different biological entities in the context of personalized medicine and translational research. Recently, network-based computational approaches have gained popularity in investigating such heterogeneous data, proposing novel therapeutic targets and deciphering disease mechanisms. However, little effort has been devoted to investigating associations among drugs, diseases, and genes in an integrative manner. RESULTS: We propose a novel network-based computational framework to identify statistically over-expressed subnetwork patterns, called network motifs, in an integrated disease-drug-gene network extracted from Semantic MEDLINE. The framework consists of two steps. The first step is to construct an association network by extracting pair-wise associations between diseases, drugs and genes in Semantic MEDLINE using a domain pattern driven strategy. A Resource Description Framework (RDF)-linked data approach is used to re-organize the data to increase the flexibility of data integration, the interoperability within domain ontologies, and the efficiency of data storage. Unique associations among drugs, diseases, and genes are extracted for downstream network-based analysis. The second step is to apply a network-based approach to mine the local network structure of this heterogeneous network. Significant network motifs are then identified as the backbone of the network. A simplified network based on those significant motifs is then constructed to facilitate discovery. We implemented our computational framework and identified five network motifs, each of which corresponds to specific biological meanings. Three case studies demonstrate that novel associations are derived from the network topology analysis of reconstructed networks of significant network motifs, further validated by expert knowledge and functional enrichment analyses. CONCLUSIONS: We have developed a novel network-based computational approach to investigate the heterogeneous drug-gene-disease network extracted from Semantic MEDLINE. We demonstrate the power of this approach by prioritizing candidate disease genes, inferring potential disease relationships, and proposing novel drug targets, within the context of the entire knowledge. The results indicate that such approach will facilitate the formulization of novel research hypotheses, which is critical for translational medicine research and personalized medicine.
format Online
Article
Text
id pubmed-4137727
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41377272014-08-28 Network-based analysis reveals distinct association patterns in a semantic MEDLINE-based drug-disease-gene network Zhang, Yuji Tao, Cui Jiang, Guoqian Nair, Asha A Su, Jian Chute, Christopher G Liu, Hongfang J Biomed Semantics Research BACKGROUND: A huge amount of associations among different biological entities (e.g., disease, drug, and gene) are scattered in millions of biomedical articles. Systematic analysis of such heterogeneous data can infer novel associations among different biological entities in the context of personalized medicine and translational research. Recently, network-based computational approaches have gained popularity in investigating such heterogeneous data, proposing novel therapeutic targets and deciphering disease mechanisms. However, little effort has been devoted to investigating associations among drugs, diseases, and genes in an integrative manner. RESULTS: We propose a novel network-based computational framework to identify statistically over-expressed subnetwork patterns, called network motifs, in an integrated disease-drug-gene network extracted from Semantic MEDLINE. The framework consists of two steps. The first step is to construct an association network by extracting pair-wise associations between diseases, drugs and genes in Semantic MEDLINE using a domain pattern driven strategy. A Resource Description Framework (RDF)-linked data approach is used to re-organize the data to increase the flexibility of data integration, the interoperability within domain ontologies, and the efficiency of data storage. Unique associations among drugs, diseases, and genes are extracted for downstream network-based analysis. The second step is to apply a network-based approach to mine the local network structure of this heterogeneous network. Significant network motifs are then identified as the backbone of the network. A simplified network based on those significant motifs is then constructed to facilitate discovery. We implemented our computational framework and identified five network motifs, each of which corresponds to specific biological meanings. Three case studies demonstrate that novel associations are derived from the network topology analysis of reconstructed networks of significant network motifs, further validated by expert knowledge and functional enrichment analyses. CONCLUSIONS: We have developed a novel network-based computational approach to investigate the heterogeneous drug-gene-disease network extracted from Semantic MEDLINE. We demonstrate the power of this approach by prioritizing candidate disease genes, inferring potential disease relationships, and proposing novel drug targets, within the context of the entire knowledge. The results indicate that such approach will facilitate the formulization of novel research hypotheses, which is critical for translational medicine research and personalized medicine. BioMed Central 2014-08-06 /pmc/articles/PMC4137727/ /pubmed/25170419 http://dx.doi.org/10.1186/2041-1480-5-33 Text en Copyright © 2014 Zhang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle Research
Zhang, Yuji
Tao, Cui
Jiang, Guoqian
Nair, Asha A
Su, Jian
Chute, Christopher G
Liu, Hongfang
Network-based analysis reveals distinct association patterns in a semantic MEDLINE-based drug-disease-gene network
title Network-based analysis reveals distinct association patterns in a semantic MEDLINE-based drug-disease-gene network
title_full Network-based analysis reveals distinct association patterns in a semantic MEDLINE-based drug-disease-gene network
title_fullStr Network-based analysis reveals distinct association patterns in a semantic MEDLINE-based drug-disease-gene network
title_full_unstemmed Network-based analysis reveals distinct association patterns in a semantic MEDLINE-based drug-disease-gene network
title_short Network-based analysis reveals distinct association patterns in a semantic MEDLINE-based drug-disease-gene network
title_sort network-based analysis reveals distinct association patterns in a semantic medline-based drug-disease-gene network
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4137727/
https://www.ncbi.nlm.nih.gov/pubmed/25170419
http://dx.doi.org/10.1186/2041-1480-5-33
work_keys_str_mv AT zhangyuji networkbasedanalysisrevealsdistinctassociationpatternsinasemanticmedlinebaseddrugdiseasegenenetwork
AT taocui networkbasedanalysisrevealsdistinctassociationpatternsinasemanticmedlinebaseddrugdiseasegenenetwork
AT jiangguoqian networkbasedanalysisrevealsdistinctassociationpatternsinasemanticmedlinebaseddrugdiseasegenenetwork
AT nairashaa networkbasedanalysisrevealsdistinctassociationpatternsinasemanticmedlinebaseddrugdiseasegenenetwork
AT sujian networkbasedanalysisrevealsdistinctassociationpatternsinasemanticmedlinebaseddrugdiseasegenenetwork
AT chutechristopherg networkbasedanalysisrevealsdistinctassociationpatternsinasemanticmedlinebaseddrugdiseasegenenetwork
AT liuhongfang networkbasedanalysisrevealsdistinctassociationpatternsinasemanticmedlinebaseddrugdiseasegenenetwork