Cargando…

An integrative modular approach to systematically predict gene-phenotype associations

BACKGROUND: Complex human diseases are often caused by multiple mutations, each of which contributes only a minor effect to the disease phenotype. To study the basis for these complex phenotypes, we developed a network-based approach to identify coexpression modules specifically activated in particu...

Descripción completa

Detalles Bibliográficos
Autores principales: Mehan, Michael R, Nunez-Iglesias, Juan, Dai, Chao, Waterman, Michael S, Zhou, Xianghong Jasmine
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3009536/
https://www.ncbi.nlm.nih.gov/pubmed/20122238
http://dx.doi.org/10.1186/1471-2105-11-S1-S62
_version_ 1782194701595574272
author Mehan, Michael R
Nunez-Iglesias, Juan
Dai, Chao
Waterman, Michael S
Zhou, Xianghong Jasmine
author_facet Mehan, Michael R
Nunez-Iglesias, Juan
Dai, Chao
Waterman, Michael S
Zhou, Xianghong Jasmine
author_sort Mehan, Michael R
collection PubMed
description BACKGROUND: Complex human diseases are often caused by multiple mutations, each of which contributes only a minor effect to the disease phenotype. To study the basis for these complex phenotypes, we developed a network-based approach to identify coexpression modules specifically activated in particular phenotypes. We integrated these modules, protein-protein interaction data, Gene Ontology annotations, and our database of gene-phenotype associations derived from literature to predict novel human gene-phenotype associations. Our systematic predictions provide us with the opportunity to perform a global analysis of human gene pleiotropy and its underlying regulatory mechanisms. RESULTS: We applied this method to 338 microarray datasets, covering 178 phenotype classes, and identified 193,145 phenotype-specific coexpression modules. We trained random forest classifiers for each phenotype and predicted a total of 6,558 gene-phenotype associations. We showed that 40.9% genes are pleiotropic, highlighting that pleiotropy is more prevalent than previously expected. We collected 77 ChIP-chip datasets studying 69 transcription factors binding over 16,000 targets under various phenotypic conditions. Utilizing this unique data source, we confirmed that dynamic transcriptional regulation is an important force driving the formation of phenotype specific gene modules. CONCLUSION: We created a genome-wide gene to phenotype mapping that has many potential implications, including providing potential new drug targets and uncovering the basis for human disease phenotypes. Our analysis of these phenotype-specific coexpression modules reveals a high prevalence of gene pleiotropy, and suggests that phenotype-specific transcription factor binding may contribute to phenotypic diversity. All resources from our study are made freely available on our online Phenotype Prediction Database [1].
format Text
id pubmed-3009536
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30095362010-12-23 An integrative modular approach to systematically predict gene-phenotype associations Mehan, Michael R Nunez-Iglesias, Juan Dai, Chao Waterman, Michael S Zhou, Xianghong Jasmine BMC Bioinformatics Research BACKGROUND: Complex human diseases are often caused by multiple mutations, each of which contributes only a minor effect to the disease phenotype. To study the basis for these complex phenotypes, we developed a network-based approach to identify coexpression modules specifically activated in particular phenotypes. We integrated these modules, protein-protein interaction data, Gene Ontology annotations, and our database of gene-phenotype associations derived from literature to predict novel human gene-phenotype associations. Our systematic predictions provide us with the opportunity to perform a global analysis of human gene pleiotropy and its underlying regulatory mechanisms. RESULTS: We applied this method to 338 microarray datasets, covering 178 phenotype classes, and identified 193,145 phenotype-specific coexpression modules. We trained random forest classifiers for each phenotype and predicted a total of 6,558 gene-phenotype associations. We showed that 40.9% genes are pleiotropic, highlighting that pleiotropy is more prevalent than previously expected. We collected 77 ChIP-chip datasets studying 69 transcription factors binding over 16,000 targets under various phenotypic conditions. Utilizing this unique data source, we confirmed that dynamic transcriptional regulation is an important force driving the formation of phenotype specific gene modules. CONCLUSION: We created a genome-wide gene to phenotype mapping that has many potential implications, including providing potential new drug targets and uncovering the basis for human disease phenotypes. Our analysis of these phenotype-specific coexpression modules reveals a high prevalence of gene pleiotropy, and suggests that phenotype-specific transcription factor binding may contribute to phenotypic diversity. All resources from our study are made freely available on our online Phenotype Prediction Database [1]. BioMed Central 2010-01-18 /pmc/articles/PMC3009536/ /pubmed/20122238 http://dx.doi.org/10.1186/1471-2105-11-S1-S62 Text en Copyright ©2010 Mehan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Mehan, Michael R
Nunez-Iglesias, Juan
Dai, Chao
Waterman, Michael S
Zhou, Xianghong Jasmine
An integrative modular approach to systematically predict gene-phenotype associations
title An integrative modular approach to systematically predict gene-phenotype associations
title_full An integrative modular approach to systematically predict gene-phenotype associations
title_fullStr An integrative modular approach to systematically predict gene-phenotype associations
title_full_unstemmed An integrative modular approach to systematically predict gene-phenotype associations
title_short An integrative modular approach to systematically predict gene-phenotype associations
title_sort integrative modular approach to systematically predict gene-phenotype associations
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3009536/
https://www.ncbi.nlm.nih.gov/pubmed/20122238
http://dx.doi.org/10.1186/1471-2105-11-S1-S62
work_keys_str_mv AT mehanmichaelr anintegrativemodularapproachtosystematicallypredictgenephenotypeassociations
AT nuneziglesiasjuan anintegrativemodularapproachtosystematicallypredictgenephenotypeassociations
AT daichao anintegrativemodularapproachtosystematicallypredictgenephenotypeassociations
AT watermanmichaels anintegrativemodularapproachtosystematicallypredictgenephenotypeassociations
AT zhouxianghongjasmine anintegrativemodularapproachtosystematicallypredictgenephenotypeassociations
AT mehanmichaelr integrativemodularapproachtosystematicallypredictgenephenotypeassociations
AT nuneziglesiasjuan integrativemodularapproachtosystematicallypredictgenephenotypeassociations
AT daichao integrativemodularapproachtosystematicallypredictgenephenotypeassociations
AT watermanmichaels integrativemodularapproachtosystematicallypredictgenephenotypeassociations
AT zhouxianghongjasmine integrativemodularapproachtosystematicallypredictgenephenotypeassociations