Cargando…
An integrative modular approach to systematically predict gene-phenotype associations
BACKGROUND: Complex human diseases are often caused by multiple mutations, each of which contributes only a minor effect to the disease phenotype. To study the basis for these complex phenotypes, we developed a network-based approach to identify coexpression modules specifically activated in particu...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3009536/ https://www.ncbi.nlm.nih.gov/pubmed/20122238 http://dx.doi.org/10.1186/1471-2105-11-S1-S62 |
_version_ | 1782194701595574272 |
---|---|
author | Mehan, Michael R Nunez-Iglesias, Juan Dai, Chao Waterman, Michael S Zhou, Xianghong Jasmine |
author_facet | Mehan, Michael R Nunez-Iglesias, Juan Dai, Chao Waterman, Michael S Zhou, Xianghong Jasmine |
author_sort | Mehan, Michael R |
collection | PubMed |
description | BACKGROUND: Complex human diseases are often caused by multiple mutations, each of which contributes only a minor effect to the disease phenotype. To study the basis for these complex phenotypes, we developed a network-based approach to identify coexpression modules specifically activated in particular phenotypes. We integrated these modules, protein-protein interaction data, Gene Ontology annotations, and our database of gene-phenotype associations derived from literature to predict novel human gene-phenotype associations. Our systematic predictions provide us with the opportunity to perform a global analysis of human gene pleiotropy and its underlying regulatory mechanisms. RESULTS: We applied this method to 338 microarray datasets, covering 178 phenotype classes, and identified 193,145 phenotype-specific coexpression modules. We trained random forest classifiers for each phenotype and predicted a total of 6,558 gene-phenotype associations. We showed that 40.9% genes are pleiotropic, highlighting that pleiotropy is more prevalent than previously expected. We collected 77 ChIP-chip datasets studying 69 transcription factors binding over 16,000 targets under various phenotypic conditions. Utilizing this unique data source, we confirmed that dynamic transcriptional regulation is an important force driving the formation of phenotype specific gene modules. CONCLUSION: We created a genome-wide gene to phenotype mapping that has many potential implications, including providing potential new drug targets and uncovering the basis for human disease phenotypes. Our analysis of these phenotype-specific coexpression modules reveals a high prevalence of gene pleiotropy, and suggests that phenotype-specific transcription factor binding may contribute to phenotypic diversity. All resources from our study are made freely available on our online Phenotype Prediction Database [1]. |
format | Text |
id | pubmed-3009536 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-30095362010-12-23 An integrative modular approach to systematically predict gene-phenotype associations Mehan, Michael R Nunez-Iglesias, Juan Dai, Chao Waterman, Michael S Zhou, Xianghong Jasmine BMC Bioinformatics Research BACKGROUND: Complex human diseases are often caused by multiple mutations, each of which contributes only a minor effect to the disease phenotype. To study the basis for these complex phenotypes, we developed a network-based approach to identify coexpression modules specifically activated in particular phenotypes. We integrated these modules, protein-protein interaction data, Gene Ontology annotations, and our database of gene-phenotype associations derived from literature to predict novel human gene-phenotype associations. Our systematic predictions provide us with the opportunity to perform a global analysis of human gene pleiotropy and its underlying regulatory mechanisms. RESULTS: We applied this method to 338 microarray datasets, covering 178 phenotype classes, and identified 193,145 phenotype-specific coexpression modules. We trained random forest classifiers for each phenotype and predicted a total of 6,558 gene-phenotype associations. We showed that 40.9% genes are pleiotropic, highlighting that pleiotropy is more prevalent than previously expected. We collected 77 ChIP-chip datasets studying 69 transcription factors binding over 16,000 targets under various phenotypic conditions. Utilizing this unique data source, we confirmed that dynamic transcriptional regulation is an important force driving the formation of phenotype specific gene modules. CONCLUSION: We created a genome-wide gene to phenotype mapping that has many potential implications, including providing potential new drug targets and uncovering the basis for human disease phenotypes. Our analysis of these phenotype-specific coexpression modules reveals a high prevalence of gene pleiotropy, and suggests that phenotype-specific transcription factor binding may contribute to phenotypic diversity. All resources from our study are made freely available on our online Phenotype Prediction Database [1]. BioMed Central 2010-01-18 /pmc/articles/PMC3009536/ /pubmed/20122238 http://dx.doi.org/10.1186/1471-2105-11-S1-S62 Text en Copyright ©2010 Mehan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Mehan, Michael R Nunez-Iglesias, Juan Dai, Chao Waterman, Michael S Zhou, Xianghong Jasmine An integrative modular approach to systematically predict gene-phenotype associations |
title | An integrative modular approach to systematically predict gene-phenotype associations |
title_full | An integrative modular approach to systematically predict gene-phenotype associations |
title_fullStr | An integrative modular approach to systematically predict gene-phenotype associations |
title_full_unstemmed | An integrative modular approach to systematically predict gene-phenotype associations |
title_short | An integrative modular approach to systematically predict gene-phenotype associations |
title_sort | integrative modular approach to systematically predict gene-phenotype associations |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3009536/ https://www.ncbi.nlm.nih.gov/pubmed/20122238 http://dx.doi.org/10.1186/1471-2105-11-S1-S62 |
work_keys_str_mv | AT mehanmichaelr anintegrativemodularapproachtosystematicallypredictgenephenotypeassociations AT nuneziglesiasjuan anintegrativemodularapproachtosystematicallypredictgenephenotypeassociations AT daichao anintegrativemodularapproachtosystematicallypredictgenephenotypeassociations AT watermanmichaels anintegrativemodularapproachtosystematicallypredictgenephenotypeassociations AT zhouxianghongjasmine anintegrativemodularapproachtosystematicallypredictgenephenotypeassociations AT mehanmichaelr integrativemodularapproachtosystematicallypredictgenephenotypeassociations AT nuneziglesiasjuan integrativemodularapproachtosystematicallypredictgenephenotypeassociations AT daichao integrativemodularapproachtosystematicallypredictgenephenotypeassociations AT watermanmichaels integrativemodularapproachtosystematicallypredictgenephenotypeassociations AT zhouxianghongjasmine integrativemodularapproachtosystematicallypredictgenephenotypeassociations |