Cargando…
Integrating human omics data to prioritize candidate genes
BACKGROUND: The identification of genes involved in human complex diseases remains a great challenge in computational systems biology. Although methods have been developed to use disease phenotypic similarities with a protein-protein interaction network for the prioritization of candidate genes, oth...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3878333/ https://www.ncbi.nlm.nih.gov/pubmed/24344781 http://dx.doi.org/10.1186/1755-8794-6-57 |
_version_ | 1782297783794925568 |
---|---|
author | Chen, Yong Wu, Xuebing Jiang, Rui |
author_facet | Chen, Yong Wu, Xuebing Jiang, Rui |
author_sort | Chen, Yong |
collection | PubMed |
description | BACKGROUND: The identification of genes involved in human complex diseases remains a great challenge in computational systems biology. Although methods have been developed to use disease phenotypic similarities with a protein-protein interaction network for the prioritization of candidate genes, other valuable omics data sources have been largely overlooked in these methods. METHODS: With this understanding, we proposed a method called BRIDGE to prioritize candidate genes by integrating disease phenotypic similarities with such omics data as protein-protein interactions, gene sequence similarities, gene expression patterns, gene ontology annotations, and gene pathway memberships. BRIDGE utilizes a multiple regression model with lasso penalty to automatically weight different data sources and is capable of discovering genes associated with diseases whose genetic bases are completely unknown. RESULTS: We conducted large-scale cross-validation experiments and demonstrated that more than 60% known disease genes can be ranked top one by BRIDGE in simulated linkage intervals, suggesting the superior performance of this method. We further performed two comprehensive case studies by applying BRIDGE to predict novel genes and transcriptional networks involved in obesity and type II diabetes. CONCLUSION: The proposed method provides an effective and scalable way for integrating multi omics data to infer disease genes. Further applications of BRIDGE will be benefit to providing novel disease genes and underlying mechanisms of human diseases. |
format | Online Article Text |
id | pubmed-3878333 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-38783332014-01-07 Integrating human omics data to prioritize candidate genes Chen, Yong Wu, Xuebing Jiang, Rui BMC Med Genomics Research Article BACKGROUND: The identification of genes involved in human complex diseases remains a great challenge in computational systems biology. Although methods have been developed to use disease phenotypic similarities with a protein-protein interaction network for the prioritization of candidate genes, other valuable omics data sources have been largely overlooked in these methods. METHODS: With this understanding, we proposed a method called BRIDGE to prioritize candidate genes by integrating disease phenotypic similarities with such omics data as protein-protein interactions, gene sequence similarities, gene expression patterns, gene ontology annotations, and gene pathway memberships. BRIDGE utilizes a multiple regression model with lasso penalty to automatically weight different data sources and is capable of discovering genes associated with diseases whose genetic bases are completely unknown. RESULTS: We conducted large-scale cross-validation experiments and demonstrated that more than 60% known disease genes can be ranked top one by BRIDGE in simulated linkage intervals, suggesting the superior performance of this method. We further performed two comprehensive case studies by applying BRIDGE to predict novel genes and transcriptional networks involved in obesity and type II diabetes. CONCLUSION: The proposed method provides an effective and scalable way for integrating multi omics data to infer disease genes. Further applications of BRIDGE will be benefit to providing novel disease genes and underlying mechanisms of human diseases. BioMed Central 2013-12-18 /pmc/articles/PMC3878333/ /pubmed/24344781 http://dx.doi.org/10.1186/1755-8794-6-57 Text en Copyright © 2013 Chen et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Chen, Yong Wu, Xuebing Jiang, Rui Integrating human omics data to prioritize candidate genes |
title | Integrating human omics data to prioritize candidate genes |
title_full | Integrating human omics data to prioritize candidate genes |
title_fullStr | Integrating human omics data to prioritize candidate genes |
title_full_unstemmed | Integrating human omics data to prioritize candidate genes |
title_short | Integrating human omics data to prioritize candidate genes |
title_sort | integrating human omics data to prioritize candidate genes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3878333/ https://www.ncbi.nlm.nih.gov/pubmed/24344781 http://dx.doi.org/10.1186/1755-8794-6-57 |
work_keys_str_mv | AT chenyong integratinghumanomicsdatatoprioritizecandidategenes AT wuxuebing integratinghumanomicsdatatoprioritizecandidategenes AT jiangrui integratinghumanomicsdatatoprioritizecandidategenes |