Cargando…

Integrating human omics data to prioritize candidate genes

BACKGROUND: The identification of genes involved in human complex diseases remains a great challenge in computational systems biology. Although methods have been developed to use disease phenotypic similarities with a protein-protein interaction network for the prioritization of candidate genes, oth...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Yong, Wu, Xuebing, Jiang, Rui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3878333/
https://www.ncbi.nlm.nih.gov/pubmed/24344781
http://dx.doi.org/10.1186/1755-8794-6-57
_version_ 1782297783794925568
author Chen, Yong
Wu, Xuebing
Jiang, Rui
author_facet Chen, Yong
Wu, Xuebing
Jiang, Rui
author_sort Chen, Yong
collection PubMed
description BACKGROUND: The identification of genes involved in human complex diseases remains a great challenge in computational systems biology. Although methods have been developed to use disease phenotypic similarities with a protein-protein interaction network for the prioritization of candidate genes, other valuable omics data sources have been largely overlooked in these methods. METHODS: With this understanding, we proposed a method called BRIDGE to prioritize candidate genes by integrating disease phenotypic similarities with such omics data as protein-protein interactions, gene sequence similarities, gene expression patterns, gene ontology annotations, and gene pathway memberships. BRIDGE utilizes a multiple regression model with lasso penalty to automatically weight different data sources and is capable of discovering genes associated with diseases whose genetic bases are completely unknown. RESULTS: We conducted large-scale cross-validation experiments and demonstrated that more than 60% known disease genes can be ranked top one by BRIDGE in simulated linkage intervals, suggesting the superior performance of this method. We further performed two comprehensive case studies by applying BRIDGE to predict novel genes and transcriptional networks involved in obesity and type II diabetes. CONCLUSION: The proposed method provides an effective and scalable way for integrating multi omics data to infer disease genes. Further applications of BRIDGE will be benefit to providing novel disease genes and underlying mechanisms of human diseases.
format Online
Article
Text
id pubmed-3878333
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38783332014-01-07 Integrating human omics data to prioritize candidate genes Chen, Yong Wu, Xuebing Jiang, Rui BMC Med Genomics Research Article BACKGROUND: The identification of genes involved in human complex diseases remains a great challenge in computational systems biology. Although methods have been developed to use disease phenotypic similarities with a protein-protein interaction network for the prioritization of candidate genes, other valuable omics data sources have been largely overlooked in these methods. METHODS: With this understanding, we proposed a method called BRIDGE to prioritize candidate genes by integrating disease phenotypic similarities with such omics data as protein-protein interactions, gene sequence similarities, gene expression patterns, gene ontology annotations, and gene pathway memberships. BRIDGE utilizes a multiple regression model with lasso penalty to automatically weight different data sources and is capable of discovering genes associated with diseases whose genetic bases are completely unknown. RESULTS: We conducted large-scale cross-validation experiments and demonstrated that more than 60% known disease genes can be ranked top one by BRIDGE in simulated linkage intervals, suggesting the superior performance of this method. We further performed two comprehensive case studies by applying BRIDGE to predict novel genes and transcriptional networks involved in obesity and type II diabetes. CONCLUSION: The proposed method provides an effective and scalable way for integrating multi omics data to infer disease genes. Further applications of BRIDGE will be benefit to providing novel disease genes and underlying mechanisms of human diseases. BioMed Central 2013-12-18 /pmc/articles/PMC3878333/ /pubmed/24344781 http://dx.doi.org/10.1186/1755-8794-6-57 Text en Copyright © 2013 Chen et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chen, Yong
Wu, Xuebing
Jiang, Rui
Integrating human omics data to prioritize candidate genes
title Integrating human omics data to prioritize candidate genes
title_full Integrating human omics data to prioritize candidate genes
title_fullStr Integrating human omics data to prioritize candidate genes
title_full_unstemmed Integrating human omics data to prioritize candidate genes
title_short Integrating human omics data to prioritize candidate genes
title_sort integrating human omics data to prioritize candidate genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3878333/
https://www.ncbi.nlm.nih.gov/pubmed/24344781
http://dx.doi.org/10.1186/1755-8794-6-57
work_keys_str_mv AT chenyong integratinghumanomicsdatatoprioritizecandidategenes
AT wuxuebing integratinghumanomicsdatatoprioritizecandidategenes
AT jiangrui integratinghumanomicsdatatoprioritizecandidategenes