Cargando…

A computational framework for the prioritization of disease-gene candidates

BACKGROUND: The identification of genes and uncovering the role they play in diseases is an important and complex challenge. Genome-wide linkage and association studies have made advancements in identifying genetic variants that underpin human disease. An important challenge now is to identify meani...

Descripción completa

Detalles Bibliográficos
Autores principales: Browne, Fiona, Wang, Haiying, Zheng, Huiru
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4547404/
https://www.ncbi.nlm.nih.gov/pubmed/26330267
http://dx.doi.org/10.1186/1471-2164-16-S9-S2
_version_ 1782387066221363200
author Browne, Fiona
Wang, Haiying
Zheng, Huiru
author_facet Browne, Fiona
Wang, Haiying
Zheng, Huiru
author_sort Browne, Fiona
collection PubMed
description BACKGROUND: The identification of genes and uncovering the role they play in diseases is an important and complex challenge. Genome-wide linkage and association studies have made advancements in identifying genetic variants that underpin human disease. An important challenge now is to identify meaningful disease-associated genes from a long list of candidate genes implicated by these analyses. The application of gene prioritization can enhance our understanding of disease mechanisms and aid in the discovery of drug targets. The integration of protein-protein interaction networks along with disease datasets and contextual information is an important tool in unraveling the molecular basis of diseases. RESULTS: In this paper we propose a computational pipeline for the prioritization of disease-gene candidates. Diverse heterogeneous data including: gene-expression, protein-protein interaction network, ontology-based similarity and topological measures and tissue-specific are integrated. The pipeline was applied to prioritize Alzheimer's Disease (AD) genes, whereby a list of 32 prioritized genes was generated. This approach correctly identified key AD susceptible genes: PSEN1 and TRAF1. Biological process enrichment analysis revealed the prioritized genes are modulated in AD pathogenesis including: regulation of neurogenesis and generation of neurons. Relatively high predictive performance (AUC: 0.70) was observed when classifying AD and normal gene expression profiles from individuals using leave-one-out cross validation. CONCLUSIONS: This work provides a foundation for future investigation of diverse heterogeneous data integration for disease-gene prioritization.
format Online
Article
Text
id pubmed-4547404
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45474042015-09-10 A computational framework for the prioritization of disease-gene candidates Browne, Fiona Wang, Haiying Zheng, Huiru BMC Genomics Research BACKGROUND: The identification of genes and uncovering the role they play in diseases is an important and complex challenge. Genome-wide linkage and association studies have made advancements in identifying genetic variants that underpin human disease. An important challenge now is to identify meaningful disease-associated genes from a long list of candidate genes implicated by these analyses. The application of gene prioritization can enhance our understanding of disease mechanisms and aid in the discovery of drug targets. The integration of protein-protein interaction networks along with disease datasets and contextual information is an important tool in unraveling the molecular basis of diseases. RESULTS: In this paper we propose a computational pipeline for the prioritization of disease-gene candidates. Diverse heterogeneous data including: gene-expression, protein-protein interaction network, ontology-based similarity and topological measures and tissue-specific are integrated. The pipeline was applied to prioritize Alzheimer's Disease (AD) genes, whereby a list of 32 prioritized genes was generated. This approach correctly identified key AD susceptible genes: PSEN1 and TRAF1. Biological process enrichment analysis revealed the prioritized genes are modulated in AD pathogenesis including: regulation of neurogenesis and generation of neurons. Relatively high predictive performance (AUC: 0.70) was observed when classifying AD and normal gene expression profiles from individuals using leave-one-out cross validation. CONCLUSIONS: This work provides a foundation for future investigation of diverse heterogeneous data integration for disease-gene prioritization. BioMed Central 2015-08-17 /pmc/articles/PMC4547404/ /pubmed/26330267 http://dx.doi.org/10.1186/1471-2164-16-S9-S2 Text en Copyright © 2015 Browne et al. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Browne, Fiona
Wang, Haiying
Zheng, Huiru
A computational framework for the prioritization of disease-gene candidates
title A computational framework for the prioritization of disease-gene candidates
title_full A computational framework for the prioritization of disease-gene candidates
title_fullStr A computational framework for the prioritization of disease-gene candidates
title_full_unstemmed A computational framework for the prioritization of disease-gene candidates
title_short A computational framework for the prioritization of disease-gene candidates
title_sort computational framework for the prioritization of disease-gene candidates
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4547404/
https://www.ncbi.nlm.nih.gov/pubmed/26330267
http://dx.doi.org/10.1186/1471-2164-16-S9-S2
work_keys_str_mv AT brownefiona acomputationalframeworkfortheprioritizationofdiseasegenecandidates
AT wanghaiying acomputationalframeworkfortheprioritizationofdiseasegenecandidates
AT zhenghuiru acomputationalframeworkfortheprioritizationofdiseasegenecandidates
AT brownefiona computationalframeworkfortheprioritizationofdiseasegenecandidates
AT wanghaiying computationalframeworkfortheprioritizationofdiseasegenecandidates
AT zhenghuiru computationalframeworkfortheprioritizationofdiseasegenecandidates