Cargando…
A computational framework for the prioritization of disease-gene candidates
BACKGROUND: The identification of genes and uncovering the role they play in diseases is an important and complex challenge. Genome-wide linkage and association studies have made advancements in identifying genetic variants that underpin human disease. An important challenge now is to identify meani...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4547404/ https://www.ncbi.nlm.nih.gov/pubmed/26330267 http://dx.doi.org/10.1186/1471-2164-16-S9-S2 |
_version_ | 1782387066221363200 |
---|---|
author | Browne, Fiona Wang, Haiying Zheng, Huiru |
author_facet | Browne, Fiona Wang, Haiying Zheng, Huiru |
author_sort | Browne, Fiona |
collection | PubMed |
description | BACKGROUND: The identification of genes and uncovering the role they play in diseases is an important and complex challenge. Genome-wide linkage and association studies have made advancements in identifying genetic variants that underpin human disease. An important challenge now is to identify meaningful disease-associated genes from a long list of candidate genes implicated by these analyses. The application of gene prioritization can enhance our understanding of disease mechanisms and aid in the discovery of drug targets. The integration of protein-protein interaction networks along with disease datasets and contextual information is an important tool in unraveling the molecular basis of diseases. RESULTS: In this paper we propose a computational pipeline for the prioritization of disease-gene candidates. Diverse heterogeneous data including: gene-expression, protein-protein interaction network, ontology-based similarity and topological measures and tissue-specific are integrated. The pipeline was applied to prioritize Alzheimer's Disease (AD) genes, whereby a list of 32 prioritized genes was generated. This approach correctly identified key AD susceptible genes: PSEN1 and TRAF1. Biological process enrichment analysis revealed the prioritized genes are modulated in AD pathogenesis including: regulation of neurogenesis and generation of neurons. Relatively high predictive performance (AUC: 0.70) was observed when classifying AD and normal gene expression profiles from individuals using leave-one-out cross validation. CONCLUSIONS: This work provides a foundation for future investigation of diverse heterogeneous data integration for disease-gene prioritization. |
format | Online Article Text |
id | pubmed-4547404 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45474042015-09-10 A computational framework for the prioritization of disease-gene candidates Browne, Fiona Wang, Haiying Zheng, Huiru BMC Genomics Research BACKGROUND: The identification of genes and uncovering the role they play in diseases is an important and complex challenge. Genome-wide linkage and association studies have made advancements in identifying genetic variants that underpin human disease. An important challenge now is to identify meaningful disease-associated genes from a long list of candidate genes implicated by these analyses. The application of gene prioritization can enhance our understanding of disease mechanisms and aid in the discovery of drug targets. The integration of protein-protein interaction networks along with disease datasets and contextual information is an important tool in unraveling the molecular basis of diseases. RESULTS: In this paper we propose a computational pipeline for the prioritization of disease-gene candidates. Diverse heterogeneous data including: gene-expression, protein-protein interaction network, ontology-based similarity and topological measures and tissue-specific are integrated. The pipeline was applied to prioritize Alzheimer's Disease (AD) genes, whereby a list of 32 prioritized genes was generated. This approach correctly identified key AD susceptible genes: PSEN1 and TRAF1. Biological process enrichment analysis revealed the prioritized genes are modulated in AD pathogenesis including: regulation of neurogenesis and generation of neurons. Relatively high predictive performance (AUC: 0.70) was observed when classifying AD and normal gene expression profiles from individuals using leave-one-out cross validation. CONCLUSIONS: This work provides a foundation for future investigation of diverse heterogeneous data integration for disease-gene prioritization. BioMed Central 2015-08-17 /pmc/articles/PMC4547404/ /pubmed/26330267 http://dx.doi.org/10.1186/1471-2164-16-S9-S2 Text en Copyright © 2015 Browne et al. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Browne, Fiona Wang, Haiying Zheng, Huiru A computational framework for the prioritization of disease-gene candidates |
title | A computational framework for the prioritization of disease-gene candidates |
title_full | A computational framework for the prioritization of disease-gene candidates |
title_fullStr | A computational framework for the prioritization of disease-gene candidates |
title_full_unstemmed | A computational framework for the prioritization of disease-gene candidates |
title_short | A computational framework for the prioritization of disease-gene candidates |
title_sort | computational framework for the prioritization of disease-gene candidates |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4547404/ https://www.ncbi.nlm.nih.gov/pubmed/26330267 http://dx.doi.org/10.1186/1471-2164-16-S9-S2 |
work_keys_str_mv | AT brownefiona acomputationalframeworkfortheprioritizationofdiseasegenecandidates AT wanghaiying acomputationalframeworkfortheprioritizationofdiseasegenecandidates AT zhenghuiru acomputationalframeworkfortheprioritizationofdiseasegenecandidates AT brownefiona computationalframeworkfortheprioritizationofdiseasegenecandidates AT wanghaiying computationalframeworkfortheprioritizationofdiseasegenecandidates AT zhenghuiru computationalframeworkfortheprioritizationofdiseasegenecandidates |