Cargando…

Prioritizing causal disease genes using unbiased genomic features

BACKGROUND: Cardiovascular disease (CVD) is the leading cause of death in the developed world. Human genetic studies, including genome-wide sequencing and SNP-array approaches, promise to reveal disease genes and mechanisms representing new therapeutic targets. In practice, however, identification o...

Descripción completa

Detalles Bibliográficos
Autores principales: Deo, Rahul C, Musso, Gabriel, Tasan, Murat, Tang, Paul, Poon, Annie, Yuan, Christiana, Felix, Janine F, Vasan, Ramachandran S, Beroukhim, Rameen, De Marco, Teresa, Kwok, Pui-Yan, MacRae, Calum A, Roth, Frederick P
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4279789/
https://www.ncbi.nlm.nih.gov/pubmed/25633252
http://dx.doi.org/10.1186/s13059-014-0534-8
_version_ 1782350761094545408
author Deo, Rahul C
Musso, Gabriel
Tasan, Murat
Tang, Paul
Poon, Annie
Yuan, Christiana
Felix, Janine F
Vasan, Ramachandran S
Beroukhim, Rameen
De Marco, Teresa
Kwok, Pui-Yan
MacRae, Calum A
Roth, Frederick P
author_facet Deo, Rahul C
Musso, Gabriel
Tasan, Murat
Tang, Paul
Poon, Annie
Yuan, Christiana
Felix, Janine F
Vasan, Ramachandran S
Beroukhim, Rameen
De Marco, Teresa
Kwok, Pui-Yan
MacRae, Calum A
Roth, Frederick P
author_sort Deo, Rahul C
collection PubMed
description BACKGROUND: Cardiovascular disease (CVD) is the leading cause of death in the developed world. Human genetic studies, including genome-wide sequencing and SNP-array approaches, promise to reveal disease genes and mechanisms representing new therapeutic targets. In practice, however, identification of the actual genes contributing to disease pathogenesis has lagged behind identification of associated loci, thus limiting the clinical benefits. RESULTS: To aid in localizing causal genes, we develop a machine learning approach, Objective Prioritization for Enhanced Novelty (OPEN), which quantitatively prioritizes gene-disease associations based on a diverse group of genomic features. This approach uses only unbiased predictive features and thus is not hampered by a preference towards previously well-characterized genes. We demonstrate success in identifying genetic determinants for CVD-related traits, including cholesterol levels, blood pressure, and conduction system and cardiomyopathy phenotypes. Using OPEN, we prioritize genes, including FLNC, for association with increased left ventricular diameter, which is a defining feature of a prevalent cardiovascular disorder, dilated cardiomyopathy or DCM. Using a zebrafish model, we experimentally validate FLNC and identify a novel FLNC splice-site mutation in a patient with severe DCM. CONCLUSION: Our approach stands to assist interpretation of large-scale genetic studies without compromising their fundamentally unbiased nature. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-014-0534-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4279789
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42797892015-01-22 Prioritizing causal disease genes using unbiased genomic features Deo, Rahul C Musso, Gabriel Tasan, Murat Tang, Paul Poon, Annie Yuan, Christiana Felix, Janine F Vasan, Ramachandran S Beroukhim, Rameen De Marco, Teresa Kwok, Pui-Yan MacRae, Calum A Roth, Frederick P Genome Biol Research BACKGROUND: Cardiovascular disease (CVD) is the leading cause of death in the developed world. Human genetic studies, including genome-wide sequencing and SNP-array approaches, promise to reveal disease genes and mechanisms representing new therapeutic targets. In practice, however, identification of the actual genes contributing to disease pathogenesis has lagged behind identification of associated loci, thus limiting the clinical benefits. RESULTS: To aid in localizing causal genes, we develop a machine learning approach, Objective Prioritization for Enhanced Novelty (OPEN), which quantitatively prioritizes gene-disease associations based on a diverse group of genomic features. This approach uses only unbiased predictive features and thus is not hampered by a preference towards previously well-characterized genes. We demonstrate success in identifying genetic determinants for CVD-related traits, including cholesterol levels, blood pressure, and conduction system and cardiomyopathy phenotypes. Using OPEN, we prioritize genes, including FLNC, for association with increased left ventricular diameter, which is a defining feature of a prevalent cardiovascular disorder, dilated cardiomyopathy or DCM. Using a zebrafish model, we experimentally validate FLNC and identify a novel FLNC splice-site mutation in a patient with severe DCM. CONCLUSION: Our approach stands to assist interpretation of large-scale genetic studies without compromising their fundamentally unbiased nature. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-014-0534-8) contains supplementary material, which is available to authorized users. BioMed Central 2014-12-03 2014 /pmc/articles/PMC4279789/ /pubmed/25633252 http://dx.doi.org/10.1186/s13059-014-0534-8 Text en © Deo et al.; licensee BioMed Central Ltd. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Deo, Rahul C
Musso, Gabriel
Tasan, Murat
Tang, Paul
Poon, Annie
Yuan, Christiana
Felix, Janine F
Vasan, Ramachandran S
Beroukhim, Rameen
De Marco, Teresa
Kwok, Pui-Yan
MacRae, Calum A
Roth, Frederick P
Prioritizing causal disease genes using unbiased genomic features
title Prioritizing causal disease genes using unbiased genomic features
title_full Prioritizing causal disease genes using unbiased genomic features
title_fullStr Prioritizing causal disease genes using unbiased genomic features
title_full_unstemmed Prioritizing causal disease genes using unbiased genomic features
title_short Prioritizing causal disease genes using unbiased genomic features
title_sort prioritizing causal disease genes using unbiased genomic features
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4279789/
https://www.ncbi.nlm.nih.gov/pubmed/25633252
http://dx.doi.org/10.1186/s13059-014-0534-8
work_keys_str_mv AT deorahulc prioritizingcausaldiseasegenesusingunbiasedgenomicfeatures
AT mussogabriel prioritizingcausaldiseasegenesusingunbiasedgenomicfeatures
AT tasanmurat prioritizingcausaldiseasegenesusingunbiasedgenomicfeatures
AT tangpaul prioritizingcausaldiseasegenesusingunbiasedgenomicfeatures
AT poonannie prioritizingcausaldiseasegenesusingunbiasedgenomicfeatures
AT yuanchristiana prioritizingcausaldiseasegenesusingunbiasedgenomicfeatures
AT felixjaninef prioritizingcausaldiseasegenesusingunbiasedgenomicfeatures
AT vasanramachandrans prioritizingcausaldiseasegenesusingunbiasedgenomicfeatures
AT beroukhimrameen prioritizingcausaldiseasegenesusingunbiasedgenomicfeatures
AT demarcoteresa prioritizingcausaldiseasegenesusingunbiasedgenomicfeatures
AT kwokpuiyan prioritizingcausaldiseasegenesusingunbiasedgenomicfeatures
AT macraecaluma prioritizingcausaldiseasegenesusingunbiasedgenomicfeatures
AT rothfrederickp prioritizingcausaldiseasegenesusingunbiasedgenomicfeatures