Cargando…

Improved exome prioritization of disease genes through cross-species phenotype comparison

Numerous new disease-gene associations have been identified by whole-exome sequencing studies in the last few years. However, many cases remain unsolved due to the sheer number of candidate variants remaining after common filtering strategies such as removing low quality and common variants and thos...

Descripción completa

Detalles Bibliográficos
Autores principales: Robinson, Peter N., Köhler, Sebastian, Oellrich, Anika, Wang, Kai, Mungall, Christopher J., Lewis, Suzanna E., Washington, Nicole, Bauer, Sebastian, Seelow, Dominik, Krawitz, Peter, Gilissen, Christian, Haendel, Melissa, Smedley, Damian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3912424/
https://www.ncbi.nlm.nih.gov/pubmed/24162188
http://dx.doi.org/10.1101/gr.160325.113
_version_ 1782302081083768832
author Robinson, Peter N.
Köhler, Sebastian
Oellrich, Anika
Wang, Kai
Mungall, Christopher J.
Lewis, Suzanna E.
Washington, Nicole
Bauer, Sebastian
Seelow, Dominik
Krawitz, Peter
Gilissen, Christian
Haendel, Melissa
Smedley, Damian
author_facet Robinson, Peter N.
Köhler, Sebastian
Oellrich, Anika
Wang, Kai
Mungall, Christopher J.
Lewis, Suzanna E.
Washington, Nicole
Bauer, Sebastian
Seelow, Dominik
Krawitz, Peter
Gilissen, Christian
Haendel, Melissa
Smedley, Damian
author_sort Robinson, Peter N.
collection PubMed
description Numerous new disease-gene associations have been identified by whole-exome sequencing studies in the last few years. However, many cases remain unsolved due to the sheer number of candidate variants remaining after common filtering strategies such as removing low quality and common variants and those deemed unlikely to be pathogenic. The observation that each of our genomes contains about 100 genuine loss-of-function variants makes identification of the causative mutation problematic when using these strategies alone. We propose using the wealth of genotype to phenotype data that already exists from model organism studies to assess the potential impact of these exome variants. Here, we introduce PHenotypic Interpretation of Variants in Exomes (PHIVE), an algorithm that integrates the calculation of phenotype similarity between human diseases and genetically modified mouse models with evaluation of the variants according to allele frequency, pathogenicity, and mode of inheritance approaches in our Exomiser tool. Large-scale validation of PHIVE analysis using 100,000 exomes containing known mutations demonstrated a substantial improvement (up to 54.1-fold) over purely variant-based (frequency and pathogenicity) methods with the correct gene recalled as the top hit in up to 83% of samples, corresponding to an area under the ROC curve of >95%. We conclude that incorporation of phenotype data can play a vital role in translational bioinformatics and propose that exome sequencing projects should systematically capture clinical phenotypes to take advantage of the strategy presented here.
format Online
Article
Text
id pubmed-3912424
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-39124242014-02-18 Improved exome prioritization of disease genes through cross-species phenotype comparison Robinson, Peter N. Köhler, Sebastian Oellrich, Anika Wang, Kai Mungall, Christopher J. Lewis, Suzanna E. Washington, Nicole Bauer, Sebastian Seelow, Dominik Krawitz, Peter Gilissen, Christian Haendel, Melissa Smedley, Damian Genome Res Resource Numerous new disease-gene associations have been identified by whole-exome sequencing studies in the last few years. However, many cases remain unsolved due to the sheer number of candidate variants remaining after common filtering strategies such as removing low quality and common variants and those deemed unlikely to be pathogenic. The observation that each of our genomes contains about 100 genuine loss-of-function variants makes identification of the causative mutation problematic when using these strategies alone. We propose using the wealth of genotype to phenotype data that already exists from model organism studies to assess the potential impact of these exome variants. Here, we introduce PHenotypic Interpretation of Variants in Exomes (PHIVE), an algorithm that integrates the calculation of phenotype similarity between human diseases and genetically modified mouse models with evaluation of the variants according to allele frequency, pathogenicity, and mode of inheritance approaches in our Exomiser tool. Large-scale validation of PHIVE analysis using 100,000 exomes containing known mutations demonstrated a substantial improvement (up to 54.1-fold) over purely variant-based (frequency and pathogenicity) methods with the correct gene recalled as the top hit in up to 83% of samples, corresponding to an area under the ROC curve of >95%. We conclude that incorporation of phenotype data can play a vital role in translational bioinformatics and propose that exome sequencing projects should systematically capture clinical phenotypes to take advantage of the strategy presented here. Cold Spring Harbor Laboratory Press 2014-02 /pmc/articles/PMC3912424/ /pubmed/24162188 http://dx.doi.org/10.1101/gr.160325.113 Text en © 2014 Robinson et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by/3.0/ This article, published in Genome Research, is available under a Creative Commons License (Attribution 3.0 Unported), as described at http://creativecommons.org/licenses/by/3.0/.
spellingShingle Resource
Robinson, Peter N.
Köhler, Sebastian
Oellrich, Anika
Wang, Kai
Mungall, Christopher J.
Lewis, Suzanna E.
Washington, Nicole
Bauer, Sebastian
Seelow, Dominik
Krawitz, Peter
Gilissen, Christian
Haendel, Melissa
Smedley, Damian
Improved exome prioritization of disease genes through cross-species phenotype comparison
title Improved exome prioritization of disease genes through cross-species phenotype comparison
title_full Improved exome prioritization of disease genes through cross-species phenotype comparison
title_fullStr Improved exome prioritization of disease genes through cross-species phenotype comparison
title_full_unstemmed Improved exome prioritization of disease genes through cross-species phenotype comparison
title_short Improved exome prioritization of disease genes through cross-species phenotype comparison
title_sort improved exome prioritization of disease genes through cross-species phenotype comparison
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3912424/
https://www.ncbi.nlm.nih.gov/pubmed/24162188
http://dx.doi.org/10.1101/gr.160325.113
work_keys_str_mv AT robinsonpetern improvedexomeprioritizationofdiseasegenesthroughcrossspeciesphenotypecomparison
AT kohlersebastian improvedexomeprioritizationofdiseasegenesthroughcrossspeciesphenotypecomparison
AT oellrichanika improvedexomeprioritizationofdiseasegenesthroughcrossspeciesphenotypecomparison
AT improvedexomeprioritizationofdiseasegenesthroughcrossspeciesphenotypecomparison
AT wangkai improvedexomeprioritizationofdiseasegenesthroughcrossspeciesphenotypecomparison
AT mungallchristopherj improvedexomeprioritizationofdiseasegenesthroughcrossspeciesphenotypecomparison
AT lewissuzannae improvedexomeprioritizationofdiseasegenesthroughcrossspeciesphenotypecomparison
AT washingtonnicole improvedexomeprioritizationofdiseasegenesthroughcrossspeciesphenotypecomparison
AT bauersebastian improvedexomeprioritizationofdiseasegenesthroughcrossspeciesphenotypecomparison
AT seelowdominik improvedexomeprioritizationofdiseasegenesthroughcrossspeciesphenotypecomparison
AT krawitzpeter improvedexomeprioritizationofdiseasegenesthroughcrossspeciesphenotypecomparison
AT gilissenchristian improvedexomeprioritizationofdiseasegenesthroughcrossspeciesphenotypecomparison
AT haendelmelissa improvedexomeprioritizationofdiseasegenesthroughcrossspeciesphenotypecomparison
AT smedleydamian improvedexomeprioritizationofdiseasegenesthroughcrossspeciesphenotypecomparison