Cargando…

Bayesian models for syndrome- and gene-specific probabilities of novel variant pathogenicity

BACKGROUND: With the advent of affordable and comprehensive sequencing technologies, access to molecular genetics for clinical diagnostics and research applications is increasing. However, variant interpretation remains challenging, and tools that close the gap between data generation and data inter...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruklisa, Dace, Ware, James S, Walsh, Roddy, Balding, David J, Cook, Stuart A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4308924/
https://www.ncbi.nlm.nih.gov/pubmed/25649125
http://dx.doi.org/10.1186/s13073-014-0120-4
_version_ 1782354610970689536
author Ruklisa, Dace
Ware, James S
Walsh, Roddy
Balding, David J
Cook, Stuart A
author_facet Ruklisa, Dace
Ware, James S
Walsh, Roddy
Balding, David J
Cook, Stuart A
author_sort Ruklisa, Dace
collection PubMed
description BACKGROUND: With the advent of affordable and comprehensive sequencing technologies, access to molecular genetics for clinical diagnostics and research applications is increasing. However, variant interpretation remains challenging, and tools that close the gap between data generation and data interpretation are urgently required. Here we present a transferable approach to help address the limitations in variant annotation. METHODS: We develop a network of Bayesian logistic regression models that integrate multiple lines of evidence to evaluate the probability that a rare variant is the cause of an individual’s disease. We present models for genes causing inherited cardiac conditions, though the framework is transferable to other genes and syndromes. RESULTS: Our models report a probability of pathogenicity, rather than a categorisation into pathogenic or benign, which captures the inherent uncertainty of the prediction. We find that gene- and syndrome-specific models outperform genome-wide approaches, and that the integration of multiple lines of evidence performs better than individual predictors. The models are adaptable to incorporate new lines of evidence, and results can be combined with familial segregation data in a transparent and quantitative manner to further enhance predictions. Though the probability scale is continuous, and innately interpretable, performance summaries based on thresholds are useful for comparisons. Using a threshold probability of pathogenicity of 0.9, we obtain a positive predictive value of 0.999 and sensitivity of 0.76 for the classification of variants known to cause long QT syndrome over the three most important genes, which represents sufficient accuracy to inform clinical decision-making. A web tool APPRAISE [http://www.cardiodb.org/APPRAISE] provides access to these models and predictions. CONCLUSIONS: Our Bayesian framework provides a transparent, flexible and robust framework for the analysis and interpretation of rare genetic variants. Models tailored to specific genes outperform genome-wide approaches, and can be sufficiently accurate to inform clinical decision-making. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13073-014-0120-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4308924
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43089242015-02-03 Bayesian models for syndrome- and gene-specific probabilities of novel variant pathogenicity Ruklisa, Dace Ware, James S Walsh, Roddy Balding, David J Cook, Stuart A Genome Med Research BACKGROUND: With the advent of affordable and comprehensive sequencing technologies, access to molecular genetics for clinical diagnostics and research applications is increasing. However, variant interpretation remains challenging, and tools that close the gap between data generation and data interpretation are urgently required. Here we present a transferable approach to help address the limitations in variant annotation. METHODS: We develop a network of Bayesian logistic regression models that integrate multiple lines of evidence to evaluate the probability that a rare variant is the cause of an individual’s disease. We present models for genes causing inherited cardiac conditions, though the framework is transferable to other genes and syndromes. RESULTS: Our models report a probability of pathogenicity, rather than a categorisation into pathogenic or benign, which captures the inherent uncertainty of the prediction. We find that gene- and syndrome-specific models outperform genome-wide approaches, and that the integration of multiple lines of evidence performs better than individual predictors. The models are adaptable to incorporate new lines of evidence, and results can be combined with familial segregation data in a transparent and quantitative manner to further enhance predictions. Though the probability scale is continuous, and innately interpretable, performance summaries based on thresholds are useful for comparisons. Using a threshold probability of pathogenicity of 0.9, we obtain a positive predictive value of 0.999 and sensitivity of 0.76 for the classification of variants known to cause long QT syndrome over the three most important genes, which represents sufficient accuracy to inform clinical decision-making. A web tool APPRAISE [http://www.cardiodb.org/APPRAISE] provides access to these models and predictions. CONCLUSIONS: Our Bayesian framework provides a transparent, flexible and robust framework for the analysis and interpretation of rare genetic variants. Models tailored to specific genes outperform genome-wide approaches, and can be sufficiently accurate to inform clinical decision-making. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13073-014-0120-4) contains supplementary material, which is available to authorized users. BioMed Central 2015-01-28 /pmc/articles/PMC4308924/ /pubmed/25649125 http://dx.doi.org/10.1186/s13073-014-0120-4 Text en © Ruklisa et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Ruklisa, Dace
Ware, James S
Walsh, Roddy
Balding, David J
Cook, Stuart A
Bayesian models for syndrome- and gene-specific probabilities of novel variant pathogenicity
title Bayesian models for syndrome- and gene-specific probabilities of novel variant pathogenicity
title_full Bayesian models for syndrome- and gene-specific probabilities of novel variant pathogenicity
title_fullStr Bayesian models for syndrome- and gene-specific probabilities of novel variant pathogenicity
title_full_unstemmed Bayesian models for syndrome- and gene-specific probabilities of novel variant pathogenicity
title_short Bayesian models for syndrome- and gene-specific probabilities of novel variant pathogenicity
title_sort bayesian models for syndrome- and gene-specific probabilities of novel variant pathogenicity
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4308924/
https://www.ncbi.nlm.nih.gov/pubmed/25649125
http://dx.doi.org/10.1186/s13073-014-0120-4
work_keys_str_mv AT ruklisadace bayesianmodelsforsyndromeandgenespecificprobabilitiesofnovelvariantpathogenicity
AT warejamess bayesianmodelsforsyndromeandgenespecificprobabilitiesofnovelvariantpathogenicity
AT walshroddy bayesianmodelsforsyndromeandgenespecificprobabilitiesofnovelvariantpathogenicity
AT baldingdavidj bayesianmodelsforsyndromeandgenespecificprobabilitiesofnovelvariantpathogenicity
AT cookstuarta bayesianmodelsforsyndromeandgenespecificprobabilitiesofnovelvariantpathogenicity