Cargando…

Better prediction of functional effects for sequence variants

Elucidating the effects of naturally occurring genetic variation is one of the major challenges for personalized health and personalized medicine. Here, we introduce SNAP2, a novel neural network based classifier that improves over the state-of-the-art in distinguishing between effect and neutral va...

Descripción completa

Detalles Bibliográficos
Autores principales: Hecht, Maximilian, Bromberg, Yana, Rost, Burkhard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4480835/
https://www.ncbi.nlm.nih.gov/pubmed/26110438
http://dx.doi.org/10.1186/1471-2164-16-S8-S1
_version_ 1782378198658449408
author Hecht, Maximilian
Bromberg, Yana
Rost, Burkhard
author_facet Hecht, Maximilian
Bromberg, Yana
Rost, Burkhard
author_sort Hecht, Maximilian
collection PubMed
description Elucidating the effects of naturally occurring genetic variation is one of the major challenges for personalized health and personalized medicine. Here, we introduce SNAP2, a novel neural network based classifier that improves over the state-of-the-art in distinguishing between effect and neutral variants. Our method's improved performance results from screening many potentially relevant protein features and from refining our development data sets. Cross-validated on >100k experimentally annotated variants, SNAP2 significantly outperformed other methods, attaining a two-state accuracy (effect/neutral) of 83%. SNAP2 also outperformed combinations of other methods. Performance increased for human variants but much more so for other organisms. Our method's carefully calibrated reliability index informs selection of variants for experimental follow up, with the most strongly predicted half of all effect variants predicted at over 96% accuracy. As expected, the evolutionary information from automatically generated multiple sequence alignments gave the strongest signal for the prediction. However, we also optimized our new method to perform surprisingly well even without alignments. This feature reduces prediction runtime by over two orders of magnitude, enables cross-genome comparisons, and renders our new method as the best solution for the 10-20% of sequence orphans. SNAP2 is available at: https://rostlab.org/services/snap2web DEFINITIONS USED: Delta, input feature that results from computing the difference feature scores for native amino acid and feature scores for variant amino acid; nsSNP, non-synoymous SNP; PMD, Protein Mutant Database; SNAP, Screening for non-acceptable polymorphisms; SNP, single nucleotide polymorphism; variant, any amino acid changing sequence variant.
format Online
Article
Text
id pubmed-4480835
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44808352015-07-10 Better prediction of functional effects for sequence variants Hecht, Maximilian Bromberg, Yana Rost, Burkhard BMC Genomics Research Elucidating the effects of naturally occurring genetic variation is one of the major challenges for personalized health and personalized medicine. Here, we introduce SNAP2, a novel neural network based classifier that improves over the state-of-the-art in distinguishing between effect and neutral variants. Our method's improved performance results from screening many potentially relevant protein features and from refining our development data sets. Cross-validated on >100k experimentally annotated variants, SNAP2 significantly outperformed other methods, attaining a two-state accuracy (effect/neutral) of 83%. SNAP2 also outperformed combinations of other methods. Performance increased for human variants but much more so for other organisms. Our method's carefully calibrated reliability index informs selection of variants for experimental follow up, with the most strongly predicted half of all effect variants predicted at over 96% accuracy. As expected, the evolutionary information from automatically generated multiple sequence alignments gave the strongest signal for the prediction. However, we also optimized our new method to perform surprisingly well even without alignments. This feature reduces prediction runtime by over two orders of magnitude, enables cross-genome comparisons, and renders our new method as the best solution for the 10-20% of sequence orphans. SNAP2 is available at: https://rostlab.org/services/snap2web DEFINITIONS USED: Delta, input feature that results from computing the difference feature scores for native amino acid and feature scores for variant amino acid; nsSNP, non-synoymous SNP; PMD, Protein Mutant Database; SNAP, Screening for non-acceptable polymorphisms; SNP, single nucleotide polymorphism; variant, any amino acid changing sequence variant. BioMed Central 2015-06-18 /pmc/articles/PMC4480835/ /pubmed/26110438 http://dx.doi.org/10.1186/1471-2164-16-S8-S1 Text en Copyright © 2015 Hecht et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Hecht, Maximilian
Bromberg, Yana
Rost, Burkhard
Better prediction of functional effects for sequence variants
title Better prediction of functional effects for sequence variants
title_full Better prediction of functional effects for sequence variants
title_fullStr Better prediction of functional effects for sequence variants
title_full_unstemmed Better prediction of functional effects for sequence variants
title_short Better prediction of functional effects for sequence variants
title_sort better prediction of functional effects for sequence variants
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4480835/
https://www.ncbi.nlm.nih.gov/pubmed/26110438
http://dx.doi.org/10.1186/1471-2164-16-S8-S1
work_keys_str_mv AT hechtmaximilian betterpredictionoffunctionaleffectsforsequencevariants
AT brombergyana betterpredictionoffunctionaleffectsforsequencevariants
AT rostburkhard betterpredictionoffunctionaleffectsforsequencevariants