Cargando…
Better prediction of functional effects for sequence variants
Elucidating the effects of naturally occurring genetic variation is one of the major challenges for personalized health and personalized medicine. Here, we introduce SNAP2, a novel neural network based classifier that improves over the state-of-the-art in distinguishing between effect and neutral va...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4480835/ https://www.ncbi.nlm.nih.gov/pubmed/26110438 http://dx.doi.org/10.1186/1471-2164-16-S8-S1 |
_version_ | 1782378198658449408 |
---|---|
author | Hecht, Maximilian Bromberg, Yana Rost, Burkhard |
author_facet | Hecht, Maximilian Bromberg, Yana Rost, Burkhard |
author_sort | Hecht, Maximilian |
collection | PubMed |
description | Elucidating the effects of naturally occurring genetic variation is one of the major challenges for personalized health and personalized medicine. Here, we introduce SNAP2, a novel neural network based classifier that improves over the state-of-the-art in distinguishing between effect and neutral variants. Our method's improved performance results from screening many potentially relevant protein features and from refining our development data sets. Cross-validated on >100k experimentally annotated variants, SNAP2 significantly outperformed other methods, attaining a two-state accuracy (effect/neutral) of 83%. SNAP2 also outperformed combinations of other methods. Performance increased for human variants but much more so for other organisms. Our method's carefully calibrated reliability index informs selection of variants for experimental follow up, with the most strongly predicted half of all effect variants predicted at over 96% accuracy. As expected, the evolutionary information from automatically generated multiple sequence alignments gave the strongest signal for the prediction. However, we also optimized our new method to perform surprisingly well even without alignments. This feature reduces prediction runtime by over two orders of magnitude, enables cross-genome comparisons, and renders our new method as the best solution for the 10-20% of sequence orphans. SNAP2 is available at: https://rostlab.org/services/snap2web DEFINITIONS USED: Delta, input feature that results from computing the difference feature scores for native amino acid and feature scores for variant amino acid; nsSNP, non-synoymous SNP; PMD, Protein Mutant Database; SNAP, Screening for non-acceptable polymorphisms; SNP, single nucleotide polymorphism; variant, any amino acid changing sequence variant. |
format | Online Article Text |
id | pubmed-4480835 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44808352015-07-10 Better prediction of functional effects for sequence variants Hecht, Maximilian Bromberg, Yana Rost, Burkhard BMC Genomics Research Elucidating the effects of naturally occurring genetic variation is one of the major challenges for personalized health and personalized medicine. Here, we introduce SNAP2, a novel neural network based classifier that improves over the state-of-the-art in distinguishing between effect and neutral variants. Our method's improved performance results from screening many potentially relevant protein features and from refining our development data sets. Cross-validated on >100k experimentally annotated variants, SNAP2 significantly outperformed other methods, attaining a two-state accuracy (effect/neutral) of 83%. SNAP2 also outperformed combinations of other methods. Performance increased for human variants but much more so for other organisms. Our method's carefully calibrated reliability index informs selection of variants for experimental follow up, with the most strongly predicted half of all effect variants predicted at over 96% accuracy. As expected, the evolutionary information from automatically generated multiple sequence alignments gave the strongest signal for the prediction. However, we also optimized our new method to perform surprisingly well even without alignments. This feature reduces prediction runtime by over two orders of magnitude, enables cross-genome comparisons, and renders our new method as the best solution for the 10-20% of sequence orphans. SNAP2 is available at: https://rostlab.org/services/snap2web DEFINITIONS USED: Delta, input feature that results from computing the difference feature scores for native amino acid and feature scores for variant amino acid; nsSNP, non-synoymous SNP; PMD, Protein Mutant Database; SNAP, Screening for non-acceptable polymorphisms; SNP, single nucleotide polymorphism; variant, any amino acid changing sequence variant. BioMed Central 2015-06-18 /pmc/articles/PMC4480835/ /pubmed/26110438 http://dx.doi.org/10.1186/1471-2164-16-S8-S1 Text en Copyright © 2015 Hecht et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Hecht, Maximilian Bromberg, Yana Rost, Burkhard Better prediction of functional effects for sequence variants |
title | Better prediction of functional effects for sequence variants |
title_full | Better prediction of functional effects for sequence variants |
title_fullStr | Better prediction of functional effects for sequence variants |
title_full_unstemmed | Better prediction of functional effects for sequence variants |
title_short | Better prediction of functional effects for sequence variants |
title_sort | better prediction of functional effects for sequence variants |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4480835/ https://www.ncbi.nlm.nih.gov/pubmed/26110438 http://dx.doi.org/10.1186/1471-2164-16-S8-S1 |
work_keys_str_mv | AT hechtmaximilian betterpredictionoffunctionaleffectsforsequencevariants AT brombergyana betterpredictionoffunctionaleffectsforsequencevariants AT rostburkhard betterpredictionoffunctionaleffectsforsequencevariants |