Cargando…
PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations
Single nucleotide variants represent a prevalent form of genetic variation. Mutations in the coding regions are frequently associated with the development of various genetic diseases. Computational tools for the prediction of the effects of mutations on protein function are very important for analys...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3894168/ https://www.ncbi.nlm.nih.gov/pubmed/24453961 http://dx.doi.org/10.1371/journal.pcbi.1003440 |
_version_ | 1782299809571405824 |
---|---|
author | Bendl, Jaroslav Stourac, Jan Salanda, Ondrej Pavelka, Antonin Wieben, Eric D. Zendulka, Jaroslav Brezovsky, Jan Damborsky, Jiri |
author_facet | Bendl, Jaroslav Stourac, Jan Salanda, Ondrej Pavelka, Antonin Wieben, Eric D. Zendulka, Jaroslav Brezovsky, Jan Damborsky, Jiri |
author_sort | Bendl, Jaroslav |
collection | PubMed |
description | Single nucleotide variants represent a prevalent form of genetic variation. Mutations in the coding regions are frequently associated with the development of various genetic diseases. Computational tools for the prediction of the effects of mutations on protein function are very important for analysis of single nucleotide variants and their prioritization for experimental characterization. Many computational tools are already widely employed for this purpose. Unfortunately, their comparison and further improvement is hindered by large overlaps between the training datasets and benchmark datasets, which lead to biased and overly optimistic reported performances. In this study, we have constructed three independent datasets by removing all duplicities, inconsistencies and mutations previously used in the training of evaluated tools. The benchmark dataset containing over 43,000 mutations was employed for the unbiased evaluation of eight established prediction tools: MAPP, nsSNPAnalyzer, PANTHER, PhD-SNP, PolyPhen-1, PolyPhen-2, SIFT and SNAP. The six best performing tools were combined into a consensus classifier PredictSNP, resulting into significantly improved prediction performance, and at the same time returned results for all mutations, confirming that consensus prediction represents an accurate and robust alternative to the predictions delivered by individual tools. A user-friendly web interface enables easy access to all eight prediction tools, the consensus classifier PredictSNP and annotations from the Protein Mutant Database and the UniProt database. The web server and the datasets are freely available to the academic community at http://loschmidt.chemi.muni.cz/predictsnp. |
format | Online Article Text |
id | pubmed-3894168 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-38941682014-01-21 PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations Bendl, Jaroslav Stourac, Jan Salanda, Ondrej Pavelka, Antonin Wieben, Eric D. Zendulka, Jaroslav Brezovsky, Jan Damborsky, Jiri PLoS Comput Biol Research Article Single nucleotide variants represent a prevalent form of genetic variation. Mutations in the coding regions are frequently associated with the development of various genetic diseases. Computational tools for the prediction of the effects of mutations on protein function are very important for analysis of single nucleotide variants and their prioritization for experimental characterization. Many computational tools are already widely employed for this purpose. Unfortunately, their comparison and further improvement is hindered by large overlaps between the training datasets and benchmark datasets, which lead to biased and overly optimistic reported performances. In this study, we have constructed three independent datasets by removing all duplicities, inconsistencies and mutations previously used in the training of evaluated tools. The benchmark dataset containing over 43,000 mutations was employed for the unbiased evaluation of eight established prediction tools: MAPP, nsSNPAnalyzer, PANTHER, PhD-SNP, PolyPhen-1, PolyPhen-2, SIFT and SNAP. The six best performing tools were combined into a consensus classifier PredictSNP, resulting into significantly improved prediction performance, and at the same time returned results for all mutations, confirming that consensus prediction represents an accurate and robust alternative to the predictions delivered by individual tools. A user-friendly web interface enables easy access to all eight prediction tools, the consensus classifier PredictSNP and annotations from the Protein Mutant Database and the UniProt database. The web server and the datasets are freely available to the academic community at http://loschmidt.chemi.muni.cz/predictsnp. Public Library of Science 2014-01-16 /pmc/articles/PMC3894168/ /pubmed/24453961 http://dx.doi.org/10.1371/journal.pcbi.1003440 Text en http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Bendl, Jaroslav Stourac, Jan Salanda, Ondrej Pavelka, Antonin Wieben, Eric D. Zendulka, Jaroslav Brezovsky, Jan Damborsky, Jiri PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations |
title | PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations |
title_full | PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations |
title_fullStr | PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations |
title_full_unstemmed | PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations |
title_short | PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations |
title_sort | predictsnp: robust and accurate consensus classifier for prediction of disease-related mutations |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3894168/ https://www.ncbi.nlm.nih.gov/pubmed/24453961 http://dx.doi.org/10.1371/journal.pcbi.1003440 |
work_keys_str_mv | AT bendljaroslav predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations AT stouracjan predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations AT salandaondrej predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations AT pavelkaantonin predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations AT wiebenericd predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations AT zendulkajaroslav predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations AT brezovskyjan predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations AT damborskyjiri predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations |