Cargando…

PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations

Single nucleotide variants represent a prevalent form of genetic variation. Mutations in the coding regions are frequently associated with the development of various genetic diseases. Computational tools for the prediction of the effects of mutations on protein function are very important for analys...

Descripción completa

Detalles Bibliográficos
Autores principales: Bendl, Jaroslav, Stourac, Jan, Salanda, Ondrej, Pavelka, Antonin, Wieben, Eric D., Zendulka, Jaroslav, Brezovsky, Jan, Damborsky, Jiri
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3894168/
https://www.ncbi.nlm.nih.gov/pubmed/24453961
http://dx.doi.org/10.1371/journal.pcbi.1003440
_version_ 1782299809571405824
author Bendl, Jaroslav
Stourac, Jan
Salanda, Ondrej
Pavelka, Antonin
Wieben, Eric D.
Zendulka, Jaroslav
Brezovsky, Jan
Damborsky, Jiri
author_facet Bendl, Jaroslav
Stourac, Jan
Salanda, Ondrej
Pavelka, Antonin
Wieben, Eric D.
Zendulka, Jaroslav
Brezovsky, Jan
Damborsky, Jiri
author_sort Bendl, Jaroslav
collection PubMed
description Single nucleotide variants represent a prevalent form of genetic variation. Mutations in the coding regions are frequently associated with the development of various genetic diseases. Computational tools for the prediction of the effects of mutations on protein function are very important for analysis of single nucleotide variants and their prioritization for experimental characterization. Many computational tools are already widely employed for this purpose. Unfortunately, their comparison and further improvement is hindered by large overlaps between the training datasets and benchmark datasets, which lead to biased and overly optimistic reported performances. In this study, we have constructed three independent datasets by removing all duplicities, inconsistencies and mutations previously used in the training of evaluated tools. The benchmark dataset containing over 43,000 mutations was employed for the unbiased evaluation of eight established prediction tools: MAPP, nsSNPAnalyzer, PANTHER, PhD-SNP, PolyPhen-1, PolyPhen-2, SIFT and SNAP. The six best performing tools were combined into a consensus classifier PredictSNP, resulting into significantly improved prediction performance, and at the same time returned results for all mutations, confirming that consensus prediction represents an accurate and robust alternative to the predictions delivered by individual tools. A user-friendly web interface enables easy access to all eight prediction tools, the consensus classifier PredictSNP and annotations from the Protein Mutant Database and the UniProt database. The web server and the datasets are freely available to the academic community at http://loschmidt.chemi.muni.cz/predictsnp.
format Online
Article
Text
id pubmed-3894168
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38941682014-01-21 PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations Bendl, Jaroslav Stourac, Jan Salanda, Ondrej Pavelka, Antonin Wieben, Eric D. Zendulka, Jaroslav Brezovsky, Jan Damborsky, Jiri PLoS Comput Biol Research Article Single nucleotide variants represent a prevalent form of genetic variation. Mutations in the coding regions are frequently associated with the development of various genetic diseases. Computational tools for the prediction of the effects of mutations on protein function are very important for analysis of single nucleotide variants and their prioritization for experimental characterization. Many computational tools are already widely employed for this purpose. Unfortunately, their comparison and further improvement is hindered by large overlaps between the training datasets and benchmark datasets, which lead to biased and overly optimistic reported performances. In this study, we have constructed three independent datasets by removing all duplicities, inconsistencies and mutations previously used in the training of evaluated tools. The benchmark dataset containing over 43,000 mutations was employed for the unbiased evaluation of eight established prediction tools: MAPP, nsSNPAnalyzer, PANTHER, PhD-SNP, PolyPhen-1, PolyPhen-2, SIFT and SNAP. The six best performing tools were combined into a consensus classifier PredictSNP, resulting into significantly improved prediction performance, and at the same time returned results for all mutations, confirming that consensus prediction represents an accurate and robust alternative to the predictions delivered by individual tools. A user-friendly web interface enables easy access to all eight prediction tools, the consensus classifier PredictSNP and annotations from the Protein Mutant Database and the UniProt database. The web server and the datasets are freely available to the academic community at http://loschmidt.chemi.muni.cz/predictsnp. Public Library of Science 2014-01-16 /pmc/articles/PMC3894168/ /pubmed/24453961 http://dx.doi.org/10.1371/journal.pcbi.1003440 Text en http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Bendl, Jaroslav
Stourac, Jan
Salanda, Ondrej
Pavelka, Antonin
Wieben, Eric D.
Zendulka, Jaroslav
Brezovsky, Jan
Damborsky, Jiri
PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations
title PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations
title_full PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations
title_fullStr PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations
title_full_unstemmed PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations
title_short PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations
title_sort predictsnp: robust and accurate consensus classifier for prediction of disease-related mutations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3894168/
https://www.ncbi.nlm.nih.gov/pubmed/24453961
http://dx.doi.org/10.1371/journal.pcbi.1003440
work_keys_str_mv AT bendljaroslav predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations
AT stouracjan predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations
AT salandaondrej predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations
AT pavelkaantonin predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations
AT wiebenericd predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations
AT zendulkajaroslav predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations
AT brezovskyjan predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations
AT damborskyjiri predictsnprobustandaccurateconsensusclassifierforpredictionofdiseaserelatedmutations