Cargando…
IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants
The in silico prediction of the functional consequences of mutations is an important goal of human pathogenetics. However, bioinformatic tools that classify mutations according to their functionality employ different algorithms so that predictions may vary markedly between tools. We therefore integr...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5388428/ https://www.ncbi.nlm.nih.gov/pubmed/28180317 http://dx.doi.org/10.1093/nar/gkw886 |
_version_ | 1782521129401843712 |
---|---|
author | Knecht, Carolin Mort, Matthew Junge, Olaf Cooper, David N. Krawczak, Michael Caliebe, Amke |
author_facet | Knecht, Carolin Mort, Matthew Junge, Olaf Cooper, David N. Krawczak, Michael Caliebe, Amke |
author_sort | Knecht, Carolin |
collection | PubMed |
description | The in silico prediction of the functional consequences of mutations is an important goal of human pathogenetics. However, bioinformatic tools that classify mutations according to their functionality employ different algorithms so that predictions may vary markedly between tools. We therefore integrated nine popular prediction tools (PolyPhen-2, SNPs&GO, MutPred, SIFT, MutationTaster2, Mutation Assessor and FATHMM as well as conservation-based Grantham Score and PhyloP) into a single predictor. The optimal combination of these tools was selected by means of a wide range of statistical modeling techniques, drawing upon 10 029 disease-causing single nucleotide variants (SNVs) from Human Gene Mutation Database and 10 002 putatively ‘benign’ non-synonymous SNVs from UCSC. Predictive performance was found to be markedly improved by model-based integration, whilst maximum predictive capability was obtained with either random forest, decision tree or logistic regression analysis. A combination of PolyPhen-2, SNPs&GO, MutPred, MutationTaster2 and FATHMM was found to perform as well as all tools combined. Comparison of our approach with other integrative approaches such as Condel, CoVEC, CAROL, CADD, MetaSVM and MetaLR using an independent validation dataset, revealed the superiority of our newly proposed integrative approach. An online implementation of this approach, IMHOTEP (‘Integrating Molecular Heuristics and Other Tools for Effect Prediction’), is provided at http://www.uni-kiel.de/medinfo/cgi-bin/predictor/. |
format | Online Article Text |
id | pubmed-5388428 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-53884282017-04-18 IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants Knecht, Carolin Mort, Matthew Junge, Olaf Cooper, David N. Krawczak, Michael Caliebe, Amke Nucleic Acids Res Methods Online The in silico prediction of the functional consequences of mutations is an important goal of human pathogenetics. However, bioinformatic tools that classify mutations according to their functionality employ different algorithms so that predictions may vary markedly between tools. We therefore integrated nine popular prediction tools (PolyPhen-2, SNPs&GO, MutPred, SIFT, MutationTaster2, Mutation Assessor and FATHMM as well as conservation-based Grantham Score and PhyloP) into a single predictor. The optimal combination of these tools was selected by means of a wide range of statistical modeling techniques, drawing upon 10 029 disease-causing single nucleotide variants (SNVs) from Human Gene Mutation Database and 10 002 putatively ‘benign’ non-synonymous SNVs from UCSC. Predictive performance was found to be markedly improved by model-based integration, whilst maximum predictive capability was obtained with either random forest, decision tree or logistic regression analysis. A combination of PolyPhen-2, SNPs&GO, MutPred, MutationTaster2 and FATHMM was found to perform as well as all tools combined. Comparison of our approach with other integrative approaches such as Condel, CoVEC, CAROL, CADD, MetaSVM and MetaLR using an independent validation dataset, revealed the superiority of our newly proposed integrative approach. An online implementation of this approach, IMHOTEP (‘Integrating Molecular Heuristics and Other Tools for Effect Prediction’), is provided at http://www.uni-kiel.de/medinfo/cgi-bin/predictor/. Oxford University Press 2017-02-17 2016-10-03 /pmc/articles/PMC5388428/ /pubmed/28180317 http://dx.doi.org/10.1093/nar/gkw886 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Knecht, Carolin Mort, Matthew Junge, Olaf Cooper, David N. Krawczak, Michael Caliebe, Amke IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants |
title | IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants |
title_full | IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants |
title_fullStr | IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants |
title_full_unstemmed | IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants |
title_short | IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants |
title_sort | imhotep—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5388428/ https://www.ncbi.nlm.nih.gov/pubmed/28180317 http://dx.doi.org/10.1093/nar/gkw886 |
work_keys_str_mv | AT knechtcarolin imhotepacompositescoreintegratingpopulartoolsforpredictingthefunctionalconsequencesofnonsynonymoussequencevariants AT mortmatthew imhotepacompositescoreintegratingpopulartoolsforpredictingthefunctionalconsequencesofnonsynonymoussequencevariants AT jungeolaf imhotepacompositescoreintegratingpopulartoolsforpredictingthefunctionalconsequencesofnonsynonymoussequencevariants AT cooperdavidn imhotepacompositescoreintegratingpopulartoolsforpredictingthefunctionalconsequencesofnonsynonymoussequencevariants AT krawczakmichael imhotepacompositescoreintegratingpopulartoolsforpredictingthefunctionalconsequencesofnonsynonymoussequencevariants AT caliebeamke imhotepacompositescoreintegratingpopulartoolsforpredictingthefunctionalconsequencesofnonsynonymoussequencevariants |