Cargando…

IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants

The in silico prediction of the functional consequences of mutations is an important goal of human pathogenetics. However, bioinformatic tools that classify mutations according to their functionality employ different algorithms so that predictions may vary markedly between tools. We therefore integr...

Descripción completa

Detalles Bibliográficos
Autores principales: Knecht, Carolin, Mort, Matthew, Junge, Olaf, Cooper, David N., Krawczak, Michael, Caliebe, Amke
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5388428/
https://www.ncbi.nlm.nih.gov/pubmed/28180317
http://dx.doi.org/10.1093/nar/gkw886
_version_ 1782521129401843712
author Knecht, Carolin
Mort, Matthew
Junge, Olaf
Cooper, David N.
Krawczak, Michael
Caliebe, Amke
author_facet Knecht, Carolin
Mort, Matthew
Junge, Olaf
Cooper, David N.
Krawczak, Michael
Caliebe, Amke
author_sort Knecht, Carolin
collection PubMed
description The in silico prediction of the functional consequences of mutations is an important goal of human pathogenetics. However, bioinformatic tools that classify mutations according to their functionality employ different algorithms so that predictions may vary markedly between tools. We therefore integrated nine popular prediction tools (PolyPhen-2, SNPs&GO, MutPred, SIFT, MutationTaster2, Mutation Assessor and FATHMM as well as conservation-based Grantham Score and PhyloP) into a single predictor. The optimal combination of these tools was selected by means of a wide range of statistical modeling techniques, drawing upon 10 029 disease-causing single nucleotide variants (SNVs) from Human Gene Mutation Database and 10 002 putatively ‘benign’ non-synonymous SNVs from UCSC. Predictive performance was found to be markedly improved by model-based integration, whilst maximum predictive capability was obtained with either random forest, decision tree or logistic regression analysis. A combination of PolyPhen-2, SNPs&GO, MutPred, MutationTaster2 and FATHMM was found to perform as well as all tools combined. Comparison of our approach with other integrative approaches such as Condel, CoVEC, CAROL, CADD, MetaSVM and MetaLR using an independent validation dataset, revealed the superiority of our newly proposed integrative approach. An online implementation of this approach, IMHOTEP (‘Integrating Molecular Heuristics and Other Tools for Effect Prediction’), is provided at http://www.uni-kiel.de/medinfo/cgi-bin/predictor/.
format Online
Article
Text
id pubmed-5388428
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-53884282017-04-18 IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants Knecht, Carolin Mort, Matthew Junge, Olaf Cooper, David N. Krawczak, Michael Caliebe, Amke Nucleic Acids Res Methods Online The in silico prediction of the functional consequences of mutations is an important goal of human pathogenetics. However, bioinformatic tools that classify mutations according to their functionality employ different algorithms so that predictions may vary markedly between tools. We therefore integrated nine popular prediction tools (PolyPhen-2, SNPs&GO, MutPred, SIFT, MutationTaster2, Mutation Assessor and FATHMM as well as conservation-based Grantham Score and PhyloP) into a single predictor. The optimal combination of these tools was selected by means of a wide range of statistical modeling techniques, drawing upon 10 029 disease-causing single nucleotide variants (SNVs) from Human Gene Mutation Database and 10 002 putatively ‘benign’ non-synonymous SNVs from UCSC. Predictive performance was found to be markedly improved by model-based integration, whilst maximum predictive capability was obtained with either random forest, decision tree or logistic regression analysis. A combination of PolyPhen-2, SNPs&GO, MutPred, MutationTaster2 and FATHMM was found to perform as well as all tools combined. Comparison of our approach with other integrative approaches such as Condel, CoVEC, CAROL, CADD, MetaSVM and MetaLR using an independent validation dataset, revealed the superiority of our newly proposed integrative approach. An online implementation of this approach, IMHOTEP (‘Integrating Molecular Heuristics and Other Tools for Effect Prediction’), is provided at http://www.uni-kiel.de/medinfo/cgi-bin/predictor/. Oxford University Press 2017-02-17 2016-10-03 /pmc/articles/PMC5388428/ /pubmed/28180317 http://dx.doi.org/10.1093/nar/gkw886 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Knecht, Carolin
Mort, Matthew
Junge, Olaf
Cooper, David N.
Krawczak, Michael
Caliebe, Amke
IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants
title IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants
title_full IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants
title_fullStr IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants
title_full_unstemmed IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants
title_short IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants
title_sort imhotep—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5388428/
https://www.ncbi.nlm.nih.gov/pubmed/28180317
http://dx.doi.org/10.1093/nar/gkw886
work_keys_str_mv AT knechtcarolin imhotepacompositescoreintegratingpopulartoolsforpredictingthefunctionalconsequencesofnonsynonymoussequencevariants
AT mortmatthew imhotepacompositescoreintegratingpopulartoolsforpredictingthefunctionalconsequencesofnonsynonymoussequencevariants
AT jungeolaf imhotepacompositescoreintegratingpopulartoolsforpredictingthefunctionalconsequencesofnonsynonymoussequencevariants
AT cooperdavidn imhotepacompositescoreintegratingpopulartoolsforpredictingthefunctionalconsequencesofnonsynonymoussequencevariants
AT krawczakmichael imhotepacompositescoreintegratingpopulartoolsforpredictingthefunctionalconsequencesofnonsynonymoussequencevariants
AT caliebeamke imhotepacompositescoreintegratingpopulartoolsforpredictingthefunctionalconsequencesofnonsynonymoussequencevariants