Cargando…

Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase

Computational methods that allow predicting the effects of nonsynonymous substitutions are an integral part of exome studies. Here, we validated and improved their specificity by performing a comprehensive bioinformatics analysis combined with experimental and clinical data on a model of glucokinase...

Descripción completa

Detalles Bibliográficos
Autores principales: Šimčíková, Daniela, Kocková, Lucie, Vackářová, Kateřina, Těšínský, Miroslav, Heneberg, Petr
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5573313/
https://www.ncbi.nlm.nih.gov/pubmed/28842611
http://dx.doi.org/10.1038/s41598-017-09810-0
_version_ 1783259634057871360
author Šimčíková, Daniela
Kocková, Lucie
Vackářová, Kateřina
Těšínský, Miroslav
Heneberg, Petr
author_facet Šimčíková, Daniela
Kocková, Lucie
Vackářová, Kateřina
Těšínský, Miroslav
Heneberg, Petr
author_sort Šimčíková, Daniela
collection PubMed
description Computational methods that allow predicting the effects of nonsynonymous substitutions are an integral part of exome studies. Here, we validated and improved their specificity by performing a comprehensive bioinformatics analysis combined with experimental and clinical data on a model of glucokinase (GCK): 8835 putative variations, including 515 disease-associated variations from 1596 families with diagnoses of monogenic diabetes (GCK-MODY) or persistent hyperinsulinemic hypoglycemia of infancy (PHHI), and 126 variations with available or newly reported (19 variations) data on enzyme kinetics. We also proved that high frequency of disease-associated variations found in patients is closely related to their evolutionary conservation. The default set prediction methods predicted correctly the effects of only a part of the GCK-MODY-associated variations and completely failed to predict the normoglycemic or PHHI-associated variations. Therefore, we calculated evidence-based thresholds that improved significantly the specificity of predictions (≤75%). The combined prediction analysis even allowed to distinguish activating from inactivating variations and identified a group of putatively highly pathogenic variations (EVmutation score <−7.5 and SNAP2 score >70), which were surprisingly underrepresented among MODY patients and thus under negative selection during molecular evolution. We suggested and validated the first robust evidence-based thresholds, which allow improved, highly specific predictions of disease-associated GCK variations.
format Online
Article
Text
id pubmed-5573313
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-55733132017-09-01 Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase Šimčíková, Daniela Kocková, Lucie Vackářová, Kateřina Těšínský, Miroslav Heneberg, Petr Sci Rep Article Computational methods that allow predicting the effects of nonsynonymous substitutions are an integral part of exome studies. Here, we validated and improved their specificity by performing a comprehensive bioinformatics analysis combined with experimental and clinical data on a model of glucokinase (GCK): 8835 putative variations, including 515 disease-associated variations from 1596 families with diagnoses of monogenic diabetes (GCK-MODY) or persistent hyperinsulinemic hypoglycemia of infancy (PHHI), and 126 variations with available or newly reported (19 variations) data on enzyme kinetics. We also proved that high frequency of disease-associated variations found in patients is closely related to their evolutionary conservation. The default set prediction methods predicted correctly the effects of only a part of the GCK-MODY-associated variations and completely failed to predict the normoglycemic or PHHI-associated variations. Therefore, we calculated evidence-based thresholds that improved significantly the specificity of predictions (≤75%). The combined prediction analysis even allowed to distinguish activating from inactivating variations and identified a group of putatively highly pathogenic variations (EVmutation score <−7.5 and SNAP2 score >70), which were surprisingly underrepresented among MODY patients and thus under negative selection during molecular evolution. We suggested and validated the first robust evidence-based thresholds, which allow improved, highly specific predictions of disease-associated GCK variations. Nature Publishing Group UK 2017-08-25 /pmc/articles/PMC5573313/ /pubmed/28842611 http://dx.doi.org/10.1038/s41598-017-09810-0 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Šimčíková, Daniela
Kocková, Lucie
Vackářová, Kateřina
Těšínský, Miroslav
Heneberg, Petr
Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase
title Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase
title_full Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase
title_fullStr Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase
title_full_unstemmed Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase
title_short Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase
title_sort evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5573313/
https://www.ncbi.nlm.nih.gov/pubmed/28842611
http://dx.doi.org/10.1038/s41598-017-09810-0
work_keys_str_mv AT simcikovadaniela evidencebasedtailoringofbioinformaticsapproachestooptimizemethodsthatpredicttheeffectsofnonsynonymousaminoacidsubstitutionsinglucokinase
AT kockovalucie evidencebasedtailoringofbioinformaticsapproachestooptimizemethodsthatpredicttheeffectsofnonsynonymousaminoacidsubstitutionsinglucokinase
AT vackarovakaterina evidencebasedtailoringofbioinformaticsapproachestooptimizemethodsthatpredicttheeffectsofnonsynonymousaminoacidsubstitutionsinglucokinase
AT tesinskymiroslav evidencebasedtailoringofbioinformaticsapproachestooptimizemethodsthatpredicttheeffectsofnonsynonymousaminoacidsubstitutionsinglucokinase
AT henebergpetr evidencebasedtailoringofbioinformaticsapproachestooptimizemethodsthatpredicttheeffectsofnonsynonymousaminoacidsubstitutionsinglucokinase