Cargando…
Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase
Computational methods that allow predicting the effects of nonsynonymous substitutions are an integral part of exome studies. Here, we validated and improved their specificity by performing a comprehensive bioinformatics analysis combined with experimental and clinical data on a model of glucokinase...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5573313/ https://www.ncbi.nlm.nih.gov/pubmed/28842611 http://dx.doi.org/10.1038/s41598-017-09810-0 |
_version_ | 1783259634057871360 |
---|---|
author | Šimčíková, Daniela Kocková, Lucie Vackářová, Kateřina Těšínský, Miroslav Heneberg, Petr |
author_facet | Šimčíková, Daniela Kocková, Lucie Vackářová, Kateřina Těšínský, Miroslav Heneberg, Petr |
author_sort | Šimčíková, Daniela |
collection | PubMed |
description | Computational methods that allow predicting the effects of nonsynonymous substitutions are an integral part of exome studies. Here, we validated and improved their specificity by performing a comprehensive bioinformatics analysis combined with experimental and clinical data on a model of glucokinase (GCK): 8835 putative variations, including 515 disease-associated variations from 1596 families with diagnoses of monogenic diabetes (GCK-MODY) or persistent hyperinsulinemic hypoglycemia of infancy (PHHI), and 126 variations with available or newly reported (19 variations) data on enzyme kinetics. We also proved that high frequency of disease-associated variations found in patients is closely related to their evolutionary conservation. The default set prediction methods predicted correctly the effects of only a part of the GCK-MODY-associated variations and completely failed to predict the normoglycemic or PHHI-associated variations. Therefore, we calculated evidence-based thresholds that improved significantly the specificity of predictions (≤75%). The combined prediction analysis even allowed to distinguish activating from inactivating variations and identified a group of putatively highly pathogenic variations (EVmutation score <−7.5 and SNAP2 score >70), which were surprisingly underrepresented among MODY patients and thus under negative selection during molecular evolution. We suggested and validated the first robust evidence-based thresholds, which allow improved, highly specific predictions of disease-associated GCK variations. |
format | Online Article Text |
id | pubmed-5573313 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-55733132017-09-01 Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase Šimčíková, Daniela Kocková, Lucie Vackářová, Kateřina Těšínský, Miroslav Heneberg, Petr Sci Rep Article Computational methods that allow predicting the effects of nonsynonymous substitutions are an integral part of exome studies. Here, we validated and improved their specificity by performing a comprehensive bioinformatics analysis combined with experimental and clinical data on a model of glucokinase (GCK): 8835 putative variations, including 515 disease-associated variations from 1596 families with diagnoses of monogenic diabetes (GCK-MODY) or persistent hyperinsulinemic hypoglycemia of infancy (PHHI), and 126 variations with available or newly reported (19 variations) data on enzyme kinetics. We also proved that high frequency of disease-associated variations found in patients is closely related to their evolutionary conservation. The default set prediction methods predicted correctly the effects of only a part of the GCK-MODY-associated variations and completely failed to predict the normoglycemic or PHHI-associated variations. Therefore, we calculated evidence-based thresholds that improved significantly the specificity of predictions (≤75%). The combined prediction analysis even allowed to distinguish activating from inactivating variations and identified a group of putatively highly pathogenic variations (EVmutation score <−7.5 and SNAP2 score >70), which were surprisingly underrepresented among MODY patients and thus under negative selection during molecular evolution. We suggested and validated the first robust evidence-based thresholds, which allow improved, highly specific predictions of disease-associated GCK variations. Nature Publishing Group UK 2017-08-25 /pmc/articles/PMC5573313/ /pubmed/28842611 http://dx.doi.org/10.1038/s41598-017-09810-0 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Šimčíková, Daniela Kocková, Lucie Vackářová, Kateřina Těšínský, Miroslav Heneberg, Petr Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase |
title | Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase |
title_full | Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase |
title_fullStr | Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase |
title_full_unstemmed | Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase |
title_short | Evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase |
title_sort | evidence-based tailoring of bioinformatics approaches to optimize methods that predict the effects of nonsynonymous amino acid substitutions in glucokinase |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5573313/ https://www.ncbi.nlm.nih.gov/pubmed/28842611 http://dx.doi.org/10.1038/s41598-017-09810-0 |
work_keys_str_mv | AT simcikovadaniela evidencebasedtailoringofbioinformaticsapproachestooptimizemethodsthatpredicttheeffectsofnonsynonymousaminoacidsubstitutionsinglucokinase AT kockovalucie evidencebasedtailoringofbioinformaticsapproachestooptimizemethodsthatpredicttheeffectsofnonsynonymousaminoacidsubstitutionsinglucokinase AT vackarovakaterina evidencebasedtailoringofbioinformaticsapproachestooptimizemethodsthatpredicttheeffectsofnonsynonymousaminoacidsubstitutionsinglucokinase AT tesinskymiroslav evidencebasedtailoringofbioinformaticsapproachestooptimizemethodsthatpredicttheeffectsofnonsynonymousaminoacidsubstitutionsinglucokinase AT henebergpetr evidencebasedtailoringofbioinformaticsapproachestooptimizemethodsthatpredicttheeffectsofnonsynonymousaminoacidsubstitutionsinglucokinase |