Cargando…

Deep2Full: Evaluating strategies for selecting the minimal mutational experiments for optimal computational predictions of deep mutational scan outcomes

Performing a complete deep mutational scan with all single point mutations may not be practical, and may not even be required, especially if predictive computational models can be developed. Computational models are however naive to cellular response in the myriads of assay-conditions. In a realisti...

Descripción completa

Detalles Bibliográficos
Autores principales: Sruthi, C. K., Prakash, Meher
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6954071/
https://www.ncbi.nlm.nih.gov/pubmed/31923916
http://dx.doi.org/10.1371/journal.pone.0227621
_version_ 1783486731307188224
author Sruthi, C. K.
Prakash, Meher
author_facet Sruthi, C. K.
Prakash, Meher
author_sort Sruthi, C. K.
collection PubMed
description Performing a complete deep mutational scan with all single point mutations may not be practical, and may not even be required, especially if predictive computational models can be developed. Computational models are however naive to cellular response in the myriads of assay-conditions. In a realistic paradigm of assay context-aware predictive hybrid models that combine minimal experimental data from deep mutational scans with structure, sequence information and computational models, we define and evaluate different strategies for choosing this minimal set. We evaluated the trivial strategy of a systematic reduction in the number of mutational studies from 85% to 15%, along with several others about the choice of the types of mutations such as random versus site-directed with the same 15% data completeness. Interestingly, the predictive capabilities by training on a random set of mutations and using a systematic substitution of all amino acids to alanine, asparagine and histidine (ANH) were comparable. Another strategy we explored, augmenting the training data with measurements of the same mutants at multiple assay conditions, did not improve the prediction quality. For the six proteins we analyzed, the bin-wise error in prediction is optimal when 50-100 mutations per bin are used in training the computational model, suggesting that good prediction quality may be achieved with a library of 500-1000 mutations.
format Online
Article
Text
id pubmed-6954071
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-69540712020-01-21 Deep2Full: Evaluating strategies for selecting the minimal mutational experiments for optimal computational predictions of deep mutational scan outcomes Sruthi, C. K. Prakash, Meher PLoS One Research Article Performing a complete deep mutational scan with all single point mutations may not be practical, and may not even be required, especially if predictive computational models can be developed. Computational models are however naive to cellular response in the myriads of assay-conditions. In a realistic paradigm of assay context-aware predictive hybrid models that combine minimal experimental data from deep mutational scans with structure, sequence information and computational models, we define and evaluate different strategies for choosing this minimal set. We evaluated the trivial strategy of a systematic reduction in the number of mutational studies from 85% to 15%, along with several others about the choice of the types of mutations such as random versus site-directed with the same 15% data completeness. Interestingly, the predictive capabilities by training on a random set of mutations and using a systematic substitution of all amino acids to alanine, asparagine and histidine (ANH) were comparable. Another strategy we explored, augmenting the training data with measurements of the same mutants at multiple assay conditions, did not improve the prediction quality. For the six proteins we analyzed, the bin-wise error in prediction is optimal when 50-100 mutations per bin are used in training the computational model, suggesting that good prediction quality may be achieved with a library of 500-1000 mutations. Public Library of Science 2020-01-10 /pmc/articles/PMC6954071/ /pubmed/31923916 http://dx.doi.org/10.1371/journal.pone.0227621 Text en © 2020 Sruthi, Prakash http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Sruthi, C. K.
Prakash, Meher
Deep2Full: Evaluating strategies for selecting the minimal mutational experiments for optimal computational predictions of deep mutational scan outcomes
title Deep2Full: Evaluating strategies for selecting the minimal mutational experiments for optimal computational predictions of deep mutational scan outcomes
title_full Deep2Full: Evaluating strategies for selecting the minimal mutational experiments for optimal computational predictions of deep mutational scan outcomes
title_fullStr Deep2Full: Evaluating strategies for selecting the minimal mutational experiments for optimal computational predictions of deep mutational scan outcomes
title_full_unstemmed Deep2Full: Evaluating strategies for selecting the minimal mutational experiments for optimal computational predictions of deep mutational scan outcomes
title_short Deep2Full: Evaluating strategies for selecting the minimal mutational experiments for optimal computational predictions of deep mutational scan outcomes
title_sort deep2full: evaluating strategies for selecting the minimal mutational experiments for optimal computational predictions of deep mutational scan outcomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6954071/
https://www.ncbi.nlm.nih.gov/pubmed/31923916
http://dx.doi.org/10.1371/journal.pone.0227621
work_keys_str_mv AT sruthick deep2fullevaluatingstrategiesforselectingtheminimalmutationalexperimentsforoptimalcomputationalpredictionsofdeepmutationalscanoutcomes
AT prakashmeher deep2fullevaluatingstrategiesforselectingtheminimalmutationalexperimentsforoptimalcomputationalpredictionsofdeepmutationalscanoutcomes