Cargando…

Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants

BACKGROUND: Evaluating the impact of amino acid variants has been a critical challenge for studying protein function and interpreting genomic data. High-throughput experimental methods like deep mutational scanning (DMS) can measure the effect of large numbers of variants in a target protein, but be...

Descripción completa

Detalles Bibliográficos
Autores principales: Fu, Yunfan, Bedő, Justin, Papenfuss, Anthony T, Rubin, Alan F
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10506130/
https://www.ncbi.nlm.nih.gov/pubmed/37721410
http://dx.doi.org/10.1093/gigascience/giad073
_version_ 1785107057299423232
author Fu, Yunfan
Bedő, Justin
Papenfuss, Anthony T
Rubin, Alan F
author_facet Fu, Yunfan
Bedő, Justin
Papenfuss, Anthony T
Rubin, Alan F
author_sort Fu, Yunfan
collection PubMed
description BACKGROUND: Evaluating the impact of amino acid variants has been a critical challenge for studying protein function and interpreting genomic data. High-throughput experimental methods like deep mutational scanning (DMS) can measure the effect of large numbers of variants in a target protein, but because DMS studies have not been performed on all proteins, researchers also model DMS data computationally to estimate variant impacts by predictors. RESULTS: In this study, we extended a linear regression-based predictor to explore whether incorporating data from alanine scanning (AS), a widely used low-throughput mutagenesis method, would improve prediction results. To evaluate our model, we collected 146 AS datasets, mapping to 54 DMS datasets across 22 distinct proteins. CONCLUSIONS: We show that improved model performance depends on the compatibility of the DMS and AS assays, and the scale of improvement is closely related to the correlation between DMS and AS results.
format Online
Article
Text
id pubmed-10506130
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105061302023-09-19 Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants Fu, Yunfan Bedő, Justin Papenfuss, Anthony T Rubin, Alan F Gigascience Research BACKGROUND: Evaluating the impact of amino acid variants has been a critical challenge for studying protein function and interpreting genomic data. High-throughput experimental methods like deep mutational scanning (DMS) can measure the effect of large numbers of variants in a target protein, but because DMS studies have not been performed on all proteins, researchers also model DMS data computationally to estimate variant impacts by predictors. RESULTS: In this study, we extended a linear regression-based predictor to explore whether incorporating data from alanine scanning (AS), a widely used low-throughput mutagenesis method, would improve prediction results. To evaluate our model, we collected 146 AS datasets, mapping to 54 DMS datasets across 22 distinct proteins. CONCLUSIONS: We show that improved model performance depends on the compatibility of the DMS and AS assays, and the scale of improvement is closely related to the correlation between DMS and AS results. Oxford University Press 2023-09-18 /pmc/articles/PMC10506130/ /pubmed/37721410 http://dx.doi.org/10.1093/gigascience/giad073 Text en © The Author(s) 2023. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Fu, Yunfan
Bedő, Justin
Papenfuss, Anthony T
Rubin, Alan F
Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants
title Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants
title_full Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants
title_fullStr Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants
title_full_unstemmed Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants
title_short Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants
title_sort integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10506130/
https://www.ncbi.nlm.nih.gov/pubmed/37721410
http://dx.doi.org/10.1093/gigascience/giad073
work_keys_str_mv AT fuyunfan integratingdeepmutationalscanningandlowthroughputmutagenesisdatatopredicttheimpactofaminoacidvariants
AT bedojustin integratingdeepmutationalscanningandlowthroughputmutagenesisdatatopredicttheimpactofaminoacidvariants
AT papenfussanthonyt integratingdeepmutationalscanningandlowthroughputmutagenesisdatatopredicttheimpactofaminoacidvariants
AT rubinalanf integratingdeepmutationalscanningandlowthroughputmutagenesisdatatopredicttheimpactofaminoacidvariants