Cargando…

Higher-order epistasis and phenotypic prediction

Contemporary high-throughput mutagenesis experiments are providing an increasingly detailed view of the complex patterns of genetic interaction that occur between multiple mutations within a single protein or regulatory element. By simultaneously measuring the effects of thousands of combinations of...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Juannan, Wong, Mandy S., Chen, Wei-Chia, Krainer, Adrian R., Kinney, Justin B., McCandlish, David M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9522415/
https://www.ncbi.nlm.nih.gov/pubmed/36129941
http://dx.doi.org/10.1073/pnas.2204233119
_version_ 1784800061315612672
author Zhou, Juannan
Wong, Mandy S.
Chen, Wei-Chia
Krainer, Adrian R.
Kinney, Justin B.
McCandlish, David M.
author_facet Zhou, Juannan
Wong, Mandy S.
Chen, Wei-Chia
Krainer, Adrian R.
Kinney, Justin B.
McCandlish, David M.
author_sort Zhou, Juannan
collection PubMed
description Contemporary high-throughput mutagenesis experiments are providing an increasingly detailed view of the complex patterns of genetic interaction that occur between multiple mutations within a single protein or regulatory element. By simultaneously measuring the effects of thousands of combinations of mutations, these experiments have revealed that the genotype–phenotype relationship typically reflects not only genetic interactions between pairs of sites but also higher-order interactions among larger numbers of sites. However, modeling and understanding these higher-order interactions remains challenging. Here we present a method for reconstructing sequence-to-function mappings from partially observed data that can accommodate all orders of genetic interaction. The main idea is to make predictions for unobserved genotypes that match the type and extent of epistasis found in the observed data. This information on the type and extent of epistasis can be extracted by considering how phenotypic correlations change as a function of mutational distance, which is equivalent to estimating the fraction of phenotypic variance due to each order of genetic interaction (additive, pairwise, three-way, etc.). Using these estimated variance components, we then define an empirical Bayes prior that in expectation matches the observed pattern of epistasis and reconstruct the genotype–phenotype mapping by conducting Gaussian process regression under this prior. To demonstrate the power of this approach, we present an application to the antibody-binding domain GB1 and also provide a detailed exploration of a dataset consisting of high-throughput measurements for the splicing efficiency of human pre-mRNA [Formula: see text] splice sites, for which we also validate our model predictions via additional low-throughput experiments.
format Online
Article
Text
id pubmed-9522415
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-95224152022-09-30 Higher-order epistasis and phenotypic prediction Zhou, Juannan Wong, Mandy S. Chen, Wei-Chia Krainer, Adrian R. Kinney, Justin B. McCandlish, David M. Proc Natl Acad Sci U S A Biological Sciences Contemporary high-throughput mutagenesis experiments are providing an increasingly detailed view of the complex patterns of genetic interaction that occur between multiple mutations within a single protein or regulatory element. By simultaneously measuring the effects of thousands of combinations of mutations, these experiments have revealed that the genotype–phenotype relationship typically reflects not only genetic interactions between pairs of sites but also higher-order interactions among larger numbers of sites. However, modeling and understanding these higher-order interactions remains challenging. Here we present a method for reconstructing sequence-to-function mappings from partially observed data that can accommodate all orders of genetic interaction. The main idea is to make predictions for unobserved genotypes that match the type and extent of epistasis found in the observed data. This information on the type and extent of epistasis can be extracted by considering how phenotypic correlations change as a function of mutational distance, which is equivalent to estimating the fraction of phenotypic variance due to each order of genetic interaction (additive, pairwise, three-way, etc.). Using these estimated variance components, we then define an empirical Bayes prior that in expectation matches the observed pattern of epistasis and reconstruct the genotype–phenotype mapping by conducting Gaussian process regression under this prior. To demonstrate the power of this approach, we present an application to the antibody-binding domain GB1 and also provide a detailed exploration of a dataset consisting of high-throughput measurements for the splicing efficiency of human pre-mRNA [Formula: see text] splice sites, for which we also validate our model predictions via additional low-throughput experiments. National Academy of Sciences 2022-09-21 2022-09-27 /pmc/articles/PMC9522415/ /pubmed/36129941 http://dx.doi.org/10.1073/pnas.2204233119 Text en Copyright © 2022 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Biological Sciences
Zhou, Juannan
Wong, Mandy S.
Chen, Wei-Chia
Krainer, Adrian R.
Kinney, Justin B.
McCandlish, David M.
Higher-order epistasis and phenotypic prediction
title Higher-order epistasis and phenotypic prediction
title_full Higher-order epistasis and phenotypic prediction
title_fullStr Higher-order epistasis and phenotypic prediction
title_full_unstemmed Higher-order epistasis and phenotypic prediction
title_short Higher-order epistasis and phenotypic prediction
title_sort higher-order epistasis and phenotypic prediction
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9522415/
https://www.ncbi.nlm.nih.gov/pubmed/36129941
http://dx.doi.org/10.1073/pnas.2204233119
work_keys_str_mv AT zhoujuannan higherorderepistasisandphenotypicprediction
AT wongmandys higherorderepistasisandphenotypicprediction
AT chenweichia higherorderepistasisandphenotypicprediction
AT kraineradrianr higherorderepistasisandphenotypicprediction
AT kinneyjustinb higherorderepistasisandphenotypicprediction
AT mccandlishdavidm higherorderepistasisandphenotypicprediction