Cargando…

Discovering pathways through ribozyme fitness landscapes using information theoretic quantification of epistasis

The identification of catalytic RNAs is typically achieved through primarily experimental means. However, only a small fraction of sequence space can be analyzed even with high-throughput techniques. Methods to extrapolate from a limited data set to predict additional ribozyme sequences, particularl...

Descripción completa

Detalles Bibliográficos
Autores principales: Charest, Nathaniel, Shen, Yuning, Lai, Yei-Chen, Chen, Irene A., Shea, Joan-Emma
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10578471/
https://www.ncbi.nlm.nih.gov/pubmed/37580126
http://dx.doi.org/10.1261/rna.079541.122
_version_ 1785121525370716160
author Charest, Nathaniel
Shen, Yuning
Lai, Yei-Chen
Chen, Irene A.
Shea, Joan-Emma
author_facet Charest, Nathaniel
Shen, Yuning
Lai, Yei-Chen
Chen, Irene A.
Shea, Joan-Emma
author_sort Charest, Nathaniel
collection PubMed
description The identification of catalytic RNAs is typically achieved through primarily experimental means. However, only a small fraction of sequence space can be analyzed even with high-throughput techniques. Methods to extrapolate from a limited data set to predict additional ribozyme sequences, particularly in a human-interpretable fashion, could be useful both for designing new functional RNAs and for generating greater understanding about a ribozyme fitness landscape. Using information theory, we express the effects of epistasis (i.e., deviations from additivity) on a ribozyme. This representation was incorporated into a simple model of the epistatic fitness landscape, which identified potentially exploitable combinations of mutations. We used this model to theoretically predict mutants of high activity for a self-aminoacylating ribozyme, identifying potentially active triple and quadruple mutants beyond the experimental data set of single and double mutants. The predictions were validated experimentally, with nine out of nine sequences being accurately predicted to have high activity. This set of sequences included mutants that form a previously unknown evolutionary “bridge” between two ribozyme families that share a common motif. Individual steps in the method could be examined, understood, and guided by a human, combining interpretability and performance in a simple model to predict ribozyme sequences by extrapolation.
format Online
Article
Text
id pubmed-10578471
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-105784712023-11-01 Discovering pathways through ribozyme fitness landscapes using information theoretic quantification of epistasis Charest, Nathaniel Shen, Yuning Lai, Yei-Chen Chen, Irene A. Shea, Joan-Emma RNA Bioinformatics The identification of catalytic RNAs is typically achieved through primarily experimental means. However, only a small fraction of sequence space can be analyzed even with high-throughput techniques. Methods to extrapolate from a limited data set to predict additional ribozyme sequences, particularly in a human-interpretable fashion, could be useful both for designing new functional RNAs and for generating greater understanding about a ribozyme fitness landscape. Using information theory, we express the effects of epistasis (i.e., deviations from additivity) on a ribozyme. This representation was incorporated into a simple model of the epistatic fitness landscape, which identified potentially exploitable combinations of mutations. We used this model to theoretically predict mutants of high activity for a self-aminoacylating ribozyme, identifying potentially active triple and quadruple mutants beyond the experimental data set of single and double mutants. The predictions were validated experimentally, with nine out of nine sequences being accurately predicted to have high activity. This set of sequences included mutants that form a previously unknown evolutionary “bridge” between two ribozyme families that share a common motif. Individual steps in the method could be examined, understood, and guided by a human, combining interpretability and performance in a simple model to predict ribozyme sequences by extrapolation. Cold Spring Harbor Laboratory Press 2023-11 /pmc/articles/PMC10578471/ /pubmed/37580126 http://dx.doi.org/10.1261/rna.079541.122 Text en © 2023 Charest et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society https://creativecommons.org/licenses/by-nc/4.0/This article, published in RNA, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Bioinformatics
Charest, Nathaniel
Shen, Yuning
Lai, Yei-Chen
Chen, Irene A.
Shea, Joan-Emma
Discovering pathways through ribozyme fitness landscapes using information theoretic quantification of epistasis
title Discovering pathways through ribozyme fitness landscapes using information theoretic quantification of epistasis
title_full Discovering pathways through ribozyme fitness landscapes using information theoretic quantification of epistasis
title_fullStr Discovering pathways through ribozyme fitness landscapes using information theoretic quantification of epistasis
title_full_unstemmed Discovering pathways through ribozyme fitness landscapes using information theoretic quantification of epistasis
title_short Discovering pathways through ribozyme fitness landscapes using information theoretic quantification of epistasis
title_sort discovering pathways through ribozyme fitness landscapes using information theoretic quantification of epistasis
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10578471/
https://www.ncbi.nlm.nih.gov/pubmed/37580126
http://dx.doi.org/10.1261/rna.079541.122
work_keys_str_mv AT charestnathaniel discoveringpathwaysthroughribozymefitnesslandscapesusinginformationtheoreticquantificationofepistasis
AT shenyuning discoveringpathwaysthroughribozymefitnesslandscapesusinginformationtheoreticquantificationofepistasis
AT laiyeichen discoveringpathwaysthroughribozymefitnesslandscapesusinginformationtheoreticquantificationofepistasis
AT chenirenea discoveringpathwaysthroughribozymefitnesslandscapesusinginformationtheoreticquantificationofepistasis
AT sheajoanemma discoveringpathwaysthroughribozymefitnesslandscapesusinginformationtheoreticquantificationofepistasis