Cargando…

Prediction of prime editing insertion efficiencies using sequence features and DNA repair determinants

Most short sequences can be precisely written into a selected genomic target using prime editing; however, it remains unclear what factors govern insertion. We design a library of 3,604 sequences of various lengths and measure the frequency of their insertion into four genomic sites in three human c...

Descripción completa

Detalles Bibliográficos
Autores principales: Koeppel, Jonas, Weller, Juliane, Peets, Elin Madli, Pallaseni, Ananth, Kuzmin, Ivan, Raudvere, Uku, Peterson, Hedi, Liberante, Fabio Giuseppe, Parts, Leopold
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10567557/
https://www.ncbi.nlm.nih.gov/pubmed/36797492
http://dx.doi.org/10.1038/s41587-023-01678-y
_version_ 1785119155038453760
author Koeppel, Jonas
Weller, Juliane
Peets, Elin Madli
Pallaseni, Ananth
Kuzmin, Ivan
Raudvere, Uku
Peterson, Hedi
Liberante, Fabio Giuseppe
Parts, Leopold
author_facet Koeppel, Jonas
Weller, Juliane
Peets, Elin Madli
Pallaseni, Ananth
Kuzmin, Ivan
Raudvere, Uku
Peterson, Hedi
Liberante, Fabio Giuseppe
Parts, Leopold
author_sort Koeppel, Jonas
collection PubMed
description Most short sequences can be precisely written into a selected genomic target using prime editing; however, it remains unclear what factors govern insertion. We design a library of 3,604 sequences of various lengths and measure the frequency of their insertion into four genomic sites in three human cell lines, using different prime editor systems in varying DNA repair contexts. We find that length, nucleotide composition and secondary structure of the insertion sequence all affect insertion rates. We also discover that the 3′ flap nucleases TREX1 and TREX2 suppress the insertion of longer sequences. Combining the sequence and repair features into a machine learning model, we can predict relative frequency of insertions into a site with R = 0.70. Finally, we demonstrate how our accurate prediction and user-friendly software help choose codon variants of common fusion tags that insert at high efficiency, and provide a catalog of empirically determined insertion rates for over a hundred useful sequences.
format Online
Article
Text
id pubmed-10567557
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-105675572023-10-13 Prediction of prime editing insertion efficiencies using sequence features and DNA repair determinants Koeppel, Jonas Weller, Juliane Peets, Elin Madli Pallaseni, Ananth Kuzmin, Ivan Raudvere, Uku Peterson, Hedi Liberante, Fabio Giuseppe Parts, Leopold Nat Biotechnol Article Most short sequences can be precisely written into a selected genomic target using prime editing; however, it remains unclear what factors govern insertion. We design a library of 3,604 sequences of various lengths and measure the frequency of their insertion into four genomic sites in three human cell lines, using different prime editor systems in varying DNA repair contexts. We find that length, nucleotide composition and secondary structure of the insertion sequence all affect insertion rates. We also discover that the 3′ flap nucleases TREX1 and TREX2 suppress the insertion of longer sequences. Combining the sequence and repair features into a machine learning model, we can predict relative frequency of insertions into a site with R = 0.70. Finally, we demonstrate how our accurate prediction and user-friendly software help choose codon variants of common fusion tags that insert at high efficiency, and provide a catalog of empirically determined insertion rates for over a hundred useful sequences. Nature Publishing Group US 2023-02-16 2023 /pmc/articles/PMC10567557/ /pubmed/36797492 http://dx.doi.org/10.1038/s41587-023-01678-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Koeppel, Jonas
Weller, Juliane
Peets, Elin Madli
Pallaseni, Ananth
Kuzmin, Ivan
Raudvere, Uku
Peterson, Hedi
Liberante, Fabio Giuseppe
Parts, Leopold
Prediction of prime editing insertion efficiencies using sequence features and DNA repair determinants
title Prediction of prime editing insertion efficiencies using sequence features and DNA repair determinants
title_full Prediction of prime editing insertion efficiencies using sequence features and DNA repair determinants
title_fullStr Prediction of prime editing insertion efficiencies using sequence features and DNA repair determinants
title_full_unstemmed Prediction of prime editing insertion efficiencies using sequence features and DNA repair determinants
title_short Prediction of prime editing insertion efficiencies using sequence features and DNA repair determinants
title_sort prediction of prime editing insertion efficiencies using sequence features and dna repair determinants
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10567557/
https://www.ncbi.nlm.nih.gov/pubmed/36797492
http://dx.doi.org/10.1038/s41587-023-01678-y
work_keys_str_mv AT koeppeljonas predictionofprimeeditinginsertionefficienciesusingsequencefeaturesanddnarepairdeterminants
AT wellerjuliane predictionofprimeeditinginsertionefficienciesusingsequencefeaturesanddnarepairdeterminants
AT peetselinmadli predictionofprimeeditinginsertionefficienciesusingsequencefeaturesanddnarepairdeterminants
AT pallaseniananth predictionofprimeeditinginsertionefficienciesusingsequencefeaturesanddnarepairdeterminants
AT kuzminivan predictionofprimeeditinginsertionefficienciesusingsequencefeaturesanddnarepairdeterminants
AT raudvereuku predictionofprimeeditinginsertionefficienciesusingsequencefeaturesanddnarepairdeterminants
AT petersonhedi predictionofprimeeditinginsertionefficienciesusingsequencefeaturesanddnarepairdeterminants
AT liberantefabiogiuseppe predictionofprimeeditinginsertionefficienciesusingsequencefeaturesanddnarepairdeterminants
AT partsleopold predictionofprimeeditinginsertionefficienciesusingsequencefeaturesanddnarepairdeterminants