Cargando…

Prediction of prime editing insertion efficiencies using sequence features and DNA repair determinants

Most short sequences can be precisely written into a selected genomic target using prime editing; however, it remains unclear what factors govern insertion. We design a library of 3,604 sequences of various lengths and measure the frequency of their insertion into four genomic sites in three human c...

Descripción completa

Detalles Bibliográficos
Autores principales: Koeppel, Jonas, Weller, Juliane, Peets, Elin Madli, Pallaseni, Ananth, Kuzmin, Ivan, Raudvere, Uku, Peterson, Hedi, Liberante, Fabio Giuseppe, Parts, Leopold
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10567557/
https://www.ncbi.nlm.nih.gov/pubmed/36797492
http://dx.doi.org/10.1038/s41587-023-01678-y
Descripción
Sumario:Most short sequences can be precisely written into a selected genomic target using prime editing; however, it remains unclear what factors govern insertion. We design a library of 3,604 sequences of various lengths and measure the frequency of their insertion into four genomic sites in three human cell lines, using different prime editor systems in varying DNA repair contexts. We find that length, nucleotide composition and secondary structure of the insertion sequence all affect insertion rates. We also discover that the 3′ flap nucleases TREX1 and TREX2 suppress the insertion of longer sequences. Combining the sequence and repair features into a machine learning model, we can predict relative frequency of insertions into a site with R = 0.70. Finally, we demonstrate how our accurate prediction and user-friendly software help choose codon variants of common fusion tags that insert at high efficiency, and provide a catalog of empirically determined insertion rates for over a hundred useful sequences.