Cargando…

Optimization of C-to-G base editors with sequence context preference predictable by machine learning methods

Efficient and precise base editors (BEs) for C-to-G transversion are highly desirable. However, the sequence context affecting editing outcome largely remains unclear. Here we report engineered C-to-G BEs of high efficiency and fidelity, with the sequence context predictable via machine-learning met...

Descripción completa

Detalles Bibliográficos
Autores principales: Yuan, Tanglong, Yan, Nana, Fei, Tianyi, Zheng, Jitan, Meng, Juan, Li, Nana, Liu, Jing, Zhang, Haihang, Xie, Long, Ying, Wenqin, Li, Di, Shi, Lei, Sun, Yongsen, Li, Yongyao, Li, Yixue, Sun, Yidi, Zuo, Erwei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8361092/
https://www.ncbi.nlm.nih.gov/pubmed/34385461
http://dx.doi.org/10.1038/s41467-021-25217-y
_version_ 1783737886125850624
author Yuan, Tanglong
Yan, Nana
Fei, Tianyi
Zheng, Jitan
Meng, Juan
Li, Nana
Liu, Jing
Zhang, Haihang
Xie, Long
Ying, Wenqin
Li, Di
Shi, Lei
Sun, Yongsen
Li, Yongyao
Li, Yixue
Sun, Yidi
Zuo, Erwei
author_facet Yuan, Tanglong
Yan, Nana
Fei, Tianyi
Zheng, Jitan
Meng, Juan
Li, Nana
Liu, Jing
Zhang, Haihang
Xie, Long
Ying, Wenqin
Li, Di
Shi, Lei
Sun, Yongsen
Li, Yongyao
Li, Yixue
Sun, Yidi
Zuo, Erwei
author_sort Yuan, Tanglong
collection PubMed
description Efficient and precise base editors (BEs) for C-to-G transversion are highly desirable. However, the sequence context affecting editing outcome largely remains unclear. Here we report engineered C-to-G BEs of high efficiency and fidelity, with the sequence context predictable via machine-learning methods. By changing the species origin and relative position of uracil-DNA glycosylase and deaminase, together with codon optimization, we obtain optimized C-to-G BEs (OPTI-CGBEs) for efficient C-to-G transversion. The motif preference of OPTI-CGBEs for editing 100 endogenous sites is determined in HEK293T cells. Using a sgRNA library comprising 41,388 sequences, we develop a deep-learning model that accurately predicts the OPTI-CGBE editing outcome for targeted sites with specific sequence context. These OPTI-CGBEs are further shown to be capable of efficient base editing in mouse embryos for generating Tyr-edited offspring. Thus, these engineered CGBEs are useful for efficient and precise base editing, with outcome predictable based on sequence context of targeted sites.
format Online
Article
Text
id pubmed-8361092
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-83610922021-08-19 Optimization of C-to-G base editors with sequence context preference predictable by machine learning methods Yuan, Tanglong Yan, Nana Fei, Tianyi Zheng, Jitan Meng, Juan Li, Nana Liu, Jing Zhang, Haihang Xie, Long Ying, Wenqin Li, Di Shi, Lei Sun, Yongsen Li, Yongyao Li, Yixue Sun, Yidi Zuo, Erwei Nat Commun Article Efficient and precise base editors (BEs) for C-to-G transversion are highly desirable. However, the sequence context affecting editing outcome largely remains unclear. Here we report engineered C-to-G BEs of high efficiency and fidelity, with the sequence context predictable via machine-learning methods. By changing the species origin and relative position of uracil-DNA glycosylase and deaminase, together with codon optimization, we obtain optimized C-to-G BEs (OPTI-CGBEs) for efficient C-to-G transversion. The motif preference of OPTI-CGBEs for editing 100 endogenous sites is determined in HEK293T cells. Using a sgRNA library comprising 41,388 sequences, we develop a deep-learning model that accurately predicts the OPTI-CGBE editing outcome for targeted sites with specific sequence context. These OPTI-CGBEs are further shown to be capable of efficient base editing in mouse embryos for generating Tyr-edited offspring. Thus, these engineered CGBEs are useful for efficient and precise base editing, with outcome predictable based on sequence context of targeted sites. Nature Publishing Group UK 2021-08-12 /pmc/articles/PMC8361092/ /pubmed/34385461 http://dx.doi.org/10.1038/s41467-021-25217-y Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Yuan, Tanglong
Yan, Nana
Fei, Tianyi
Zheng, Jitan
Meng, Juan
Li, Nana
Liu, Jing
Zhang, Haihang
Xie, Long
Ying, Wenqin
Li, Di
Shi, Lei
Sun, Yongsen
Li, Yongyao
Li, Yixue
Sun, Yidi
Zuo, Erwei
Optimization of C-to-G base editors with sequence context preference predictable by machine learning methods
title Optimization of C-to-G base editors with sequence context preference predictable by machine learning methods
title_full Optimization of C-to-G base editors with sequence context preference predictable by machine learning methods
title_fullStr Optimization of C-to-G base editors with sequence context preference predictable by machine learning methods
title_full_unstemmed Optimization of C-to-G base editors with sequence context preference predictable by machine learning methods
title_short Optimization of C-to-G base editors with sequence context preference predictable by machine learning methods
title_sort optimization of c-to-g base editors with sequence context preference predictable by machine learning methods
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8361092/
https://www.ncbi.nlm.nih.gov/pubmed/34385461
http://dx.doi.org/10.1038/s41467-021-25217-y
work_keys_str_mv AT yuantanglong optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT yannana optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT feitianyi optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT zhengjitan optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT mengjuan optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT linana optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT liujing optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT zhanghaihang optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT xielong optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT yingwenqin optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT lidi optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT shilei optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT sunyongsen optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT liyongyao optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT liyixue optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT sunyidi optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods
AT zuoerwei optimizationofctogbaseeditorswithsequencecontextpreferencepredictablebymachinelearningmethods