Cargando…

Prediction of sgRNA on-target activity in bacteria by deep learning

BACKGROUND: One of the main challenges for the CRISPR-Cas9 system is selecting optimal single-guide RNAs (sgRNAs). Recently, deep learning has enhanced sgRNA prediction in eukaryotes. However, the prokaryotic chromatin structure is different from eukaryotes, so models trained on eukaryotes may not a...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Lei, Zhang, Juhua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6814057/
https://www.ncbi.nlm.nih.gov/pubmed/31651233
http://dx.doi.org/10.1186/s12859-019-3151-4
_version_ 1783462949784911872
author Wang, Lei
Zhang, Juhua
author_facet Wang, Lei
Zhang, Juhua
author_sort Wang, Lei
collection PubMed
description BACKGROUND: One of the main challenges for the CRISPR-Cas9 system is selecting optimal single-guide RNAs (sgRNAs). Recently, deep learning has enhanced sgRNA prediction in eukaryotes. However, the prokaryotic chromatin structure is different from eukaryotes, so models trained on eukaryotes may not apply to prokaryotes. RESULTS: We designed and implemented a convolutional neural network to predict sgRNA activity in Escherichia coli. The network was trained and tested on the recently-released sgRNA activity dataset. Our convolutional neural network achieved excellent performance, yielding average Spearman correlation coefficients of 0.5817, 0.7105, and 0.3602, respectively for Cas9, eSpCas9 and Cas9 with a recA coding region deletion. We confirmed that the sgRNA prediction models trained on prokaryotes do not apply to eukaryotes and vice versa. We adopted perturbation-based approaches to analyze distinct biological patterns between prokaryotic and eukaryotic editing. Then, we improved the predictive performance of the prokaryotic Cas9 system by transfer learning. Finally, we determined that potential off-target scores accumulated on a genome-wide scale affect on-target activity, which could slightly improve on-target predictive performance. CONCLUSIONS: We developed convolutional neural networks to predict sgRNA activity for wild type and mutant Cas9 in prokaryotes. Our results show that the prediction accuracy of our method is improved over state-of-the-art models.
format Online
Article
Text
id pubmed-6814057
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68140572019-10-31 Prediction of sgRNA on-target activity in bacteria by deep learning Wang, Lei Zhang, Juhua BMC Bioinformatics Research Article BACKGROUND: One of the main challenges for the CRISPR-Cas9 system is selecting optimal single-guide RNAs (sgRNAs). Recently, deep learning has enhanced sgRNA prediction in eukaryotes. However, the prokaryotic chromatin structure is different from eukaryotes, so models trained on eukaryotes may not apply to prokaryotes. RESULTS: We designed and implemented a convolutional neural network to predict sgRNA activity in Escherichia coli. The network was trained and tested on the recently-released sgRNA activity dataset. Our convolutional neural network achieved excellent performance, yielding average Spearman correlation coefficients of 0.5817, 0.7105, and 0.3602, respectively for Cas9, eSpCas9 and Cas9 with a recA coding region deletion. We confirmed that the sgRNA prediction models trained on prokaryotes do not apply to eukaryotes and vice versa. We adopted perturbation-based approaches to analyze distinct biological patterns between prokaryotic and eukaryotic editing. Then, we improved the predictive performance of the prokaryotic Cas9 system by transfer learning. Finally, we determined that potential off-target scores accumulated on a genome-wide scale affect on-target activity, which could slightly improve on-target predictive performance. CONCLUSIONS: We developed convolutional neural networks to predict sgRNA activity for wild type and mutant Cas9 in prokaryotes. Our results show that the prediction accuracy of our method is improved over state-of-the-art models. BioMed Central 2019-10-24 /pmc/articles/PMC6814057/ /pubmed/31651233 http://dx.doi.org/10.1186/s12859-019-3151-4 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Wang, Lei
Zhang, Juhua
Prediction of sgRNA on-target activity in bacteria by deep learning
title Prediction of sgRNA on-target activity in bacteria by deep learning
title_full Prediction of sgRNA on-target activity in bacteria by deep learning
title_fullStr Prediction of sgRNA on-target activity in bacteria by deep learning
title_full_unstemmed Prediction of sgRNA on-target activity in bacteria by deep learning
title_short Prediction of sgRNA on-target activity in bacteria by deep learning
title_sort prediction of sgrna on-target activity in bacteria by deep learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6814057/
https://www.ncbi.nlm.nih.gov/pubmed/31651233
http://dx.doi.org/10.1186/s12859-019-3151-4
work_keys_str_mv AT wanglei predictionofsgrnaontargetactivityinbacteriabydeeplearning
AT zhangjuhua predictionofsgrnaontargetactivityinbacteriabydeeplearning