Cargando…
Prediction of sgRNA on-target activity in bacteria by deep learning
BACKGROUND: One of the main challenges for the CRISPR-Cas9 system is selecting optimal single-guide RNAs (sgRNAs). Recently, deep learning has enhanced sgRNA prediction in eukaryotes. However, the prokaryotic chromatin structure is different from eukaryotes, so models trained on eukaryotes may not a...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6814057/ https://www.ncbi.nlm.nih.gov/pubmed/31651233 http://dx.doi.org/10.1186/s12859-019-3151-4 |
_version_ | 1783462949784911872 |
---|---|
author | Wang, Lei Zhang, Juhua |
author_facet | Wang, Lei Zhang, Juhua |
author_sort | Wang, Lei |
collection | PubMed |
description | BACKGROUND: One of the main challenges for the CRISPR-Cas9 system is selecting optimal single-guide RNAs (sgRNAs). Recently, deep learning has enhanced sgRNA prediction in eukaryotes. However, the prokaryotic chromatin structure is different from eukaryotes, so models trained on eukaryotes may not apply to prokaryotes. RESULTS: We designed and implemented a convolutional neural network to predict sgRNA activity in Escherichia coli. The network was trained and tested on the recently-released sgRNA activity dataset. Our convolutional neural network achieved excellent performance, yielding average Spearman correlation coefficients of 0.5817, 0.7105, and 0.3602, respectively for Cas9, eSpCas9 and Cas9 with a recA coding region deletion. We confirmed that the sgRNA prediction models trained on prokaryotes do not apply to eukaryotes and vice versa. We adopted perturbation-based approaches to analyze distinct biological patterns between prokaryotic and eukaryotic editing. Then, we improved the predictive performance of the prokaryotic Cas9 system by transfer learning. Finally, we determined that potential off-target scores accumulated on a genome-wide scale affect on-target activity, which could slightly improve on-target predictive performance. CONCLUSIONS: We developed convolutional neural networks to predict sgRNA activity for wild type and mutant Cas9 in prokaryotes. Our results show that the prediction accuracy of our method is improved over state-of-the-art models. |
format | Online Article Text |
id | pubmed-6814057 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-68140572019-10-31 Prediction of sgRNA on-target activity in bacteria by deep learning Wang, Lei Zhang, Juhua BMC Bioinformatics Research Article BACKGROUND: One of the main challenges for the CRISPR-Cas9 system is selecting optimal single-guide RNAs (sgRNAs). Recently, deep learning has enhanced sgRNA prediction in eukaryotes. However, the prokaryotic chromatin structure is different from eukaryotes, so models trained on eukaryotes may not apply to prokaryotes. RESULTS: We designed and implemented a convolutional neural network to predict sgRNA activity in Escherichia coli. The network was trained and tested on the recently-released sgRNA activity dataset. Our convolutional neural network achieved excellent performance, yielding average Spearman correlation coefficients of 0.5817, 0.7105, and 0.3602, respectively for Cas9, eSpCas9 and Cas9 with a recA coding region deletion. We confirmed that the sgRNA prediction models trained on prokaryotes do not apply to eukaryotes and vice versa. We adopted perturbation-based approaches to analyze distinct biological patterns between prokaryotic and eukaryotic editing. Then, we improved the predictive performance of the prokaryotic Cas9 system by transfer learning. Finally, we determined that potential off-target scores accumulated on a genome-wide scale affect on-target activity, which could slightly improve on-target predictive performance. CONCLUSIONS: We developed convolutional neural networks to predict sgRNA activity for wild type and mutant Cas9 in prokaryotes. Our results show that the prediction accuracy of our method is improved over state-of-the-art models. BioMed Central 2019-10-24 /pmc/articles/PMC6814057/ /pubmed/31651233 http://dx.doi.org/10.1186/s12859-019-3151-4 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Wang, Lei Zhang, Juhua Prediction of sgRNA on-target activity in bacteria by deep learning |
title | Prediction of sgRNA on-target activity in bacteria by deep learning |
title_full | Prediction of sgRNA on-target activity in bacteria by deep learning |
title_fullStr | Prediction of sgRNA on-target activity in bacteria by deep learning |
title_full_unstemmed | Prediction of sgRNA on-target activity in bacteria by deep learning |
title_short | Prediction of sgRNA on-target activity in bacteria by deep learning |
title_sort | prediction of sgrna on-target activity in bacteria by deep learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6814057/ https://www.ncbi.nlm.nih.gov/pubmed/31651233 http://dx.doi.org/10.1186/s12859-019-3151-4 |
work_keys_str_mv | AT wanglei predictionofsgrnaontargetactivityinbacteriabydeeplearning AT zhangjuhua predictionofsgrnaontargetactivityinbacteriabydeeplearning |