Off-target predictions in CRISPR-Cas9 gene editing using deep learning

MOTIVATION: The prediction of off-target mutations in CRISPR-Cas9 is a hot topic due to its relevance to gene editing research. Existing prediction methods have been developed; however, most of them just calculated scores based on mismatches to the guide sequence in CRISPR-Cas9. Therefore, the exist...

Bibliographic Details
Main authors:	Lin, Jiecong; Wong, Ka-Chun
Format:	Online Article Text
Language:	English
Published:	Oxford University Press 2018
Subjects:	ECCB 2018: European Conference on Computational Biology Proceedings
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6129261/ https://www.ncbi.nlm.nih.gov/pubmed/30423072 http://dx.doi.org/10.1093/bioinformatics/bty554

_version_	1783353769787916288
author	Lin, Jiecong Wong, Ka-Chun
author_facet	Lin, Jiecong Wong, Ka-Chun
author_sort	Lin, Jiecong
collection	PubMed
description	MOTIVATION: The prediction of off-target mutations in CRISPR-Cas9 is a hot topic due to its relevance to gene editing research. Existing prediction methods have been developed; however, most of them just calculated scores based on mismatches to the guide sequence in CRISPR-Cas9. Therefore, the existing prediction methods are unable to scale and improve their performance with the rapid expansion of experimental data in CRISPR-Cas9. Moreover, the existing methods still cannot satisfy enough precision in off-target predictions for gene editing at the clinical level. RESULTS: To address this, we design and implement two algorithms using deep neural networks to predict off-target mutations in CRISPR-Cas9 gene editing (i.e. deep convolutional neural network and deep feedforward neural network). The models were trained and tested on the recently released off-target dataset, CRISPOR dataset, for performance benchmark. Another off-target dataset identified by GUIDE-seq was adopted for additional evaluation. We demonstrate that convolutional neural network achieves the best performance on CRISPOR dataset, yielding an average classification area under the ROC curve (AUC) of 97.2% under stratified 5-fold cross-validation. Interestingly, the deep feedforward neural network can also be competitive at the average AUC of 97.0% under the same setting. We compare the two deep neural network models with the state-of-the-art off-target prediction methods (i.e. CFD, MIT, CROP-IT, and CCTop) and three traditional machine learning models (i.e. random forest, gradient boosting trees, and logistic regression) on both datasets in terms of AUC values, demonstrating the competitive edges of the proposed algorithms. Additional analyses are conducted to investigate the underlying reasons from different perspectives. AVAILABILITY AND IMPLEMENTATION: The example code is available at https://github.com/MichaelLinn/off_target_prediction. The related datasets are available at https://github.com/MichaelLinn/off_target_prediction/tree/master/data.
format	Online Article Text
id	pubmed-6129261
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-6129261 2018-09-12 Off-target predictions in CRISPR-Cas9 gene editing using deep learning Lin, Jiecong Wong, Ka-Chun Bioinformatics Eccb 2018: European Conference on Computational Biology Proceedings Oxford University Press 2018-09-01 2018-09-08 /pmc/articles/PMC6129261/ /pubmed/30423072 http://dx.doi.org/10.1093/bioinformatics/bty554 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle	Eccb 2018: European Conference on Computational Biology Proceedings Lin, Jiecong Wong, Ka-Chun Off-target predictions in CRISPR-Cas9 gene editing using deep learning
title	Off-target predictions in CRISPR-Cas9 gene editing using deep learning
title_full	Off-target predictions in CRISPR-Cas9 gene editing using deep learning
title_fullStr	Off-target predictions in CRISPR-Cas9 gene editing using deep learning
title_full_unstemmed	Off-target predictions in CRISPR-Cas9 gene editing using deep learning
title_short	Off-target predictions in CRISPR-Cas9 gene editing using deep learning
title_sort	off-target predictions in crispr-cas9 gene editing using deep learning
topic	Eccb 2018: European Conference on Computational Biology Proceedings
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6129261/ https://www.ncbi.nlm.nih.gov/pubmed/30423072 http://dx.doi.org/10.1093/bioinformatics/bty554
work_keys_str_mv	AT linjiecong offtargetpredictionsincrisprcas9geneeditingusingdeeplearning AT wongkachun offtargetpredictionsincrisprcas9geneeditingusingdeeplearning
