Cargando…

A Novel Hybrid CNN-SVR for CRISPR/Cas9 Guide RNA Activity Prediction

Accurate prediction of guide RNA (gRNA) on-target efficacy is critical for effective application of CRISPR/Cas9 system. Although some machine learning-based and convolutional neural network (CNN)-based methods have been proposed, prediction accuracy remains to be improved. Here, firstly we improved...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Guishan, Dai, Zhiming, Dai, Xianhua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6960259/
https://www.ncbi.nlm.nih.gov/pubmed/31969902
http://dx.doi.org/10.3389/fgene.2019.01303
_version_ 1783487755265769472
author Zhang, Guishan
Dai, Zhiming
Dai, Xianhua
author_facet Zhang, Guishan
Dai, Zhiming
Dai, Xianhua
author_sort Zhang, Guishan
collection PubMed
description Accurate prediction of guide RNA (gRNA) on-target efficacy is critical for effective application of CRISPR/Cas9 system. Although some machine learning-based and convolutional neural network (CNN)-based methods have been proposed, prediction accuracy remains to be improved. Here, firstly we improved architectures of current CNNs for predicting gRNA on-target efficacy. Secondly, we proposed a novel hybrid system which combines our improved CNN with support vector regression (SVR). This CNN-SVR system is composed of two major components: a merged CNN as the front-end for extracting gRNA feature and an SVR as the back-end for regression and predicting gRNA cleavage efficiency. We demonstrate that CNN-SVR can effectively exploit features interactions from feed-forward directions to learn deeper features of gRNAs and their corresponding epigenetic features. Experiments on commonly used datasets show that our CNN-SVR system outperforms available state-of-the-art methods in terms of prediction accuracy, generalization, and robustness. Source codes are available at https://github.com/Peppags/CNN-SVR.
format Online
Article
Text
id pubmed-6960259
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-69602592020-01-22 A Novel Hybrid CNN-SVR for CRISPR/Cas9 Guide RNA Activity Prediction Zhang, Guishan Dai, Zhiming Dai, Xianhua Front Genet Genetics Accurate prediction of guide RNA (gRNA) on-target efficacy is critical for effective application of CRISPR/Cas9 system. Although some machine learning-based and convolutional neural network (CNN)-based methods have been proposed, prediction accuracy remains to be improved. Here, firstly we improved architectures of current CNNs for predicting gRNA on-target efficacy. Secondly, we proposed a novel hybrid system which combines our improved CNN with support vector regression (SVR). This CNN-SVR system is composed of two major components: a merged CNN as the front-end for extracting gRNA feature and an SVR as the back-end for regression and predicting gRNA cleavage efficiency. We demonstrate that CNN-SVR can effectively exploit features interactions from feed-forward directions to learn deeper features of gRNAs and their corresponding epigenetic features. Experiments on commonly used datasets show that our CNN-SVR system outperforms available state-of-the-art methods in terms of prediction accuracy, generalization, and robustness. Source codes are available at https://github.com/Peppags/CNN-SVR. Frontiers Media S.A. 2020-01-08 /pmc/articles/PMC6960259/ /pubmed/31969902 http://dx.doi.org/10.3389/fgene.2019.01303 Text en Copyright © 2020 Zhang, Dai and Dai http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Zhang, Guishan
Dai, Zhiming
Dai, Xianhua
A Novel Hybrid CNN-SVR for CRISPR/Cas9 Guide RNA Activity Prediction
title A Novel Hybrid CNN-SVR for CRISPR/Cas9 Guide RNA Activity Prediction
title_full A Novel Hybrid CNN-SVR for CRISPR/Cas9 Guide RNA Activity Prediction
title_fullStr A Novel Hybrid CNN-SVR for CRISPR/Cas9 Guide RNA Activity Prediction
title_full_unstemmed A Novel Hybrid CNN-SVR for CRISPR/Cas9 Guide RNA Activity Prediction
title_short A Novel Hybrid CNN-SVR for CRISPR/Cas9 Guide RNA Activity Prediction
title_sort novel hybrid cnn-svr for crispr/cas9 guide rna activity prediction
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6960259/
https://www.ncbi.nlm.nih.gov/pubmed/31969902
http://dx.doi.org/10.3389/fgene.2019.01303
work_keys_str_mv AT zhangguishan anovelhybridcnnsvrforcrisprcas9guidernaactivityprediction
AT daizhiming anovelhybridcnnsvrforcrisprcas9guidernaactivityprediction
AT daixianhua anovelhybridcnnsvrforcrisprcas9guidernaactivityprediction
AT zhangguishan novelhybridcnnsvrforcrisprcas9guidernaactivityprediction
AT daizhiming novelhybridcnnsvrforcrisprcas9guidernaactivityprediction
AT daixianhua novelhybridcnnsvrforcrisprcas9guidernaactivityprediction