Cargando…

A machine learning approach for predicting CRISPR-Cas9 cleavage efficiencies and patterns underlying its mechanism of action

The adaptation of the CRISPR-Cas9 system as a genome editing technique has generated much excitement in recent years owing to its ability to manipulate targeted genes and genomic regions that are complementary to a programmed single guide RNA (sgRNA). However, the efficacy of a specific sgRNA is not...

Descripción completa

Detalles Bibliográficos
Autores principales: Abadi, Shiran, Yan, Winston X., Amar, David, Mayrose, Itay
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5658169/
https://www.ncbi.nlm.nih.gov/pubmed/29036168
http://dx.doi.org/10.1371/journal.pcbi.1005807
_version_ 1783273947009122304
author Abadi, Shiran
Yan, Winston X.
Amar, David
Mayrose, Itay
author_facet Abadi, Shiran
Yan, Winston X.
Amar, David
Mayrose, Itay
author_sort Abadi, Shiran
collection PubMed
description The adaptation of the CRISPR-Cas9 system as a genome editing technique has generated much excitement in recent years owing to its ability to manipulate targeted genes and genomic regions that are complementary to a programmed single guide RNA (sgRNA). However, the efficacy of a specific sgRNA is not uniquely defined by exact sequence homology to the target site, thus unintended off-targets might additionally be cleaved. Current methods for sgRNA design are mainly concerned with predicting off-targets for a given sgRNA using basic sequence features and employ elementary rules for ranking possible sgRNAs. Here, we introduce CRISTA (CRISPR Target Assessment), a novel algorithm within the machine learning framework that determines the propensity of a genomic site to be cleaved by a given sgRNA. We show that the predictions made with CRISTA are more accurate than other available methodologies. We further demonstrate that the occurrence of bulges is not a rare phenomenon and should be accounted for in the prediction process. Beyond predicting cleavage efficiencies, the learning process provides inferences regarding patterns that underlie the mechanism of action of the CRISPR-Cas9 system. We discover that attributes that describe the spatial structure and rigidity of the entire genomic site as well as those surrounding the PAM region are a major component of the prediction capabilities.
format Online
Article
Text
id pubmed-5658169
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56581692017-11-09 A machine learning approach for predicting CRISPR-Cas9 cleavage efficiencies and patterns underlying its mechanism of action Abadi, Shiran Yan, Winston X. Amar, David Mayrose, Itay PLoS Comput Biol Research Article The adaptation of the CRISPR-Cas9 system as a genome editing technique has generated much excitement in recent years owing to its ability to manipulate targeted genes and genomic regions that are complementary to a programmed single guide RNA (sgRNA). However, the efficacy of a specific sgRNA is not uniquely defined by exact sequence homology to the target site, thus unintended off-targets might additionally be cleaved. Current methods for sgRNA design are mainly concerned with predicting off-targets for a given sgRNA using basic sequence features and employ elementary rules for ranking possible sgRNAs. Here, we introduce CRISTA (CRISPR Target Assessment), a novel algorithm within the machine learning framework that determines the propensity of a genomic site to be cleaved by a given sgRNA. We show that the predictions made with CRISTA are more accurate than other available methodologies. We further demonstrate that the occurrence of bulges is not a rare phenomenon and should be accounted for in the prediction process. Beyond predicting cleavage efficiencies, the learning process provides inferences regarding patterns that underlie the mechanism of action of the CRISPR-Cas9 system. We discover that attributes that describe the spatial structure and rigidity of the entire genomic site as well as those surrounding the PAM region are a major component of the prediction capabilities. Public Library of Science 2017-10-16 /pmc/articles/PMC5658169/ /pubmed/29036168 http://dx.doi.org/10.1371/journal.pcbi.1005807 Text en © 2017 Abadi et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Abadi, Shiran
Yan, Winston X.
Amar, David
Mayrose, Itay
A machine learning approach for predicting CRISPR-Cas9 cleavage efficiencies and patterns underlying its mechanism of action
title A machine learning approach for predicting CRISPR-Cas9 cleavage efficiencies and patterns underlying its mechanism of action
title_full A machine learning approach for predicting CRISPR-Cas9 cleavage efficiencies and patterns underlying its mechanism of action
title_fullStr A machine learning approach for predicting CRISPR-Cas9 cleavage efficiencies and patterns underlying its mechanism of action
title_full_unstemmed A machine learning approach for predicting CRISPR-Cas9 cleavage efficiencies and patterns underlying its mechanism of action
title_short A machine learning approach for predicting CRISPR-Cas9 cleavage efficiencies and patterns underlying its mechanism of action
title_sort machine learning approach for predicting crispr-cas9 cleavage efficiencies and patterns underlying its mechanism of action
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5658169/
https://www.ncbi.nlm.nih.gov/pubmed/29036168
http://dx.doi.org/10.1371/journal.pcbi.1005807
work_keys_str_mv AT abadishiran amachinelearningapproachforpredictingcrisprcas9cleavageefficienciesandpatternsunderlyingitsmechanismofaction
AT yanwinstonx amachinelearningapproachforpredictingcrisprcas9cleavageefficienciesandpatternsunderlyingitsmechanismofaction
AT amardavid amachinelearningapproachforpredictingcrisprcas9cleavageefficienciesandpatternsunderlyingitsmechanismofaction
AT mayroseitay amachinelearningapproachforpredictingcrisprcas9cleavageefficienciesandpatternsunderlyingitsmechanismofaction
AT abadishiran machinelearningapproachforpredictingcrisprcas9cleavageefficienciesandpatternsunderlyingitsmechanismofaction
AT yanwinstonx machinelearningapproachforpredictingcrisprcas9cleavageefficienciesandpatternsunderlyingitsmechanismofaction
AT amardavid machinelearningapproachforpredictingcrisprcas9cleavageefficienciesandpatternsunderlyingitsmechanismofaction
AT mayroseitay machinelearningapproachforpredictingcrisprcas9cleavageefficienciesandpatternsunderlyingitsmechanismofaction