Cargando…

Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm

BACKGROUND: The beauty and power of the genome editing mechanism, CRISPR Cas9 endonuclease system, lies in the fact that it is RNA-programmable such that Cas9 can be guided to any genomic loci complementary to a 20-nt RNA, single guide RNA (sgRNA), to cleave double stranded DNA, allowing the introdu...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Hong, Zhou, Michael, Li, Daisy, Manthey, Joseph, Lioutikova, Ekaterina, Wang, Hong, Zeng, Xiao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5773898/
https://www.ncbi.nlm.nih.gov/pubmed/29219081
http://dx.doi.org/10.1186/s12864-017-4225-1
_version_ 1783293660395208704
author Zhou, Hong
Zhou, Michael
Li, Daisy
Manthey, Joseph
Lioutikova, Ekaterina
Wang, Hong
Zeng, Xiao
author_facet Zhou, Hong
Zhou, Michael
Li, Daisy
Manthey, Joseph
Lioutikova, Ekaterina
Wang, Hong
Zeng, Xiao
author_sort Zhou, Hong
collection PubMed
description BACKGROUND: The beauty and power of the genome editing mechanism, CRISPR Cas9 endonuclease system, lies in the fact that it is RNA-programmable such that Cas9 can be guided to any genomic loci complementary to a 20-nt RNA, single guide RNA (sgRNA), to cleave double stranded DNA, allowing the introduction of wanted mutations. Unfortunately, it has been reported repeatedly that the sgRNA can also guide Cas9 to off-target sites where the DNA sequence is homologous to sgRNA. RESULTS: Using human genome and Streptococcus pyogenes Cas9 (SpCas9) as an example, this article mathematically analyzed the probabilities of off-target homologies of sgRNAs and discovered that for large genome size such as human genome, potential off-target homologies are inevitable for sgRNA selection. A highly efficient computationl algorithm was developed for whole genome sgRNA design and off-target homology searches. By means of a dynamically constructed sequence-indexed database and a simplified sequence alignment method, this algorithm achieves very high efficiency while guaranteeing the identification of all existing potential off-target homologies. Via this algorithm, 1,876,775 sgRNAs were designed for the 19,153 human mRNA genes and only two sgRNAs were found to be free of off-target homology. CONCLUSIONS: By means of the novel and efficient sgRNA homology search algorithm introduced in this article, genome wide sgRNA design and off-target analysis were conducted and the results confirmed the mathematical analysis that for a sgRNA sequence, it is almost impossible to escape potential off-target homologies. Future innovations on the CRISPR Cas9 gene editing technology need to focus on how to eliminate the Cas9 off-target activity.
format Online
Article
Text
id pubmed-5773898
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57738982018-01-26 Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm Zhou, Hong Zhou, Michael Li, Daisy Manthey, Joseph Lioutikova, Ekaterina Wang, Hong Zeng, Xiao BMC Genomics Research BACKGROUND: The beauty and power of the genome editing mechanism, CRISPR Cas9 endonuclease system, lies in the fact that it is RNA-programmable such that Cas9 can be guided to any genomic loci complementary to a 20-nt RNA, single guide RNA (sgRNA), to cleave double stranded DNA, allowing the introduction of wanted mutations. Unfortunately, it has been reported repeatedly that the sgRNA can also guide Cas9 to off-target sites where the DNA sequence is homologous to sgRNA. RESULTS: Using human genome and Streptococcus pyogenes Cas9 (SpCas9) as an example, this article mathematically analyzed the probabilities of off-target homologies of sgRNAs and discovered that for large genome size such as human genome, potential off-target homologies are inevitable for sgRNA selection. A highly efficient computationl algorithm was developed for whole genome sgRNA design and off-target homology searches. By means of a dynamically constructed sequence-indexed database and a simplified sequence alignment method, this algorithm achieves very high efficiency while guaranteeing the identification of all existing potential off-target homologies. Via this algorithm, 1,876,775 sgRNAs were designed for the 19,153 human mRNA genes and only two sgRNAs were found to be free of off-target homology. CONCLUSIONS: By means of the novel and efficient sgRNA homology search algorithm introduced in this article, genome wide sgRNA design and off-target analysis were conducted and the results confirmed the mathematical analysis that for a sgRNA sequence, it is almost impossible to escape potential off-target homologies. Future innovations on the CRISPR Cas9 gene editing technology need to focus on how to eliminate the Cas9 off-target activity. BioMed Central 2017-11-17 /pmc/articles/PMC5773898/ /pubmed/29219081 http://dx.doi.org/10.1186/s12864-017-4225-1 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zhou, Hong
Zhou, Michael
Li, Daisy
Manthey, Joseph
Lioutikova, Ekaterina
Wang, Hong
Zeng, Xiao
Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm
title Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm
title_full Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm
title_fullStr Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm
title_full_unstemmed Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm
title_short Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm
title_sort whole genome analysis of crispr cas9 sgrna off-target homologies via an efficient computational algorithm
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5773898/
https://www.ncbi.nlm.nih.gov/pubmed/29219081
http://dx.doi.org/10.1186/s12864-017-4225-1
work_keys_str_mv AT zhouhong wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm
AT zhoumichael wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm
AT lidaisy wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm
AT mantheyjoseph wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm
AT lioutikovaekaterina wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm
AT wanghong wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm
AT zengxiao wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm