Cargando…

CRISPR-SE: a brute force search engine for CRISPR design

CRISPR is a revolutionary genome-editing tool that has been broadly used and integrated within novel biotechnologies. A major component of existing CRISPR design tools is the search engines that find the off-targets up to a predefined number of mismatches. Many CRISPR design tools adapted sequence a...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Bin, Chen, Poshen B, Diao, Yarui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7902234/
https://www.ncbi.nlm.nih.gov/pubmed/33655210
http://dx.doi.org/10.1093/nargab/lqab013
_version_ 1783654519238819840
author Li, Bin
Chen, Poshen B
Diao, Yarui
author_facet Li, Bin
Chen, Poshen B
Diao, Yarui
author_sort Li, Bin
collection PubMed
description CRISPR is a revolutionary genome-editing tool that has been broadly used and integrated within novel biotechnologies. A major component of existing CRISPR design tools is the search engines that find the off-targets up to a predefined number of mismatches. Many CRISPR design tools adapted sequence alignment tools as the search engines to speed up the process. These commonly used alignment tools include BLAST, BLAT, Bowtie, Bowtie2 and BWA. Alignment tools use heuristic algorithm to align large amount of sequences with high performance. However, due to the seed-and-extend algorithms implemented in the sequence alignment tools, these methods are likely to provide incomplete off-targets information for ultra-short sequences, such as 20-bp guide RNAs (gRNA). An incomplete list of off-targets sites may lead to erroneous CRISPR design. To address this problem, we derived four sets of gRNAs to evaluate the accuracy of existing search engines; further, we introduce a search engine, namely CRISPR-SE. CRISPR-SE is an accurate and fast search engine using a brute force approach. In CRISPR-SE, all gRNAs are virtually compared with query gRNA, therefore, the accuracies are guaranteed. We performed the accuracy benchmark with multiple search engines. The results show that as expected, alignment tools reported an incomplete and varied list of off-target sites. CRISPR-SE performs well in both accuracy and speed. CRISPR-SE will improve the quality of CRISPR design as an accurate high-performance search engine.
format Online
Article
Text
id pubmed-7902234
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-79022342021-03-01 CRISPR-SE: a brute force search engine for CRISPR design Li, Bin Chen, Poshen B Diao, Yarui NAR Genom Bioinform Methods Article CRISPR is a revolutionary genome-editing tool that has been broadly used and integrated within novel biotechnologies. A major component of existing CRISPR design tools is the search engines that find the off-targets up to a predefined number of mismatches. Many CRISPR design tools adapted sequence alignment tools as the search engines to speed up the process. These commonly used alignment tools include BLAST, BLAT, Bowtie, Bowtie2 and BWA. Alignment tools use heuristic algorithm to align large amount of sequences with high performance. However, due to the seed-and-extend algorithms implemented in the sequence alignment tools, these methods are likely to provide incomplete off-targets information for ultra-short sequences, such as 20-bp guide RNAs (gRNA). An incomplete list of off-targets sites may lead to erroneous CRISPR design. To address this problem, we derived four sets of gRNAs to evaluate the accuracy of existing search engines; further, we introduce a search engine, namely CRISPR-SE. CRISPR-SE is an accurate and fast search engine using a brute force approach. In CRISPR-SE, all gRNAs are virtually compared with query gRNA, therefore, the accuracies are guaranteed. We performed the accuracy benchmark with multiple search engines. The results show that as expected, alignment tools reported an incomplete and varied list of off-target sites. CRISPR-SE performs well in both accuracy and speed. CRISPR-SE will improve the quality of CRISPR design as an accurate high-performance search engine. Oxford University Press 2021-02-23 /pmc/articles/PMC7902234/ /pubmed/33655210 http://dx.doi.org/10.1093/nargab/lqab013 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Article
Li, Bin
Chen, Poshen B
Diao, Yarui
CRISPR-SE: a brute force search engine for CRISPR design
title CRISPR-SE: a brute force search engine for CRISPR design
title_full CRISPR-SE: a brute force search engine for CRISPR design
title_fullStr CRISPR-SE: a brute force search engine for CRISPR design
title_full_unstemmed CRISPR-SE: a brute force search engine for CRISPR design
title_short CRISPR-SE: a brute force search engine for CRISPR design
title_sort crispr-se: a brute force search engine for crispr design
topic Methods Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7902234/
https://www.ncbi.nlm.nih.gov/pubmed/33655210
http://dx.doi.org/10.1093/nargab/lqab013
work_keys_str_mv AT libin crisprseabruteforcesearchengineforcrisprdesign
AT chenposhenb crisprseabruteforcesearchengineforcrisprdesign
AT diaoyarui crisprseabruteforcesearchengineforcrisprdesign