Cargando…

Effective use of sequence information to predict CRISPR-Cas9 off-target

The CRISPR/Cas9 gene-editing system is the third-generation gene-editing technology that has been widely used in biomedical applications. However, off-target effects occurring CRISPR/Cas9 system has been a challenging problem it faces in practical applications. Although many predictive models have b...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Zhong-Rui, Jiang, Zhen-Ran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8804193/
https://www.ncbi.nlm.nih.gov/pubmed/35140885
http://dx.doi.org/10.1016/j.csbj.2022.01.006
_version_ 1784643020767887360
author Zhang, Zhong-Rui
Jiang, Zhen-Ran
author_facet Zhang, Zhong-Rui
Jiang, Zhen-Ran
author_sort Zhang, Zhong-Rui
collection PubMed
description The CRISPR/Cas9 gene-editing system is the third-generation gene-editing technology that has been widely used in biomedical applications. However, off-target effects occurring CRISPR/Cas9 system has been a challenging problem it faces in practical applications. Although many predictive models have been developed to predict off-target activities, current models do not effectively use sequence pair information. There is still room for improved accuracy. This study aims to effectively use sequence pair information to improve the model's performance for predicting off-target activities. We propose a new coding scheme for coding sequence pairs and design a new model called CRISPR-IP for predicting off-target activity. Our coding scheme distinguishes regions with different functions in the sequence pairs through the function channel. Moreover, it distinguishes between bases and base pairs using type channels, effectively representing the sequence pair information. The CRISPR-IP model is based on CNN, BiLSTM, and the attention layer to learn features of sequence pairs. We performed performance verification on two data sets and found that our coding scheme can represent sequence pair information effectively, and the CRISPR-IP model performance is better than others. Data and source codes are available at https://github.com/BioinfoVirgo/CRISPR-IP.
format Online
Article
Text
id pubmed-8804193
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-88041932022-02-08 Effective use of sequence information to predict CRISPR-Cas9 off-target Zhang, Zhong-Rui Jiang, Zhen-Ran Comput Struct Biotechnol J Research Article The CRISPR/Cas9 gene-editing system is the third-generation gene-editing technology that has been widely used in biomedical applications. However, off-target effects occurring CRISPR/Cas9 system has been a challenging problem it faces in practical applications. Although many predictive models have been developed to predict off-target activities, current models do not effectively use sequence pair information. There is still room for improved accuracy. This study aims to effectively use sequence pair information to improve the model's performance for predicting off-target activities. We propose a new coding scheme for coding sequence pairs and design a new model called CRISPR-IP for predicting off-target activity. Our coding scheme distinguishes regions with different functions in the sequence pairs through the function channel. Moreover, it distinguishes between bases and base pairs using type channels, effectively representing the sequence pair information. The CRISPR-IP model is based on CNN, BiLSTM, and the attention layer to learn features of sequence pairs. We performed performance verification on two data sets and found that our coding scheme can represent sequence pair information effectively, and the CRISPR-IP model performance is better than others. Data and source codes are available at https://github.com/BioinfoVirgo/CRISPR-IP. Research Network of Computational and Structural Biotechnology 2022-01-19 /pmc/articles/PMC8804193/ /pubmed/35140885 http://dx.doi.org/10.1016/j.csbj.2022.01.006 Text en © 2022 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Zhang, Zhong-Rui
Jiang, Zhen-Ran
Effective use of sequence information to predict CRISPR-Cas9 off-target
title Effective use of sequence information to predict CRISPR-Cas9 off-target
title_full Effective use of sequence information to predict CRISPR-Cas9 off-target
title_fullStr Effective use of sequence information to predict CRISPR-Cas9 off-target
title_full_unstemmed Effective use of sequence information to predict CRISPR-Cas9 off-target
title_short Effective use of sequence information to predict CRISPR-Cas9 off-target
title_sort effective use of sequence information to predict crispr-cas9 off-target
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8804193/
https://www.ncbi.nlm.nih.gov/pubmed/35140885
http://dx.doi.org/10.1016/j.csbj.2022.01.006
work_keys_str_mv AT zhangzhongrui effectiveuseofsequenceinformationtopredictcrisprcas9offtarget
AT jiangzhenran effectiveuseofsequenceinformationtopredictcrisprcas9offtarget