Cargando…
Effective use of sequence information to predict CRISPR-Cas9 off-target
The CRISPR/Cas9 gene-editing system is the third-generation gene-editing technology that has been widely used in biomedical applications. However, off-target effects occurring CRISPR/Cas9 system has been a challenging problem it faces in practical applications. Although many predictive models have b...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8804193/ https://www.ncbi.nlm.nih.gov/pubmed/35140885 http://dx.doi.org/10.1016/j.csbj.2022.01.006 |
_version_ | 1784643020767887360 |
---|---|
author | Zhang, Zhong-Rui Jiang, Zhen-Ran |
author_facet | Zhang, Zhong-Rui Jiang, Zhen-Ran |
author_sort | Zhang, Zhong-Rui |
collection | PubMed |
description | The CRISPR/Cas9 gene-editing system is the third-generation gene-editing technology that has been widely used in biomedical applications. However, off-target effects occurring CRISPR/Cas9 system has been a challenging problem it faces in practical applications. Although many predictive models have been developed to predict off-target activities, current models do not effectively use sequence pair information. There is still room for improved accuracy. This study aims to effectively use sequence pair information to improve the model's performance for predicting off-target activities. We propose a new coding scheme for coding sequence pairs and design a new model called CRISPR-IP for predicting off-target activity. Our coding scheme distinguishes regions with different functions in the sequence pairs through the function channel. Moreover, it distinguishes between bases and base pairs using type channels, effectively representing the sequence pair information. The CRISPR-IP model is based on CNN, BiLSTM, and the attention layer to learn features of sequence pairs. We performed performance verification on two data sets and found that our coding scheme can represent sequence pair information effectively, and the CRISPR-IP model performance is better than others. Data and source codes are available at https://github.com/BioinfoVirgo/CRISPR-IP. |
format | Online Article Text |
id | pubmed-8804193 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-88041932022-02-08 Effective use of sequence information to predict CRISPR-Cas9 off-target Zhang, Zhong-Rui Jiang, Zhen-Ran Comput Struct Biotechnol J Research Article The CRISPR/Cas9 gene-editing system is the third-generation gene-editing technology that has been widely used in biomedical applications. However, off-target effects occurring CRISPR/Cas9 system has been a challenging problem it faces in practical applications. Although many predictive models have been developed to predict off-target activities, current models do not effectively use sequence pair information. There is still room for improved accuracy. This study aims to effectively use sequence pair information to improve the model's performance for predicting off-target activities. We propose a new coding scheme for coding sequence pairs and design a new model called CRISPR-IP for predicting off-target activity. Our coding scheme distinguishes regions with different functions in the sequence pairs through the function channel. Moreover, it distinguishes between bases and base pairs using type channels, effectively representing the sequence pair information. The CRISPR-IP model is based on CNN, BiLSTM, and the attention layer to learn features of sequence pairs. We performed performance verification on two data sets and found that our coding scheme can represent sequence pair information effectively, and the CRISPR-IP model performance is better than others. Data and source codes are available at https://github.com/BioinfoVirgo/CRISPR-IP. Research Network of Computational and Structural Biotechnology 2022-01-19 /pmc/articles/PMC8804193/ /pubmed/35140885 http://dx.doi.org/10.1016/j.csbj.2022.01.006 Text en © 2022 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Zhang, Zhong-Rui Jiang, Zhen-Ran Effective use of sequence information to predict CRISPR-Cas9 off-target |
title | Effective use of sequence information to predict CRISPR-Cas9 off-target |
title_full | Effective use of sequence information to predict CRISPR-Cas9 off-target |
title_fullStr | Effective use of sequence information to predict CRISPR-Cas9 off-target |
title_full_unstemmed | Effective use of sequence information to predict CRISPR-Cas9 off-target |
title_short | Effective use of sequence information to predict CRISPR-Cas9 off-target |
title_sort | effective use of sequence information to predict crispr-cas9 off-target |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8804193/ https://www.ncbi.nlm.nih.gov/pubmed/35140885 http://dx.doi.org/10.1016/j.csbj.2022.01.006 |
work_keys_str_mv | AT zhangzhongrui effectiveuseofsequenceinformationtopredictcrisprcas9offtarget AT jiangzhenran effectiveuseofsequenceinformationtopredictcrisprcas9offtarget |