Cargando…
Comprehensive computational analysis of epigenetic descriptors affecting CRISPR-Cas9 off-target activity
BACKGROUND: A common issue in CRISPR-Cas9 genome editing is off-target activity, which prevents the widespread use of CRISPR-Cas9 in medical applications. Among other factors, primary chromatin structure and epigenetics may influence off-target activity. METHODS: In this work, we utilize crisprSQL,...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9724382/ https://www.ncbi.nlm.nih.gov/pubmed/36474180 http://dx.doi.org/10.1186/s12864-022-09012-7 |
_version_ | 1784844403023544320 |
---|---|
author | Mak, Jeffrey K. Störtz, Florian Minary, Peter |
author_facet | Mak, Jeffrey K. Störtz, Florian Minary, Peter |
author_sort | Mak, Jeffrey K. |
collection | PubMed |
description | BACKGROUND: A common issue in CRISPR-Cas9 genome editing is off-target activity, which prevents the widespread use of CRISPR-Cas9 in medical applications. Among other factors, primary chromatin structure and epigenetics may influence off-target activity. METHODS: In this work, we utilize crisprSQL, an off-target database, to analyze the effect of 19 epigenetic descriptors on CRISPR-Cas9 off-target activity. Termed as 19 epigenetic features/scores, they consist of 6 experimental epigenetic and 13 computed nucleosome organization-related features. In terms of novel features, 15 of the epigenetic scores are newly considered. The 15 newly considered scores consist of 13 freshly computed nucleosome occupancy/positioning scores and 2 experimental features (MNase and DRIP). The other 4 existing scores are experimental features (CTCF, DNase I, H3K4me3, RRBS) commonly used in deep learning models for off-target activity prediction. For data curation, MNase was aggregated from existing experimental nucleosome occupancy data. Based on the sequence context information available in crisprSQL, we also computed nucleosome occupancy/positioning scores for off-target sites. RESULTS: To investigate the relationship between the 19 epigenetic features and off-target activity, we first conducted Spearman and Pearson correlation analysis. Such analysis shows that some computed scores derived from training-based models and training-free algorithms outperform all experimental epigenetic features. Next, we evaluated the contribution of all epigenetic features in two successful machine/deep learning models which predict off-target activity. We found that some computed scores, unlike all 6 experimental features, significantly contribute to the predictions of both models. As a practical research contribution, we make the off-target dataset containing all 19 epigenetic features available to the research community. CONCLUSIONS: Our comprehensive computational analysis helps the CRISPR-Cas9 community better understand the relationship between epigenetic features and CRISPR-Cas9 off-target activity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-09012-7. |
format | Online Article Text |
id | pubmed-9724382 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-97243822022-12-07 Comprehensive computational analysis of epigenetic descriptors affecting CRISPR-Cas9 off-target activity Mak, Jeffrey K. Störtz, Florian Minary, Peter BMC Genomics Research BACKGROUND: A common issue in CRISPR-Cas9 genome editing is off-target activity, which prevents the widespread use of CRISPR-Cas9 in medical applications. Among other factors, primary chromatin structure and epigenetics may influence off-target activity. METHODS: In this work, we utilize crisprSQL, an off-target database, to analyze the effect of 19 epigenetic descriptors on CRISPR-Cas9 off-target activity. Termed as 19 epigenetic features/scores, they consist of 6 experimental epigenetic and 13 computed nucleosome organization-related features. In terms of novel features, 15 of the epigenetic scores are newly considered. The 15 newly considered scores consist of 13 freshly computed nucleosome occupancy/positioning scores and 2 experimental features (MNase and DRIP). The other 4 existing scores are experimental features (CTCF, DNase I, H3K4me3, RRBS) commonly used in deep learning models for off-target activity prediction. For data curation, MNase was aggregated from existing experimental nucleosome occupancy data. Based on the sequence context information available in crisprSQL, we also computed nucleosome occupancy/positioning scores for off-target sites. RESULTS: To investigate the relationship between the 19 epigenetic features and off-target activity, we first conducted Spearman and Pearson correlation analysis. Such analysis shows that some computed scores derived from training-based models and training-free algorithms outperform all experimental epigenetic features. Next, we evaluated the contribution of all epigenetic features in two successful machine/deep learning models which predict off-target activity. We found that some computed scores, unlike all 6 experimental features, significantly contribute to the predictions of both models. As a practical research contribution, we make the off-target dataset containing all 19 epigenetic features available to the research community. CONCLUSIONS: Our comprehensive computational analysis helps the CRISPR-Cas9 community better understand the relationship between epigenetic features and CRISPR-Cas9 off-target activity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-09012-7. BioMed Central 2022-12-06 /pmc/articles/PMC9724382/ /pubmed/36474180 http://dx.doi.org/10.1186/s12864-022-09012-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Mak, Jeffrey K. Störtz, Florian Minary, Peter Comprehensive computational analysis of epigenetic descriptors affecting CRISPR-Cas9 off-target activity |
title | Comprehensive computational analysis of epigenetic descriptors affecting CRISPR-Cas9 off-target activity |
title_full | Comprehensive computational analysis of epigenetic descriptors affecting CRISPR-Cas9 off-target activity |
title_fullStr | Comprehensive computational analysis of epigenetic descriptors affecting CRISPR-Cas9 off-target activity |
title_full_unstemmed | Comprehensive computational analysis of epigenetic descriptors affecting CRISPR-Cas9 off-target activity |
title_short | Comprehensive computational analysis of epigenetic descriptors affecting CRISPR-Cas9 off-target activity |
title_sort | comprehensive computational analysis of epigenetic descriptors affecting crispr-cas9 off-target activity |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9724382/ https://www.ncbi.nlm.nih.gov/pubmed/36474180 http://dx.doi.org/10.1186/s12864-022-09012-7 |
work_keys_str_mv | AT makjeffreyk comprehensivecomputationalanalysisofepigeneticdescriptorsaffectingcrisprcas9offtargetactivity AT stortzflorian comprehensivecomputationalanalysisofepigeneticdescriptorsaffectingcrisprcas9offtargetactivity AT minarypeter comprehensivecomputationalanalysisofepigeneticdescriptorsaffectingcrisprcas9offtargetactivity |