Cargando…

CRISPRL and: Interpretable large-scale inference of DNA repair landscape based on a spectral approach

SUMMARY: We propose a new spectral framework for reliable training, scalable inference and interpretable explanation of the DNA repair outcome following a Cas9 cutting. Our framework, dubbed CRISPRL and, relies on an unexploited observation about the nature of the repair process: the landscape of th...

Descripción completa

Detalles Bibliográficos
Autores principales: Aghazadeh, Amirali, Ocal, Orhan, Ramchandran, Kannan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355252/
https://www.ncbi.nlm.nih.gov/pubmed/32657417
http://dx.doi.org/10.1093/bioinformatics/btaa505
_version_ 1783558237384081408
author Aghazadeh, Amirali
Ocal, Orhan
Ramchandran, Kannan
author_facet Aghazadeh, Amirali
Ocal, Orhan
Ramchandran, Kannan
author_sort Aghazadeh, Amirali
collection PubMed
description SUMMARY: We propose a new spectral framework for reliable training, scalable inference and interpretable explanation of the DNA repair outcome following a Cas9 cutting. Our framework, dubbed CRISPRL and, relies on an unexploited observation about the nature of the repair process: the landscape of the DNA repair is highly sparse in the (Walsh–Hadamard) spectral domain. This observation enables our framework to address key shortcomings that limit the interpretability and scaling of current deep-learning-based DNA repair models. In particular, CRISPRL and reduces the time to compute the full DNA repair landscape from a striking 5230 years to 1 week and the sampling complexity from [Formula: see text] to 3 million guide RNAs with only a small loss in accuracy (R(2) [Formula: see text] ∼ 0.9). Our proposed framework is based on a divide-and-conquer strategy that uses a fast peeling algorithm to learn the DNA repair models. CRISPRL and captures lower-degree features around the cut site, which enrich for short insertions and deletions as well as higher-degree microhomology patterns that enrich for longer deletions. AVAILABILITY AND IMPLEMENTATION: The CRISPRL and software is publicly available at https://github.com/UCBASiCS/CRISPRLand.
format Online
Article
Text
id pubmed-7355252
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-73552522020-07-16 CRISPRL and: Interpretable large-scale inference of DNA repair landscape based on a spectral approach Aghazadeh, Amirali Ocal, Orhan Ramchandran, Kannan Bioinformatics General Computational Biology SUMMARY: We propose a new spectral framework for reliable training, scalable inference and interpretable explanation of the DNA repair outcome following a Cas9 cutting. Our framework, dubbed CRISPRL and, relies on an unexploited observation about the nature of the repair process: the landscape of the DNA repair is highly sparse in the (Walsh–Hadamard) spectral domain. This observation enables our framework to address key shortcomings that limit the interpretability and scaling of current deep-learning-based DNA repair models. In particular, CRISPRL and reduces the time to compute the full DNA repair landscape from a striking 5230 years to 1 week and the sampling complexity from [Formula: see text] to 3 million guide RNAs with only a small loss in accuracy (R(2) [Formula: see text] ∼ 0.9). Our proposed framework is based on a divide-and-conquer strategy that uses a fast peeling algorithm to learn the DNA repair models. CRISPRL and captures lower-degree features around the cut site, which enrich for short insertions and deletions as well as higher-degree microhomology patterns that enrich for longer deletions. AVAILABILITY AND IMPLEMENTATION: The CRISPRL and software is publicly available at https://github.com/UCBASiCS/CRISPRLand. Oxford University Press 2020-07 2020-07-13 /pmc/articles/PMC7355252/ /pubmed/32657417 http://dx.doi.org/10.1093/bioinformatics/btaa505 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle General Computational Biology
Aghazadeh, Amirali
Ocal, Orhan
Ramchandran, Kannan
CRISPRL and: Interpretable large-scale inference of DNA repair landscape based on a spectral approach
title CRISPRL and: Interpretable large-scale inference of DNA repair landscape based on a spectral approach
title_full CRISPRL and: Interpretable large-scale inference of DNA repair landscape based on a spectral approach
title_fullStr CRISPRL and: Interpretable large-scale inference of DNA repair landscape based on a spectral approach
title_full_unstemmed CRISPRL and: Interpretable large-scale inference of DNA repair landscape based on a spectral approach
title_short CRISPRL and: Interpretable large-scale inference of DNA repair landscape based on a spectral approach
title_sort crisprl and: interpretable large-scale inference of dna repair landscape based on a spectral approach
topic General Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7355252/
https://www.ncbi.nlm.nih.gov/pubmed/32657417
http://dx.doi.org/10.1093/bioinformatics/btaa505
work_keys_str_mv AT aghazadehamirali crisprlandinterpretablelargescaleinferenceofdnarepairlandscapebasedonaspectralapproach
AT ocalorhan crisprlandinterpretablelargescaleinferenceofdnarepairlandscapebasedonaspectralapproach
AT ramchandrankannan crisprlandinterpretablelargescaleinferenceofdnarepairlandscapebasedonaspectralapproach