Cargando…

Machine learning for design of degenerate Cas13a crRNAs using lassa virus as a model of highly variable RNA target

The design of minimum CRISPR RNA (crRNA) sets for detection of diverse RNA targets using sequence degeneracy has not been systematically addressed. We tested candidate degenerate Cas13a crRNA sets designed for detection of diverse RNA targets (Lassa virus). A decision tree machine learning (ML) algo...

Descripción completa

Detalles Bibliográficos
Autores principales: Leski, T. A., Spangler, J. R., Wang, Z., Schultzhaus, Z., Taitt, C. R., Dean, S. N., Stenger, D. A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10119381/
https://www.ncbi.nlm.nih.gov/pubmed/37081092
http://dx.doi.org/10.1038/s41598-023-33494-4
_version_ 1785029008833904640
author Leski, T. A.
Spangler, J. R.
Wang, Z.
Schultzhaus, Z.
Taitt, C. R.
Dean, S. N.
Stenger, D. A.
author_facet Leski, T. A.
Spangler, J. R.
Wang, Z.
Schultzhaus, Z.
Taitt, C. R.
Dean, S. N.
Stenger, D. A.
author_sort Leski, T. A.
collection PubMed
description The design of minimum CRISPR RNA (crRNA) sets for detection of diverse RNA targets using sequence degeneracy has not been systematically addressed. We tested candidate degenerate Cas13a crRNA sets designed for detection of diverse RNA targets (Lassa virus). A decision tree machine learning (ML) algorithm (RuleFit) was applied to define the top attributes that determine the specificity of degenerate crRNAs to elicit collateral nuclease activity. Although the total number of mismatches (0–4) is important, the specificity depends as well on the spacing of mismatches, and their proximity to the 5’ end of the spacer. We developed a predictive algorithm for design of candidate degenerate crRNA sets, allowing improved discrimination between “included” and “excluded” groups of related target sequences. A single degenerate crRNA set adhering to these rules detected representatives of all Lassa lineages. Our general ML approach may be applied to the design of degenerate crRNA sets for any CRISPR/Cas system.
format Online
Article
Text
id pubmed-10119381
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-101193812023-04-22 Machine learning for design of degenerate Cas13a crRNAs using lassa virus as a model of highly variable RNA target Leski, T. A. Spangler, J. R. Wang, Z. Schultzhaus, Z. Taitt, C. R. Dean, S. N. Stenger, D. A. Sci Rep Article The design of minimum CRISPR RNA (crRNA) sets for detection of diverse RNA targets using sequence degeneracy has not been systematically addressed. We tested candidate degenerate Cas13a crRNA sets designed for detection of diverse RNA targets (Lassa virus). A decision tree machine learning (ML) algorithm (RuleFit) was applied to define the top attributes that determine the specificity of degenerate crRNAs to elicit collateral nuclease activity. Although the total number of mismatches (0–4) is important, the specificity depends as well on the spacing of mismatches, and their proximity to the 5’ end of the spacer. We developed a predictive algorithm for design of candidate degenerate crRNA sets, allowing improved discrimination between “included” and “excluded” groups of related target sequences. A single degenerate crRNA set adhering to these rules detected representatives of all Lassa lineages. Our general ML approach may be applied to the design of degenerate crRNA sets for any CRISPR/Cas system. Nature Publishing Group UK 2023-04-20 /pmc/articles/PMC10119381/ /pubmed/37081092 http://dx.doi.org/10.1038/s41598-023-33494-4 Text en © This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Leski, T. A.
Spangler, J. R.
Wang, Z.
Schultzhaus, Z.
Taitt, C. R.
Dean, S. N.
Stenger, D. A.
Machine learning for design of degenerate Cas13a crRNAs using lassa virus as a model of highly variable RNA target
title Machine learning for design of degenerate Cas13a crRNAs using lassa virus as a model of highly variable RNA target
title_full Machine learning for design of degenerate Cas13a crRNAs using lassa virus as a model of highly variable RNA target
title_fullStr Machine learning for design of degenerate Cas13a crRNAs using lassa virus as a model of highly variable RNA target
title_full_unstemmed Machine learning for design of degenerate Cas13a crRNAs using lassa virus as a model of highly variable RNA target
title_short Machine learning for design of degenerate Cas13a crRNAs using lassa virus as a model of highly variable RNA target
title_sort machine learning for design of degenerate cas13a crrnas using lassa virus as a model of highly variable rna target
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10119381/
https://www.ncbi.nlm.nih.gov/pubmed/37081092
http://dx.doi.org/10.1038/s41598-023-33494-4
work_keys_str_mv AT leskita machinelearningfordesignofdegeneratecas13acrrnasusinglassavirusasamodelofhighlyvariablernatarget
AT spanglerjr machinelearningfordesignofdegeneratecas13acrrnasusinglassavirusasamodelofhighlyvariablernatarget
AT wangz machinelearningfordesignofdegeneratecas13acrrnasusinglassavirusasamodelofhighlyvariablernatarget
AT schultzhausz machinelearningfordesignofdegeneratecas13acrrnasusinglassavirusasamodelofhighlyvariablernatarget
AT taittcr machinelearningfordesignofdegeneratecas13acrrnasusinglassavirusasamodelofhighlyvariablernatarget
AT deansn machinelearningfordesignofdegeneratecas13acrrnasusinglassavirusasamodelofhighlyvariablernatarget
AT stengerda machinelearningfordesignofdegeneratecas13acrrnasusinglassavirusasamodelofhighlyvariablernatarget