Cargando…

Genome mining for anti-CRISPR operons using machine learning

MOTIVATION: Encoded by (pro-)viruses, anti-CRISPR (Acr) proteins inhibit the CRISPR-Cas immune system of their prokaryotic hosts. As a result, Acr proteins can be employed to develop more controllable CRISPR-Cas genome editing tools. Recent studies revealed that known acr genes often coexist with ot...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Bowen, Khatri, Minal, Zheng, Jinfang, Deogun, Jitender, Yin, Yanbin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10196667/
https://www.ncbi.nlm.nih.gov/pubmed/37158576
http://dx.doi.org/10.1093/bioinformatics/btad309
_version_ 1785044397099843584
author Yang, Bowen
Khatri, Minal
Zheng, Jinfang
Deogun, Jitender
Yin, Yanbin
author_facet Yang, Bowen
Khatri, Minal
Zheng, Jinfang
Deogun, Jitender
Yin, Yanbin
author_sort Yang, Bowen
collection PubMed
description MOTIVATION: Encoded by (pro-)viruses, anti-CRISPR (Acr) proteins inhibit the CRISPR-Cas immune system of their prokaryotic hosts. As a result, Acr proteins can be employed to develop more controllable CRISPR-Cas genome editing tools. Recent studies revealed that known acr genes often coexist with other acr genes and with phage structural genes within the same operon. For example, we found that 47 of 98 known acr genes (or their homologs) co-exist in the same operons. None of the current Acr prediction tools have considered this important genomic context feature. We have developed a new software tool AOminer to facilitate the improved discovery of new Acrs by fully exploiting the genomic context of known acr genes and their homologs. RESULTS: AOminer is the first machine learning based tool focused on the discovery of Acr operons (AOs). A two-state HMM (hidden Markov model) was trained to learn the conserved genomic context of operons that contain known acr genes or their homologs, and the learnt features could distinguish AOs and non-AOs. AOminer allows automated mining for potential AOs from query genomes or operons. AOminer outperformed all existing Acr prediction tools with an accuracy = 0.85. AOminer will facilitate the discovery of novel anti-CRISPR operons. AVAILABILITY AND IMPLEMENTATION: The webserver is available at: http://aca.unl.edu/AOminer/AOminer_APP/. The python program is at: https://github.com/boweny920/AOminer.
format Online
Article
Text
id pubmed-10196667
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101966672023-05-20 Genome mining for anti-CRISPR operons using machine learning Yang, Bowen Khatri, Minal Zheng, Jinfang Deogun, Jitender Yin, Yanbin Bioinformatics Applications Note MOTIVATION: Encoded by (pro-)viruses, anti-CRISPR (Acr) proteins inhibit the CRISPR-Cas immune system of their prokaryotic hosts. As a result, Acr proteins can be employed to develop more controllable CRISPR-Cas genome editing tools. Recent studies revealed that known acr genes often coexist with other acr genes and with phage structural genes within the same operon. For example, we found that 47 of 98 known acr genes (or their homologs) co-exist in the same operons. None of the current Acr prediction tools have considered this important genomic context feature. We have developed a new software tool AOminer to facilitate the improved discovery of new Acrs by fully exploiting the genomic context of known acr genes and their homologs. RESULTS: AOminer is the first machine learning based tool focused on the discovery of Acr operons (AOs). A two-state HMM (hidden Markov model) was trained to learn the conserved genomic context of operons that contain known acr genes or their homologs, and the learnt features could distinguish AOs and non-AOs. AOminer allows automated mining for potential AOs from query genomes or operons. AOminer outperformed all existing Acr prediction tools with an accuracy = 0.85. AOminer will facilitate the discovery of novel anti-CRISPR operons. AVAILABILITY AND IMPLEMENTATION: The webserver is available at: http://aca.unl.edu/AOminer/AOminer_APP/. The python program is at: https://github.com/boweny920/AOminer. Oxford University Press 2023-05-09 /pmc/articles/PMC10196667/ /pubmed/37158576 http://dx.doi.org/10.1093/bioinformatics/btad309 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Yang, Bowen
Khatri, Minal
Zheng, Jinfang
Deogun, Jitender
Yin, Yanbin
Genome mining for anti-CRISPR operons using machine learning
title Genome mining for anti-CRISPR operons using machine learning
title_full Genome mining for anti-CRISPR operons using machine learning
title_fullStr Genome mining for anti-CRISPR operons using machine learning
title_full_unstemmed Genome mining for anti-CRISPR operons using machine learning
title_short Genome mining for anti-CRISPR operons using machine learning
title_sort genome mining for anti-crispr operons using machine learning
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10196667/
https://www.ncbi.nlm.nih.gov/pubmed/37158576
http://dx.doi.org/10.1093/bioinformatics/btad309
work_keys_str_mv AT yangbowen genomeminingforanticrisproperonsusingmachinelearning
AT khatriminal genomeminingforanticrisproperonsusingmachinelearning
AT zhengjinfang genomeminingforanticrisproperonsusingmachinelearning
AT deogunjitender genomeminingforanticrisproperonsusingmachinelearning
AT yinyanbin genomeminingforanticrisproperonsusingmachinelearning