Cargando…

AcaFinder: Genome Mining for Anti-CRISPR-Associated Genes

Anti-CRISPR (Acr) proteins are encoded by (pro)viruses to inhibit their host’s CRISPR-Cas systems. Genes encoding Acr and Aca (Acr associated) proteins often colocalize to form acr-aca operons. Here, we present AcaFinder as the first Aca genome mining tool. AcaFinder can (i) predict Acas and their a...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Bowen, Zheng, Jinfang, Yin, Yanbin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9765179/
https://www.ncbi.nlm.nih.gov/pubmed/36413017
http://dx.doi.org/10.1128/msystems.00817-22
_version_ 1784853428495712256
author Yang, Bowen
Zheng, Jinfang
Yin, Yanbin
author_facet Yang, Bowen
Zheng, Jinfang
Yin, Yanbin
author_sort Yang, Bowen
collection PubMed
description Anti-CRISPR (Acr) proteins are encoded by (pro)viruses to inhibit their host’s CRISPR-Cas systems. Genes encoding Acr and Aca (Acr associated) proteins often colocalize to form acr-aca operons. Here, we present AcaFinder as the first Aca genome mining tool. AcaFinder can (i) predict Acas and their associated acr-aca operons using guilt-by-association (GBA); (ii) identify homologs of known Acas using an HMM (Hidden Markov model) database; (iii) take input genomes for potential prophages, CRISPR-Cas systems, and self-targeting spacers (STSs); and (iv) provide a standalone program (https://github.com/boweny920/AcaFinder) and a web server (http://aca.unl.edu/Aca). AcaFinder was applied to mining over 16,000 prokaryotic and 142,000 gut phage genomes. After a multistep filtering, 36 high-confident new Aca families were identified, which is three times that of the 12 known Aca families. Seven new Aca families were from major human gut bacteria (Bacteroidota, Actinobacteria, and Fusobacteria) and their phages, while most known Aca families were from Proteobacteria and Firmicutes. A complex association network between Acrs and Acas was revealed by analyzing their operonic colocalizations. It appears very common in evolution that the same aca genes can recombine with different acr genes and vice versa to form diverse acr-aca operon combinations. IMPORTANCE At least four bioinformatics programs have been published for genome mining of Acrs since 2020. In contrast, no bioinformatics tools are available for automated Aca discovery. As the self-transcriptional repressor of acr-aca operons, Aca can be viewed as anti-anti-CRISPRs, with great potential in the improvement of CRISPR-Cas technology. Although all the 12 known Aca proteins contain a conserved helix-turn-helix (HTH) domain, not all HTH-containing proteins are Acas. However, HTH-containing proteins with adjacent Acr homologs encoded in the same genetic operon are likely Aca proteins. AcaFinder implements this guilt-by-association idea and the idea of using HMMs of known Acas for homologs into one software package. Applying AcaFinder in screening prokaryotic and gut phage genomes reveals a complex acr-aca operonic colocalization network between different families of Acrs and Acas.
format Online
Article
Text
id pubmed-9765179
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-97651792022-12-21 AcaFinder: Genome Mining for Anti-CRISPR-Associated Genes Yang, Bowen Zheng, Jinfang Yin, Yanbin mSystems Research Article Anti-CRISPR (Acr) proteins are encoded by (pro)viruses to inhibit their host’s CRISPR-Cas systems. Genes encoding Acr and Aca (Acr associated) proteins often colocalize to form acr-aca operons. Here, we present AcaFinder as the first Aca genome mining tool. AcaFinder can (i) predict Acas and their associated acr-aca operons using guilt-by-association (GBA); (ii) identify homologs of known Acas using an HMM (Hidden Markov model) database; (iii) take input genomes for potential prophages, CRISPR-Cas systems, and self-targeting spacers (STSs); and (iv) provide a standalone program (https://github.com/boweny920/AcaFinder) and a web server (http://aca.unl.edu/Aca). AcaFinder was applied to mining over 16,000 prokaryotic and 142,000 gut phage genomes. After a multistep filtering, 36 high-confident new Aca families were identified, which is three times that of the 12 known Aca families. Seven new Aca families were from major human gut bacteria (Bacteroidota, Actinobacteria, and Fusobacteria) and their phages, while most known Aca families were from Proteobacteria and Firmicutes. A complex association network between Acrs and Acas was revealed by analyzing their operonic colocalizations. It appears very common in evolution that the same aca genes can recombine with different acr genes and vice versa to form diverse acr-aca operon combinations. IMPORTANCE At least four bioinformatics programs have been published for genome mining of Acrs since 2020. In contrast, no bioinformatics tools are available for automated Aca discovery. As the self-transcriptional repressor of acr-aca operons, Aca can be viewed as anti-anti-CRISPRs, with great potential in the improvement of CRISPR-Cas technology. Although all the 12 known Aca proteins contain a conserved helix-turn-helix (HTH) domain, not all HTH-containing proteins are Acas. However, HTH-containing proteins with adjacent Acr homologs encoded in the same genetic operon are likely Aca proteins. AcaFinder implements this guilt-by-association idea and the idea of using HMMs of known Acas for homologs into one software package. Applying AcaFinder in screening prokaryotic and gut phage genomes reveals a complex acr-aca operonic colocalization network between different families of Acrs and Acas. American Society for Microbiology 2022-11-22 /pmc/articles/PMC9765179/ /pubmed/36413017 http://dx.doi.org/10.1128/msystems.00817-22 Text en Copyright © 2022 Yang et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Yang, Bowen
Zheng, Jinfang
Yin, Yanbin
AcaFinder: Genome Mining for Anti-CRISPR-Associated Genes
title AcaFinder: Genome Mining for Anti-CRISPR-Associated Genes
title_full AcaFinder: Genome Mining for Anti-CRISPR-Associated Genes
title_fullStr AcaFinder: Genome Mining for Anti-CRISPR-Associated Genes
title_full_unstemmed AcaFinder: Genome Mining for Anti-CRISPR-Associated Genes
title_short AcaFinder: Genome Mining for Anti-CRISPR-Associated Genes
title_sort acafinder: genome mining for anti-crispr-associated genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9765179/
https://www.ncbi.nlm.nih.gov/pubmed/36413017
http://dx.doi.org/10.1128/msystems.00817-22
work_keys_str_mv AT yangbowen acafindergenomeminingforanticrisprassociatedgenes
AT zhengjinfang acafindergenomeminingforanticrisprassociatedgenes
AT yinyanbin acafindergenomeminingforanticrisprassociatedgenes