Cargando…

A Guild of 45 CRISPR-Associated (Cas) Protein Families and Multiple CRISPR/Cas Subtypes Exist in Prokaryotic Genomes

Clustered regularly interspaced short palindromic repeats (CRISPRs) are a family of DNA direct repeats found in many prokaryotic genomes. Repeats of 21–37 bp typically show weak dyad symmetry and are separated by regularly sized, nonrepetitive spacer sequences. Four CRISPR-associated (Cas) protein f...

Descripción completa

Detalles Bibliográficos
Autores principales: Haft, Daniel H, Selengut, Jeremy, Mongodin, Emmanuel F, Nelson, Karen E
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1282333/
https://www.ncbi.nlm.nih.gov/pubmed/16292354
http://dx.doi.org/10.1371/journal.pcbi.0010060
_version_ 1782126133512241152
author Haft, Daniel H
Selengut, Jeremy
Mongodin, Emmanuel F
Nelson, Karen E
author_facet Haft, Daniel H
Selengut, Jeremy
Mongodin, Emmanuel F
Nelson, Karen E
author_sort Haft, Daniel H
collection PubMed
description Clustered regularly interspaced short palindromic repeats (CRISPRs) are a family of DNA direct repeats found in many prokaryotic genomes. Repeats of 21–37 bp typically show weak dyad symmetry and are separated by regularly sized, nonrepetitive spacer sequences. Four CRISPR-associated (Cas) protein families, designated Cas1 to Cas4, are strictly associated with CRISPR elements and always occur near a repeat cluster. Some spacers originate from mobile genetic elements and are thought to confer “immunity” against the elements that harbor these sequences. In the present study, we have systematically investigated uncharacterized proteins encoded in the vicinity of these CRISPRs and found many additional protein families that are strictly associated with CRISPR loci across multiple prokaryotic species. Multiple sequence alignments and hidden Markov models have been built for 45 Cas protein families. These models identify family members with high sensitivity and selectivity and classify key regulators of development, DevR and DevS, in Myxococcus xanthus as Cas proteins. These identifications show that CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a repeat cluster or filling the region between two repeat clusters. Distinctive subsets of the collection of Cas proteins recur in phylogenetically distant species and correlate with characteristic repeat periodicity. The analyses presented here support initial proposals of mobility of these units, along with the likelihood that loci of different subtypes interact with one another as well as with host cell defensive, replicative, and regulatory systems. It is evident from this analysis that CRISPR/cas loci are larger, more complex, and more heterogeneous than previously appreciated.
format Text
id pubmed-1282333
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-12823332005-12-01 A Guild of 45 CRISPR-Associated (Cas) Protein Families and Multiple CRISPR/Cas Subtypes Exist in Prokaryotic Genomes Haft, Daniel H Selengut, Jeremy Mongodin, Emmanuel F Nelson, Karen E PLoS Comput Biol Research Article Clustered regularly interspaced short palindromic repeats (CRISPRs) are a family of DNA direct repeats found in many prokaryotic genomes. Repeats of 21–37 bp typically show weak dyad symmetry and are separated by regularly sized, nonrepetitive spacer sequences. Four CRISPR-associated (Cas) protein families, designated Cas1 to Cas4, are strictly associated with CRISPR elements and always occur near a repeat cluster. Some spacers originate from mobile genetic elements and are thought to confer “immunity” against the elements that harbor these sequences. In the present study, we have systematically investigated uncharacterized proteins encoded in the vicinity of these CRISPRs and found many additional protein families that are strictly associated with CRISPR loci across multiple prokaryotic species. Multiple sequence alignments and hidden Markov models have been built for 45 Cas protein families. These models identify family members with high sensitivity and selectivity and classify key regulators of development, DevR and DevS, in Myxococcus xanthus as Cas proteins. These identifications show that CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a repeat cluster or filling the region between two repeat clusters. Distinctive subsets of the collection of Cas proteins recur in phylogenetically distant species and correlate with characteristic repeat periodicity. The analyses presented here support initial proposals of mobility of these units, along with the likelihood that loci of different subtypes interact with one another as well as with host cell defensive, replicative, and regulatory systems. It is evident from this analysis that CRISPR/cas loci are larger, more complex, and more heterogeneous than previously appreciated. Public Library of Science 2005-11 2005-11-11 /pmc/articles/PMC1282333/ /pubmed/16292354 http://dx.doi.org/10.1371/journal.pcbi.0010060 Text en Copyright: © 2005 Haft et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Haft, Daniel H
Selengut, Jeremy
Mongodin, Emmanuel F
Nelson, Karen E
A Guild of 45 CRISPR-Associated (Cas) Protein Families and Multiple CRISPR/Cas Subtypes Exist in Prokaryotic Genomes
title A Guild of 45 CRISPR-Associated (Cas) Protein Families and Multiple CRISPR/Cas Subtypes Exist in Prokaryotic Genomes
title_full A Guild of 45 CRISPR-Associated (Cas) Protein Families and Multiple CRISPR/Cas Subtypes Exist in Prokaryotic Genomes
title_fullStr A Guild of 45 CRISPR-Associated (Cas) Protein Families and Multiple CRISPR/Cas Subtypes Exist in Prokaryotic Genomes
title_full_unstemmed A Guild of 45 CRISPR-Associated (Cas) Protein Families and Multiple CRISPR/Cas Subtypes Exist in Prokaryotic Genomes
title_short A Guild of 45 CRISPR-Associated (Cas) Protein Families and Multiple CRISPR/Cas Subtypes Exist in Prokaryotic Genomes
title_sort guild of 45 crispr-associated (cas) protein families and multiple crispr/cas subtypes exist in prokaryotic genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1282333/
https://www.ncbi.nlm.nih.gov/pubmed/16292354
http://dx.doi.org/10.1371/journal.pcbi.0010060
work_keys_str_mv AT haftdanielh aguildof45crisprassociatedcasproteinfamiliesandmultiplecrisprcassubtypesexistinprokaryoticgenomes
AT selengutjeremy aguildof45crisprassociatedcasproteinfamiliesandmultiplecrisprcassubtypesexistinprokaryoticgenomes
AT mongodinemmanuelf aguildof45crisprassociatedcasproteinfamiliesandmultiplecrisprcassubtypesexistinprokaryoticgenomes
AT nelsonkarene aguildof45crisprassociatedcasproteinfamiliesandmultiplecrisprcassubtypesexistinprokaryoticgenomes
AT haftdanielh guildof45crisprassociatedcasproteinfamiliesandmultiplecrisprcassubtypesexistinprokaryoticgenomes
AT selengutjeremy guildof45crisprassociatedcasproteinfamiliesandmultiplecrisprcassubtypesexistinprokaryoticgenomes
AT mongodinemmanuelf guildof45crisprassociatedcasproteinfamiliesandmultiplecrisprcassubtypesexistinprokaryoticgenomes
AT nelsonkarene guildof45crisprassociatedcasproteinfamiliesandmultiplecrisprcassubtypesexistinprokaryoticgenomes