Cargando…
On the global CRISPR array behavior in class I systems
BACKGROUND: Much effort is underway to build and upgrade databases and tools related to occurrence, diversity, and characterization of CRISPR-Cas systems. As microbial communities and their genome complements are unearthed, much emphasis has been placed on details of individual strains and model sys...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5575924/ https://www.ncbi.nlm.nih.gov/pubmed/28851439 http://dx.doi.org/10.1186/s13062-017-0193-2 |
_version_ | 1783260127024906240 |
---|---|
author | Toms, Alice Barrangou, Rodolphe |
author_facet | Toms, Alice Barrangou, Rodolphe |
author_sort | Toms, Alice |
collection | PubMed |
description | BACKGROUND: Much effort is underway to build and upgrade databases and tools related to occurrence, diversity, and characterization of CRISPR-Cas systems. As microbial communities and their genome complements are unearthed, much emphasis has been placed on details of individual strains and model systems within the CRISPR-Cas classification, and that collection of information as a whole affords the opportunity to analyze CRISPR-Cas systems from a quantitative perspective to gain insight into distribution of CRISPR array sizes across the different classes, types and subtypes. CRISPR diversity, nomenclature, occurrence, and biological functions have generated a plethora of data that created a need to understand the size and distribution of these various systems to appreciate their features and complexity. RESULTS: By utilizing a statistical framework and visual analytic techniques, we have been able to test several hypotheses about CRISPR loci in bacterial class I systems. Quantitatively, though CRISPR loci can expand to hundreds of spacers, the mean and median sizes are 40 and 25, respectively, reflecting rather modest acquisition and/or retention overall. Histograms uncovered that CRISPR array size displayed a parametric distribution, which was confirmed by a goodness-of fit test. Mapping the frequency of CRISPR loci on a standardized chromosome plot revealed that CRISPRs have a higher probability of occurring at clustered locations along the positive or negative strand. Lastly, when multiple arrays occur in a particular system, the size of a particular CRISPR array varies with its distance from the cas operon, reflecting acquisition and expansion biases. CONCLUSIONS: This study establishes that bacterial Class I CRISPR array size tends to follow a geometric distribution; these CRISPRs are not randomly distributed along the chromosome; and the CRISPR array closest to the cas genes is typically larger than loci in trans. Overall, we provide an analytical framework to understand the features and behavior of CRISPR-Cas systems through a quantitative lens. REVIEWERS: This article was reviewed by Eugene Koonin (NIH-NCBI) and Uri Gophna (Tel Aviv University). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13062-017-0193-2) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5575924 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-55759242017-08-30 On the global CRISPR array behavior in class I systems Toms, Alice Barrangou, Rodolphe Biol Direct Research BACKGROUND: Much effort is underway to build and upgrade databases and tools related to occurrence, diversity, and characterization of CRISPR-Cas systems. As microbial communities and their genome complements are unearthed, much emphasis has been placed on details of individual strains and model systems within the CRISPR-Cas classification, and that collection of information as a whole affords the opportunity to analyze CRISPR-Cas systems from a quantitative perspective to gain insight into distribution of CRISPR array sizes across the different classes, types and subtypes. CRISPR diversity, nomenclature, occurrence, and biological functions have generated a plethora of data that created a need to understand the size and distribution of these various systems to appreciate their features and complexity. RESULTS: By utilizing a statistical framework and visual analytic techniques, we have been able to test several hypotheses about CRISPR loci in bacterial class I systems. Quantitatively, though CRISPR loci can expand to hundreds of spacers, the mean and median sizes are 40 and 25, respectively, reflecting rather modest acquisition and/or retention overall. Histograms uncovered that CRISPR array size displayed a parametric distribution, which was confirmed by a goodness-of fit test. Mapping the frequency of CRISPR loci on a standardized chromosome plot revealed that CRISPRs have a higher probability of occurring at clustered locations along the positive or negative strand. Lastly, when multiple arrays occur in a particular system, the size of a particular CRISPR array varies with its distance from the cas operon, reflecting acquisition and expansion biases. CONCLUSIONS: This study establishes that bacterial Class I CRISPR array size tends to follow a geometric distribution; these CRISPRs are not randomly distributed along the chromosome; and the CRISPR array closest to the cas genes is typically larger than loci in trans. Overall, we provide an analytical framework to understand the features and behavior of CRISPR-Cas systems through a quantitative lens. REVIEWERS: This article was reviewed by Eugene Koonin (NIH-NCBI) and Uri Gophna (Tel Aviv University). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13062-017-0193-2) contains supplementary material, which is available to authorized users. BioMed Central 2017-08-29 /pmc/articles/PMC5575924/ /pubmed/28851439 http://dx.doi.org/10.1186/s13062-017-0193-2 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Toms, Alice Barrangou, Rodolphe On the global CRISPR array behavior in class I systems |
title | On the global CRISPR array behavior in class I systems |
title_full | On the global CRISPR array behavior in class I systems |
title_fullStr | On the global CRISPR array behavior in class I systems |
title_full_unstemmed | On the global CRISPR array behavior in class I systems |
title_short | On the global CRISPR array behavior in class I systems |
title_sort | on the global crispr array behavior in class i systems |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5575924/ https://www.ncbi.nlm.nih.gov/pubmed/28851439 http://dx.doi.org/10.1186/s13062-017-0193-2 |
work_keys_str_mv | AT tomsalice ontheglobalcrisprarraybehaviorinclassisystems AT barrangourodolphe ontheglobalcrisprarraybehaviorinclassisystems |