Cargando…

CRISPRDetect: A flexible algorithm to define CRISPR arrays

BACKGROUND: CRISPR (clustered regularly interspaced short palindromic repeats) RNAs provide the specificity for noncoding RNA-guided adaptive immune defence systems in prokaryotes. CRISPR arrays consist of repeat sequences separated by specific spacer sequences. CRISPR arrays have previously been id...

Descripción completa

Detalles Bibliográficos
Autores principales: Biswas, Ambarish, Staals, Raymond H.J., Morales, Sergio E., Fineran, Peter C., Brown, Chris M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4869251/
https://www.ncbi.nlm.nih.gov/pubmed/27184979
http://dx.doi.org/10.1186/s12864-016-2627-0
_version_ 1782432283345551360
author Biswas, Ambarish
Staals, Raymond H.J.
Morales, Sergio E.
Fineran, Peter C.
Brown, Chris M.
author_facet Biswas, Ambarish
Staals, Raymond H.J.
Morales, Sergio E.
Fineran, Peter C.
Brown, Chris M.
author_sort Biswas, Ambarish
collection PubMed
description BACKGROUND: CRISPR (clustered regularly interspaced short palindromic repeats) RNAs provide the specificity for noncoding RNA-guided adaptive immune defence systems in prokaryotes. CRISPR arrays consist of repeat sequences separated by specific spacer sequences. CRISPR arrays have previously been identified in a large proportion of prokaryotic genomes. However, currently available detection algorithms do not utilise recently discovered features regarding CRISPR loci. RESULTS: We have developed a new approach to automatically detect, predict and interactively refine CRISPR arrays. It is available as a web program and command line from bioanalysis.otago.ac.nz/CRISPRDetect. CRISPRDetect discovers putative arrays, extends the array by detecting additional variant repeats, corrects the direction of arrays, refines the repeat/spacer boundaries, and annotates different types of sequence variations (e.g. insertion/deletion) in near identical repeats. Due to these features, CRISPRDetect has significant advantages when compared to existing identification tools. As well as further support for small medium and large repeats, CRISPRDetect identified a class of arrays with ‘extra-large’ repeats in bacteria (repeats 44–50 nt). The CRISPRDetect output is integrated with other analysis tools. Notably, the predicted spacers can be directly utilised by CRISPRTarget to predict targets. CONCLUSION: CRISPRDetect enables more accurate detection of arrays and spacers and its gff output is suitable for inclusion in genome annotation pipelines and visualisation. It has been used to analyse all complete bacterial and archaeal reference genomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2627-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4869251
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48692512016-05-18 CRISPRDetect: A flexible algorithm to define CRISPR arrays Biswas, Ambarish Staals, Raymond H.J. Morales, Sergio E. Fineran, Peter C. Brown, Chris M. BMC Genomics Software BACKGROUND: CRISPR (clustered regularly interspaced short palindromic repeats) RNAs provide the specificity for noncoding RNA-guided adaptive immune defence systems in prokaryotes. CRISPR arrays consist of repeat sequences separated by specific spacer sequences. CRISPR arrays have previously been identified in a large proportion of prokaryotic genomes. However, currently available detection algorithms do not utilise recently discovered features regarding CRISPR loci. RESULTS: We have developed a new approach to automatically detect, predict and interactively refine CRISPR arrays. It is available as a web program and command line from bioanalysis.otago.ac.nz/CRISPRDetect. CRISPRDetect discovers putative arrays, extends the array by detecting additional variant repeats, corrects the direction of arrays, refines the repeat/spacer boundaries, and annotates different types of sequence variations (e.g. insertion/deletion) in near identical repeats. Due to these features, CRISPRDetect has significant advantages when compared to existing identification tools. As well as further support for small medium and large repeats, CRISPRDetect identified a class of arrays with ‘extra-large’ repeats in bacteria (repeats 44–50 nt). The CRISPRDetect output is integrated with other analysis tools. Notably, the predicted spacers can be directly utilised by CRISPRTarget to predict targets. CONCLUSION: CRISPRDetect enables more accurate detection of arrays and spacers and its gff output is suitable for inclusion in genome annotation pipelines and visualisation. It has been used to analyse all complete bacterial and archaeal reference genomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2627-0) contains supplementary material, which is available to authorized users. BioMed Central 2016-05-17 /pmc/articles/PMC4869251/ /pubmed/27184979 http://dx.doi.org/10.1186/s12864-016-2627-0 Text en © Biswas et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Biswas, Ambarish
Staals, Raymond H.J.
Morales, Sergio E.
Fineran, Peter C.
Brown, Chris M.
CRISPRDetect: A flexible algorithm to define CRISPR arrays
title CRISPRDetect: A flexible algorithm to define CRISPR arrays
title_full CRISPRDetect: A flexible algorithm to define CRISPR arrays
title_fullStr CRISPRDetect: A flexible algorithm to define CRISPR arrays
title_full_unstemmed CRISPRDetect: A flexible algorithm to define CRISPR arrays
title_short CRISPRDetect: A flexible algorithm to define CRISPR arrays
title_sort crisprdetect: a flexible algorithm to define crispr arrays
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4869251/
https://www.ncbi.nlm.nih.gov/pubmed/27184979
http://dx.doi.org/10.1186/s12864-016-2627-0
work_keys_str_mv AT biswasambarish crisprdetectaflexiblealgorithmtodefinecrisprarrays
AT staalsraymondhj crisprdetectaflexiblealgorithmtodefinecrisprarrays
AT moralessergioe crisprdetectaflexiblealgorithmtodefinecrisprarrays
AT fineranpeterc crisprdetectaflexiblealgorithmtodefinecrisprarrays
AT brownchrism crisprdetectaflexiblealgorithmtodefinecrisprarrays