Cargando…

MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes

Clustered regularly interspaced short palindromic repeat (CRISPR) systems are the adaptive immune systems of bacteria and archaea against viral infection. While CRISPRs have been exploited as a tool for genetic engineering, their spacer sequences can also provide valuable insights into microbial eco...

Descripción completa

Detalles Bibliográficos
Autores principales: Moller, Abraham G., Liang, Chun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5592083/
https://www.ncbi.nlm.nih.gov/pubmed/28894651
http://dx.doi.org/10.7717/peerj.3788
_version_ 1783262834753273856
author Moller, Abraham G.
Liang, Chun
author_facet Moller, Abraham G.
Liang, Chun
author_sort Moller, Abraham G.
collection PubMed
description Clustered regularly interspaced short palindromic repeat (CRISPR) systems are the adaptive immune systems of bacteria and archaea against viral infection. While CRISPRs have been exploited as a tool for genetic engineering, their spacer sequences can also provide valuable insights into microbial ecology by linking environmental viruses to their microbial hosts. Despite this importance, metagenomic CRISPR detection remains a major challenge. Here we present a reference-guided CRISPR spacer detection tool (Metagenomic CRISPR Reference-Aided Search Tool—MetaCRAST) that constrains searches based on user-specified direct repeats (DRs). These DRs could be expected from assembly or taxonomic profiles of metagenomes. We compared the performance of MetaCRAST to those of two existing metagenomic CRISPR detection tools—Crass and MinCED—using both real and simulated acid mine drainage (AMD) and enhanced biological phosphorus removal (EBPR) metagenomes. Our evaluation shows MetaCRAST improves CRISPR spacer detection in real metagenomes compared to the de novo CRISPR detection methods Crass and MinCED. Evaluation on simulated metagenomes show it performs better than de novo tools for Illumina metagenomes and comparably for 454 metagenomes. It also has comparable performance dependence on read length and community composition, run time, and accuracy to these tools. MetaCRAST is implemented in Perl, parallelizable through the Many Core Engine (MCE), and takes metagenomic sequence reads and direct repeat queries (FASTA or FASTQ) as input. It is freely available for download at https://github.com/molleraj/MetaCRAST.
format Online
Article
Text
id pubmed-5592083
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-55920832017-09-11 MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes Moller, Abraham G. Liang, Chun PeerJ Bioinformatics Clustered regularly interspaced short palindromic repeat (CRISPR) systems are the adaptive immune systems of bacteria and archaea against viral infection. While CRISPRs have been exploited as a tool for genetic engineering, their spacer sequences can also provide valuable insights into microbial ecology by linking environmental viruses to their microbial hosts. Despite this importance, metagenomic CRISPR detection remains a major challenge. Here we present a reference-guided CRISPR spacer detection tool (Metagenomic CRISPR Reference-Aided Search Tool—MetaCRAST) that constrains searches based on user-specified direct repeats (DRs). These DRs could be expected from assembly or taxonomic profiles of metagenomes. We compared the performance of MetaCRAST to those of two existing metagenomic CRISPR detection tools—Crass and MinCED—using both real and simulated acid mine drainage (AMD) and enhanced biological phosphorus removal (EBPR) metagenomes. Our evaluation shows MetaCRAST improves CRISPR spacer detection in real metagenomes compared to the de novo CRISPR detection methods Crass and MinCED. Evaluation on simulated metagenomes show it performs better than de novo tools for Illumina metagenomes and comparably for 454 metagenomes. It also has comparable performance dependence on read length and community composition, run time, and accuracy to these tools. MetaCRAST is implemented in Perl, parallelizable through the Many Core Engine (MCE), and takes metagenomic sequence reads and direct repeat queries (FASTA or FASTQ) as input. It is freely available for download at https://github.com/molleraj/MetaCRAST. PeerJ Inc. 2017-09-07 /pmc/articles/PMC5592083/ /pubmed/28894651 http://dx.doi.org/10.7717/peerj.3788 Text en ©2017 Moller and Liang http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Moller, Abraham G.
Liang, Chun
MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes
title MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes
title_full MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes
title_fullStr MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes
title_full_unstemmed MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes
title_short MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes
title_sort metacrast: reference-guided extraction of crispr spacers from unassembled metagenomes
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5592083/
https://www.ncbi.nlm.nih.gov/pubmed/28894651
http://dx.doi.org/10.7717/peerj.3788
work_keys_str_mv AT mollerabrahamg metacrastreferenceguidedextractionofcrisprspacersfromunassembledmetagenomes
AT liangchun metacrastreferenceguidedextractionofcrisprspacersfromunassembledmetagenomes