Cargando…
MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes
Clustered regularly interspaced short palindromic repeat (CRISPR) systems are the adaptive immune systems of bacteria and archaea against viral infection. While CRISPRs have been exploited as a tool for genetic engineering, their spacer sequences can also provide valuable insights into microbial eco...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5592083/ https://www.ncbi.nlm.nih.gov/pubmed/28894651 http://dx.doi.org/10.7717/peerj.3788 |
_version_ | 1783262834753273856 |
---|---|
author | Moller, Abraham G. Liang, Chun |
author_facet | Moller, Abraham G. Liang, Chun |
author_sort | Moller, Abraham G. |
collection | PubMed |
description | Clustered regularly interspaced short palindromic repeat (CRISPR) systems are the adaptive immune systems of bacteria and archaea against viral infection. While CRISPRs have been exploited as a tool for genetic engineering, their spacer sequences can also provide valuable insights into microbial ecology by linking environmental viruses to their microbial hosts. Despite this importance, metagenomic CRISPR detection remains a major challenge. Here we present a reference-guided CRISPR spacer detection tool (Metagenomic CRISPR Reference-Aided Search Tool—MetaCRAST) that constrains searches based on user-specified direct repeats (DRs). These DRs could be expected from assembly or taxonomic profiles of metagenomes. We compared the performance of MetaCRAST to those of two existing metagenomic CRISPR detection tools—Crass and MinCED—using both real and simulated acid mine drainage (AMD) and enhanced biological phosphorus removal (EBPR) metagenomes. Our evaluation shows MetaCRAST improves CRISPR spacer detection in real metagenomes compared to the de novo CRISPR detection methods Crass and MinCED. Evaluation on simulated metagenomes show it performs better than de novo tools for Illumina metagenomes and comparably for 454 metagenomes. It also has comparable performance dependence on read length and community composition, run time, and accuracy to these tools. MetaCRAST is implemented in Perl, parallelizable through the Many Core Engine (MCE), and takes metagenomic sequence reads and direct repeat queries (FASTA or FASTQ) as input. It is freely available for download at https://github.com/molleraj/MetaCRAST. |
format | Online Article Text |
id | pubmed-5592083 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-55920832017-09-11 MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes Moller, Abraham G. Liang, Chun PeerJ Bioinformatics Clustered regularly interspaced short palindromic repeat (CRISPR) systems are the adaptive immune systems of bacteria and archaea against viral infection. While CRISPRs have been exploited as a tool for genetic engineering, their spacer sequences can also provide valuable insights into microbial ecology by linking environmental viruses to their microbial hosts. Despite this importance, metagenomic CRISPR detection remains a major challenge. Here we present a reference-guided CRISPR spacer detection tool (Metagenomic CRISPR Reference-Aided Search Tool—MetaCRAST) that constrains searches based on user-specified direct repeats (DRs). These DRs could be expected from assembly or taxonomic profiles of metagenomes. We compared the performance of MetaCRAST to those of two existing metagenomic CRISPR detection tools—Crass and MinCED—using both real and simulated acid mine drainage (AMD) and enhanced biological phosphorus removal (EBPR) metagenomes. Our evaluation shows MetaCRAST improves CRISPR spacer detection in real metagenomes compared to the de novo CRISPR detection methods Crass and MinCED. Evaluation on simulated metagenomes show it performs better than de novo tools for Illumina metagenomes and comparably for 454 metagenomes. It also has comparable performance dependence on read length and community composition, run time, and accuracy to these tools. MetaCRAST is implemented in Perl, parallelizable through the Many Core Engine (MCE), and takes metagenomic sequence reads and direct repeat queries (FASTA or FASTQ) as input. It is freely available for download at https://github.com/molleraj/MetaCRAST. PeerJ Inc. 2017-09-07 /pmc/articles/PMC5592083/ /pubmed/28894651 http://dx.doi.org/10.7717/peerj.3788 Text en ©2017 Moller and Liang http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Moller, Abraham G. Liang, Chun MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes |
title | MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes |
title_full | MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes |
title_fullStr | MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes |
title_full_unstemmed | MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes |
title_short | MetaCRAST: reference-guided extraction of CRISPR spacers from unassembled metagenomes |
title_sort | metacrast: reference-guided extraction of crispr spacers from unassembled metagenomes |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5592083/ https://www.ncbi.nlm.nih.gov/pubmed/28894651 http://dx.doi.org/10.7717/peerj.3788 |
work_keys_str_mv | AT mollerabrahamg metacrastreferenceguidedextractionofcrisprspacersfromunassembledmetagenomes AT liangchun metacrastreferenceguidedextractionofcrisprspacersfromunassembledmetagenomes |