Cargando…

WGSQuikr: Fast Whole-Genome Shotgun Metagenomic Classification

With the decrease in cost and increase in output of whole-genome shotgun technologies, many metagenomic studies are utilizing this approach in lieu of the more traditional 16S rRNA amplicon technique. Due to the large number of relatively short reads output from whole-genome shotgun technologies, th...

Descripción completa

Detalles Bibliográficos
Autores principales: Koslicki, David, Foucart, Simon, Rosen, Gail
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3953531/
https://www.ncbi.nlm.nih.gov/pubmed/24626336
http://dx.doi.org/10.1371/journal.pone.0091784
_version_ 1782307373329678336
author Koslicki, David
Foucart, Simon
Rosen, Gail
author_facet Koslicki, David
Foucart, Simon
Rosen, Gail
author_sort Koslicki, David
collection PubMed
description With the decrease in cost and increase in output of whole-genome shotgun technologies, many metagenomic studies are utilizing this approach in lieu of the more traditional 16S rRNA amplicon technique. Due to the large number of relatively short reads output from whole-genome shotgun technologies, there is a need for fast and accurate short-read OTU classifiers. While there are relatively fast and accurate algorithms available, such as MetaPhlAn, MetaPhyler, PhyloPythiaS, and PhymmBL, these algorithms still classify samples in a read-by-read fashion and so execution times can range from hours to days on large datasets. We introduce WGSQuikr, a reconstruction method which can compute a vector of taxonomic assignments and their proportions in the sample with remarkable speed and accuracy. We demonstrate on simulated data that WGSQuikr is typically more accurate and up to an order of magnitude faster than the aforementioned classification algorithms. We also verify the utility of WGSQuikr on real biological data in the form of a mock community. WGSQuikr is a Whole-Genome Shotgun QUadratic, Iterative, [Image: see text]-mer based Reconstruction method which extends the previously introduced 16S rRNA-based algorithm Quikr. A MATLAB implementation of WGSQuikr is available at: http://sourceforge.net/projects/wgsquikr.
format Online
Article
Text
id pubmed-3953531
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39535312014-03-18 WGSQuikr: Fast Whole-Genome Shotgun Metagenomic Classification Koslicki, David Foucart, Simon Rosen, Gail PLoS One Research Article With the decrease in cost and increase in output of whole-genome shotgun technologies, many metagenomic studies are utilizing this approach in lieu of the more traditional 16S rRNA amplicon technique. Due to the large number of relatively short reads output from whole-genome shotgun technologies, there is a need for fast and accurate short-read OTU classifiers. While there are relatively fast and accurate algorithms available, such as MetaPhlAn, MetaPhyler, PhyloPythiaS, and PhymmBL, these algorithms still classify samples in a read-by-read fashion and so execution times can range from hours to days on large datasets. We introduce WGSQuikr, a reconstruction method which can compute a vector of taxonomic assignments and their proportions in the sample with remarkable speed and accuracy. We demonstrate on simulated data that WGSQuikr is typically more accurate and up to an order of magnitude faster than the aforementioned classification algorithms. We also verify the utility of WGSQuikr on real biological data in the form of a mock community. WGSQuikr is a Whole-Genome Shotgun QUadratic, Iterative, [Image: see text]-mer based Reconstruction method which extends the previously introduced 16S rRNA-based algorithm Quikr. A MATLAB implementation of WGSQuikr is available at: http://sourceforge.net/projects/wgsquikr. Public Library of Science 2014-03-13 /pmc/articles/PMC3953531/ /pubmed/24626336 http://dx.doi.org/10.1371/journal.pone.0091784 Text en © 2014 Koslicki et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Koslicki, David
Foucart, Simon
Rosen, Gail
WGSQuikr: Fast Whole-Genome Shotgun Metagenomic Classification
title WGSQuikr: Fast Whole-Genome Shotgun Metagenomic Classification
title_full WGSQuikr: Fast Whole-Genome Shotgun Metagenomic Classification
title_fullStr WGSQuikr: Fast Whole-Genome Shotgun Metagenomic Classification
title_full_unstemmed WGSQuikr: Fast Whole-Genome Shotgun Metagenomic Classification
title_short WGSQuikr: Fast Whole-Genome Shotgun Metagenomic Classification
title_sort wgsquikr: fast whole-genome shotgun metagenomic classification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3953531/
https://www.ncbi.nlm.nih.gov/pubmed/24626336
http://dx.doi.org/10.1371/journal.pone.0091784
work_keys_str_mv AT koslickidavid wgsquikrfastwholegenomeshotgunmetagenomicclassification
AT foucartsimon wgsquikrfastwholegenomeshotgunmetagenomicclassification
AT rosengail wgsquikrfastwholegenomeshotgunmetagenomicclassification