Cargando…

Identification and quantitation of clinically relevant microbes in patient samples: Comparison of three k-mer based classifiers for speed, accuracy, and sensitivity

Infections are a serious health concern worldwide, particularly in vulnerable populations such as the immunocompromised, elderly, and young. Advances in metagenomic sequencing availability, speed, and decreased cost offer the opportunity to supplement or even replace culture-based identification of...

Descripción completa

Detalles Bibliográficos
Autores principales: Watts, George S., Thornton, James E., Youens-Clark, Ken, Ponsero, Alise J., Slepian, Marvin J., Menashi, Emmanuel, Hu, Charles, Deng, Wuquan, Armstrong, David G., Reed, Spenser, Cranmer, Lee D., Hurwitz, Bonnie L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6897419/
https://www.ncbi.nlm.nih.gov/pubmed/31756192
http://dx.doi.org/10.1371/journal.pcbi.1006863
_version_ 1783476965480595456
author Watts, George S.
Thornton, James E.
Youens-Clark, Ken
Ponsero, Alise J.
Slepian, Marvin J.
Menashi, Emmanuel
Hu, Charles
Deng, Wuquan
Armstrong, David G.
Reed, Spenser
Cranmer, Lee D.
Hurwitz, Bonnie L.
author_facet Watts, George S.
Thornton, James E.
Youens-Clark, Ken
Ponsero, Alise J.
Slepian, Marvin J.
Menashi, Emmanuel
Hu, Charles
Deng, Wuquan
Armstrong, David G.
Reed, Spenser
Cranmer, Lee D.
Hurwitz, Bonnie L.
author_sort Watts, George S.
collection PubMed
description Infections are a serious health concern worldwide, particularly in vulnerable populations such as the immunocompromised, elderly, and young. Advances in metagenomic sequencing availability, speed, and decreased cost offer the opportunity to supplement or even replace culture-based identification of pathogens with DNA sequence-based diagnostics. Adopting metagenomic analysis for clinical use requires that all aspects of the workflow are optimized and tested, including data analysis and computational time and resources. We tested the accuracy, sensitivity, and resource requirements of three top metagenomic taxonomic classifiers that use fast k-mer based algorithms: Centrifuge, CLARK, and KrakenUniq. Binary mixtures of bacteria showed all three reliably identified organisms down to 1% relative abundance, while only the relative abundance estimates of Centrifuge and CLARK were accurate. All three classifiers identified the organisms present in their default databases from a mock bacterial community of 20 organisms, but only Centrifuge had no false positives. In addition, Centrifuge required far less computational resources and time for analysis. Centrifuge analysis of metagenomes obtained from samples of VAP, infected DFUs, and FN showed Centrifuge identified pathogenic bacteria and one virus that were corroborated by culture or a clinical PCR assay. Importantly, in both diabetic foot ulcer patients, metagenomic sequencing identified pathogens 4–6 weeks before culture. Finally, we show that Centrifuge results were minimally affected by elimination of time-consuming read quality control and host screening steps.
format Online
Article
Text
id pubmed-6897419
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-68974192019-12-13 Identification and quantitation of clinically relevant microbes in patient samples: Comparison of three k-mer based classifiers for speed, accuracy, and sensitivity Watts, George S. Thornton, James E. Youens-Clark, Ken Ponsero, Alise J. Slepian, Marvin J. Menashi, Emmanuel Hu, Charles Deng, Wuquan Armstrong, David G. Reed, Spenser Cranmer, Lee D. Hurwitz, Bonnie L. PLoS Comput Biol Research Article Infections are a serious health concern worldwide, particularly in vulnerable populations such as the immunocompromised, elderly, and young. Advances in metagenomic sequencing availability, speed, and decreased cost offer the opportunity to supplement or even replace culture-based identification of pathogens with DNA sequence-based diagnostics. Adopting metagenomic analysis for clinical use requires that all aspects of the workflow are optimized and tested, including data analysis and computational time and resources. We tested the accuracy, sensitivity, and resource requirements of three top metagenomic taxonomic classifiers that use fast k-mer based algorithms: Centrifuge, CLARK, and KrakenUniq. Binary mixtures of bacteria showed all three reliably identified organisms down to 1% relative abundance, while only the relative abundance estimates of Centrifuge and CLARK were accurate. All three classifiers identified the organisms present in their default databases from a mock bacterial community of 20 organisms, but only Centrifuge had no false positives. In addition, Centrifuge required far less computational resources and time for analysis. Centrifuge analysis of metagenomes obtained from samples of VAP, infected DFUs, and FN showed Centrifuge identified pathogenic bacteria and one virus that were corroborated by culture or a clinical PCR assay. Importantly, in both diabetic foot ulcer patients, metagenomic sequencing identified pathogens 4–6 weeks before culture. Finally, we show that Centrifuge results were minimally affected by elimination of time-consuming read quality control and host screening steps. Public Library of Science 2019-11-22 /pmc/articles/PMC6897419/ /pubmed/31756192 http://dx.doi.org/10.1371/journal.pcbi.1006863 Text en © 2019 Watts et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Watts, George S.
Thornton, James E.
Youens-Clark, Ken
Ponsero, Alise J.
Slepian, Marvin J.
Menashi, Emmanuel
Hu, Charles
Deng, Wuquan
Armstrong, David G.
Reed, Spenser
Cranmer, Lee D.
Hurwitz, Bonnie L.
Identification and quantitation of clinically relevant microbes in patient samples: Comparison of three k-mer based classifiers for speed, accuracy, and sensitivity
title Identification and quantitation of clinically relevant microbes in patient samples: Comparison of three k-mer based classifiers for speed, accuracy, and sensitivity
title_full Identification and quantitation of clinically relevant microbes in patient samples: Comparison of three k-mer based classifiers for speed, accuracy, and sensitivity
title_fullStr Identification and quantitation of clinically relevant microbes in patient samples: Comparison of three k-mer based classifiers for speed, accuracy, and sensitivity
title_full_unstemmed Identification and quantitation of clinically relevant microbes in patient samples: Comparison of three k-mer based classifiers for speed, accuracy, and sensitivity
title_short Identification and quantitation of clinically relevant microbes in patient samples: Comparison of three k-mer based classifiers for speed, accuracy, and sensitivity
title_sort identification and quantitation of clinically relevant microbes in patient samples: comparison of three k-mer based classifiers for speed, accuracy, and sensitivity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6897419/
https://www.ncbi.nlm.nih.gov/pubmed/31756192
http://dx.doi.org/10.1371/journal.pcbi.1006863
work_keys_str_mv AT wattsgeorges identificationandquantitationofclinicallyrelevantmicrobesinpatientsamplescomparisonofthreekmerbasedclassifiersforspeedaccuracyandsensitivity
AT thorntonjamese identificationandquantitationofclinicallyrelevantmicrobesinpatientsamplescomparisonofthreekmerbasedclassifiersforspeedaccuracyandsensitivity
AT youensclarkken identificationandquantitationofclinicallyrelevantmicrobesinpatientsamplescomparisonofthreekmerbasedclassifiersforspeedaccuracyandsensitivity
AT ponseroalisej identificationandquantitationofclinicallyrelevantmicrobesinpatientsamplescomparisonofthreekmerbasedclassifiersforspeedaccuracyandsensitivity
AT slepianmarvinj identificationandquantitationofclinicallyrelevantmicrobesinpatientsamplescomparisonofthreekmerbasedclassifiersforspeedaccuracyandsensitivity
AT menashiemmanuel identificationandquantitationofclinicallyrelevantmicrobesinpatientsamplescomparisonofthreekmerbasedclassifiersforspeedaccuracyandsensitivity
AT hucharles identificationandquantitationofclinicallyrelevantmicrobesinpatientsamplescomparisonofthreekmerbasedclassifiersforspeedaccuracyandsensitivity
AT dengwuquan identificationandquantitationofclinicallyrelevantmicrobesinpatientsamplescomparisonofthreekmerbasedclassifiersforspeedaccuracyandsensitivity
AT armstrongdavidg identificationandquantitationofclinicallyrelevantmicrobesinpatientsamplescomparisonofthreekmerbasedclassifiersforspeedaccuracyandsensitivity
AT reedspenser identificationandquantitationofclinicallyrelevantmicrobesinpatientsamplescomparisonofthreekmerbasedclassifiersforspeedaccuracyandsensitivity
AT cranmerleed identificationandquantitationofclinicallyrelevantmicrobesinpatientsamplescomparisonofthreekmerbasedclassifiersforspeedaccuracyandsensitivity
AT hurwitzbonniel identificationandquantitationofclinicallyrelevantmicrobesinpatientsamplescomparisonofthreekmerbasedclassifiersforspeedaccuracyandsensitivity