Cargando…

Background Filtering of Clinical Metagenomic Sequencing with a Library Concentration-Normalized Model

Metagenomic next-generation sequencing (mNGS) can accurately detect pathogens in clinical samples. However, wet-lab contamination constrains mNGS analysis and may result in erroneous interpretation of results. Many existing methods rely on large-scale observational microbiome studies and may not be...

Descripción completa

Detalles Bibliográficos
Autores principales: Du, Juan, Zhang, Jingjia, Zhang, Dong, Zhou, Yiwen, Wu, Pengfei, Ding, Wenchao, Wang, Jun, Ouyang, Chuan, Yang, Qiwen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9603461/
https://www.ncbi.nlm.nih.gov/pubmed/36135379
http://dx.doi.org/10.1128/spectrum.01779-22
_version_ 1784817555120062464
author Du, Juan
Zhang, Jingjia
Zhang, Dong
Zhou, Yiwen
Wu, Pengfei
Ding, Wenchao
Wang, Jun
Ouyang, Chuan
Yang, Qiwen
author_facet Du, Juan
Zhang, Jingjia
Zhang, Dong
Zhou, Yiwen
Wu, Pengfei
Ding, Wenchao
Wang, Jun
Ouyang, Chuan
Yang, Qiwen
author_sort Du, Juan
collection PubMed
description Metagenomic next-generation sequencing (mNGS) can accurately detect pathogens in clinical samples. However, wet-lab contamination constrains mNGS analysis and may result in erroneous interpretation of results. Many existing methods rely on large-scale observational microbiome studies and may not be applicable to clinical mNGS tests. By generation of a pretrained profile of common laboratory contaminants, we developed an mNGS noise-filtering model based on the inverse linear relationship between microbial sequencing reads and sample library concentration, named the background elimination and correction by library concentration-normalized (BECLEAN) model. Its efficacy was evaluated with bacteria- and yeast-spiked samples and 28 cerebrospinal fluid (CSF) specimens. The diagnostic accuracy, precision, sensitivity, and specificity of BECLEAN with reference to conventional methods and diagnosis were 92.9%, 86.7%, 100%, and 86.7%, respectively. BECLEAN led to a dramatic reduction of background noise without affecting the true-positive rate and thus can provide a time-saving and convenient tool in various clinical settings. IMPORTANCE Most of the existing methods to remove wet-lab contamination rely on large-scale observational microbiome studies and may not be applicable to clinical mNGS testing in individual cases. In clinical settings, only a handful of samples might be sequenced in a run. The lab-specific microbiome can complicate existing statistical approaches for removing contamination from small-scale clinical metagenomic sequencing data sets; thus, use of a preliminary lab-specific training set is necessary. Our study provides a rapid and accurate background-filtering tool for clinical metagenomic sequencing by generation of a pretrained profile of common laboratory contaminants. Notably, our work demonstrates that the inverse linear relationship between microbial sequencing reads and library concentration can serve to identify true contaminants and evaluate the relative abundance of a taxon in samples by comparing the observed microbial reads to the model-predicted value. Our findings extend the previously published research and demonstrate confirmatory results in clinical settings.
format Online
Article
Text
id pubmed-9603461
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-96034612022-10-27 Background Filtering of Clinical Metagenomic Sequencing with a Library Concentration-Normalized Model Du, Juan Zhang, Jingjia Zhang, Dong Zhou, Yiwen Wu, Pengfei Ding, Wenchao Wang, Jun Ouyang, Chuan Yang, Qiwen Microbiol Spectr Methods and Protocols Metagenomic next-generation sequencing (mNGS) can accurately detect pathogens in clinical samples. However, wet-lab contamination constrains mNGS analysis and may result in erroneous interpretation of results. Many existing methods rely on large-scale observational microbiome studies and may not be applicable to clinical mNGS tests. By generation of a pretrained profile of common laboratory contaminants, we developed an mNGS noise-filtering model based on the inverse linear relationship between microbial sequencing reads and sample library concentration, named the background elimination and correction by library concentration-normalized (BECLEAN) model. Its efficacy was evaluated with bacteria- and yeast-spiked samples and 28 cerebrospinal fluid (CSF) specimens. The diagnostic accuracy, precision, sensitivity, and specificity of BECLEAN with reference to conventional methods and diagnosis were 92.9%, 86.7%, 100%, and 86.7%, respectively. BECLEAN led to a dramatic reduction of background noise without affecting the true-positive rate and thus can provide a time-saving and convenient tool in various clinical settings. IMPORTANCE Most of the existing methods to remove wet-lab contamination rely on large-scale observational microbiome studies and may not be applicable to clinical mNGS testing in individual cases. In clinical settings, only a handful of samples might be sequenced in a run. The lab-specific microbiome can complicate existing statistical approaches for removing contamination from small-scale clinical metagenomic sequencing data sets; thus, use of a preliminary lab-specific training set is necessary. Our study provides a rapid and accurate background-filtering tool for clinical metagenomic sequencing by generation of a pretrained profile of common laboratory contaminants. Notably, our work demonstrates that the inverse linear relationship between microbial sequencing reads and library concentration can serve to identify true contaminants and evaluate the relative abundance of a taxon in samples by comparing the observed microbial reads to the model-predicted value. Our findings extend the previously published research and demonstrate confirmatory results in clinical settings. American Society for Microbiology 2022-09-22 /pmc/articles/PMC9603461/ /pubmed/36135379 http://dx.doi.org/10.1128/spectrum.01779-22 Text en Copyright © 2022 Du et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Methods and Protocols
Du, Juan
Zhang, Jingjia
Zhang, Dong
Zhou, Yiwen
Wu, Pengfei
Ding, Wenchao
Wang, Jun
Ouyang, Chuan
Yang, Qiwen
Background Filtering of Clinical Metagenomic Sequencing with a Library Concentration-Normalized Model
title Background Filtering of Clinical Metagenomic Sequencing with a Library Concentration-Normalized Model
title_full Background Filtering of Clinical Metagenomic Sequencing with a Library Concentration-Normalized Model
title_fullStr Background Filtering of Clinical Metagenomic Sequencing with a Library Concentration-Normalized Model
title_full_unstemmed Background Filtering of Clinical Metagenomic Sequencing with a Library Concentration-Normalized Model
title_short Background Filtering of Clinical Metagenomic Sequencing with a Library Concentration-Normalized Model
title_sort background filtering of clinical metagenomic sequencing with a library concentration-normalized model
topic Methods and Protocols
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9603461/
https://www.ncbi.nlm.nih.gov/pubmed/36135379
http://dx.doi.org/10.1128/spectrum.01779-22
work_keys_str_mv AT dujuan backgroundfilteringofclinicalmetagenomicsequencingwithalibraryconcentrationnormalizedmodel
AT zhangjingjia backgroundfilteringofclinicalmetagenomicsequencingwithalibraryconcentrationnormalizedmodel
AT zhangdong backgroundfilteringofclinicalmetagenomicsequencingwithalibraryconcentrationnormalizedmodel
AT zhouyiwen backgroundfilteringofclinicalmetagenomicsequencingwithalibraryconcentrationnormalizedmodel
AT wupengfei backgroundfilteringofclinicalmetagenomicsequencingwithalibraryconcentrationnormalizedmodel
AT dingwenchao backgroundfilteringofclinicalmetagenomicsequencingwithalibraryconcentrationnormalizedmodel
AT wangjun backgroundfilteringofclinicalmetagenomicsequencingwithalibraryconcentrationnormalizedmodel
AT ouyangchuan backgroundfilteringofclinicalmetagenomicsequencingwithalibraryconcentrationnormalizedmodel
AT yangqiwen backgroundfilteringofclinicalmetagenomicsequencingwithalibraryconcentrationnormalizedmodel