Cargando…

Development of novel parameters for pathogen identification in clinical metagenomic next-generation sequencing

Introduction: Metagenomic next-generation sequencing (mNGS) has emerged as a powerful tool for rapid pathogen identification in clinical practice. However, the parameters used to interpret mNGS data, such as read count, genus rank, and coverage, lack explicit performance evaluation. In this study, t...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Xiwen, Yan, Jinghai, Huang, Hao, Ai, Lu, Yu, Xuegao, Zhong, Pengqiang, Chen, Yili, Liang, Zhikun, Qiu, Wancen, Huang, Huiying, Yan, Wenyan, Liang, Yan, Chen, Peisong, Wang, Ruizhi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10693447/
https://www.ncbi.nlm.nih.gov/pubmed/38046047
http://dx.doi.org/10.3389/fgene.2023.1266990
_version_ 1785153163320360960
author Jiang, Xiwen
Yan, Jinghai
Huang, Hao
Ai, Lu
Yu, Xuegao
Zhong, Pengqiang
Chen, Yili
Liang, Zhikun
Qiu, Wancen
Huang, Huiying
Yan, Wenyan
Liang, Yan
Chen, Peisong
Wang, Ruizhi
author_facet Jiang, Xiwen
Yan, Jinghai
Huang, Hao
Ai, Lu
Yu, Xuegao
Zhong, Pengqiang
Chen, Yili
Liang, Zhikun
Qiu, Wancen
Huang, Huiying
Yan, Wenyan
Liang, Yan
Chen, Peisong
Wang, Ruizhi
author_sort Jiang, Xiwen
collection PubMed
description Introduction: Metagenomic next-generation sequencing (mNGS) has emerged as a powerful tool for rapid pathogen identification in clinical practice. However, the parameters used to interpret mNGS data, such as read count, genus rank, and coverage, lack explicit performance evaluation. In this study, the developed indicators as well as novel parameters were assessed for their performance in bacterium detection. Methods: We developed several relevant parameters, including 10M normalized reads, double-discard reads, Genus Rank Ratio, King Genus Rank Ratio, Genus Rank Ratio*Genus Rank, and King Genus Rank Ratio*Genus Rank. These parameters, together with frequently used read indicators including raw reads, reads per million mapped reads (RPM), transcript per kilobase per million mapped reads (TPM), Genus Rank, and coverage were analyzed for their diagnostic efficiency in bronchoalveolar lavage fluid (BALF), a common source for detecting eight bacterium pathogens: Acinetobacter baumannii, Klebsiella pneumoniae, Streptococcus pneumoniae, Staphylococcus aureus, Hemophilus influenzae, Stenotrophomonas maltophilia, Pseudomonas aeruginosa, and Aspergillus fumigatus. Results: The results demonstrated that these indicators exhibited good diagnostic efficacy for the eight pathogens. The AUC values of all indicators were almost greater than 0.9, and the corresponding sensitivity and specificity values were almost greater than 0.8, excepted coverage. The negative predictive value of all indicators was greater than 0.9. The results showed that the use of double-discarded reads, Genus Rank Ratio*Genus Rank, and King Genus Rank Ratio*Genus Rank exhibited better diagnostic efficiency than that of raw reads, RPM, TPM, and in Genus Rank. These parameters can serve as a reference for interpreting mNGS data of BALF. Moreover, precision filters integrating our novel parameters were built to detect the eight bacterium pathogens in BALF samples through machine learning. Summary: In this study, we developed a set of novel parameters for pathogen identification in clinical mNGS based on reads and ranking. These parameters were found to be more effective in diagnosing pathogens than traditional approaches. The findings provide valuable insights for improving the interpretation of mNGS reports in clinical settings, specifically in BALF analysis.
format Online
Article
Text
id pubmed-10693447
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-106934472023-12-03 Development of novel parameters for pathogen identification in clinical metagenomic next-generation sequencing Jiang, Xiwen Yan, Jinghai Huang, Hao Ai, Lu Yu, Xuegao Zhong, Pengqiang Chen, Yili Liang, Zhikun Qiu, Wancen Huang, Huiying Yan, Wenyan Liang, Yan Chen, Peisong Wang, Ruizhi Front Genet Genetics Introduction: Metagenomic next-generation sequencing (mNGS) has emerged as a powerful tool for rapid pathogen identification in clinical practice. However, the parameters used to interpret mNGS data, such as read count, genus rank, and coverage, lack explicit performance evaluation. In this study, the developed indicators as well as novel parameters were assessed for their performance in bacterium detection. Methods: We developed several relevant parameters, including 10M normalized reads, double-discard reads, Genus Rank Ratio, King Genus Rank Ratio, Genus Rank Ratio*Genus Rank, and King Genus Rank Ratio*Genus Rank. These parameters, together with frequently used read indicators including raw reads, reads per million mapped reads (RPM), transcript per kilobase per million mapped reads (TPM), Genus Rank, and coverage were analyzed for their diagnostic efficiency in bronchoalveolar lavage fluid (BALF), a common source for detecting eight bacterium pathogens: Acinetobacter baumannii, Klebsiella pneumoniae, Streptococcus pneumoniae, Staphylococcus aureus, Hemophilus influenzae, Stenotrophomonas maltophilia, Pseudomonas aeruginosa, and Aspergillus fumigatus. Results: The results demonstrated that these indicators exhibited good diagnostic efficacy for the eight pathogens. The AUC values of all indicators were almost greater than 0.9, and the corresponding sensitivity and specificity values were almost greater than 0.8, excepted coverage. The negative predictive value of all indicators was greater than 0.9. The results showed that the use of double-discarded reads, Genus Rank Ratio*Genus Rank, and King Genus Rank Ratio*Genus Rank exhibited better diagnostic efficiency than that of raw reads, RPM, TPM, and in Genus Rank. These parameters can serve as a reference for interpreting mNGS data of BALF. Moreover, precision filters integrating our novel parameters were built to detect the eight bacterium pathogens in BALF samples through machine learning. Summary: In this study, we developed a set of novel parameters for pathogen identification in clinical mNGS based on reads and ranking. These parameters were found to be more effective in diagnosing pathogens than traditional approaches. The findings provide valuable insights for improving the interpretation of mNGS reports in clinical settings, specifically in BALF analysis. Frontiers Media S.A. 2023-11-17 /pmc/articles/PMC10693447/ /pubmed/38046047 http://dx.doi.org/10.3389/fgene.2023.1266990 Text en Copyright © 2023 Jiang, Yan, Huang, Ai, Yu, Zhong, Chen, Liang, Qiu, Huang, Yan, Liang, Chen and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Jiang, Xiwen
Yan, Jinghai
Huang, Hao
Ai, Lu
Yu, Xuegao
Zhong, Pengqiang
Chen, Yili
Liang, Zhikun
Qiu, Wancen
Huang, Huiying
Yan, Wenyan
Liang, Yan
Chen, Peisong
Wang, Ruizhi
Development of novel parameters for pathogen identification in clinical metagenomic next-generation sequencing
title Development of novel parameters for pathogen identification in clinical metagenomic next-generation sequencing
title_full Development of novel parameters for pathogen identification in clinical metagenomic next-generation sequencing
title_fullStr Development of novel parameters for pathogen identification in clinical metagenomic next-generation sequencing
title_full_unstemmed Development of novel parameters for pathogen identification in clinical metagenomic next-generation sequencing
title_short Development of novel parameters for pathogen identification in clinical metagenomic next-generation sequencing
title_sort development of novel parameters for pathogen identification in clinical metagenomic next-generation sequencing
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10693447/
https://www.ncbi.nlm.nih.gov/pubmed/38046047
http://dx.doi.org/10.3389/fgene.2023.1266990
work_keys_str_mv AT jiangxiwen developmentofnovelparametersforpathogenidentificationinclinicalmetagenomicnextgenerationsequencing
AT yanjinghai developmentofnovelparametersforpathogenidentificationinclinicalmetagenomicnextgenerationsequencing
AT huanghao developmentofnovelparametersforpathogenidentificationinclinicalmetagenomicnextgenerationsequencing
AT ailu developmentofnovelparametersforpathogenidentificationinclinicalmetagenomicnextgenerationsequencing
AT yuxuegao developmentofnovelparametersforpathogenidentificationinclinicalmetagenomicnextgenerationsequencing
AT zhongpengqiang developmentofnovelparametersforpathogenidentificationinclinicalmetagenomicnextgenerationsequencing
AT chenyili developmentofnovelparametersforpathogenidentificationinclinicalmetagenomicnextgenerationsequencing
AT liangzhikun developmentofnovelparametersforpathogenidentificationinclinicalmetagenomicnextgenerationsequencing
AT qiuwancen developmentofnovelparametersforpathogenidentificationinclinicalmetagenomicnextgenerationsequencing
AT huanghuiying developmentofnovelparametersforpathogenidentificationinclinicalmetagenomicnextgenerationsequencing
AT yanwenyan developmentofnovelparametersforpathogenidentificationinclinicalmetagenomicnextgenerationsequencing
AT liangyan developmentofnovelparametersforpathogenidentificationinclinicalmetagenomicnextgenerationsequencing
AT chenpeisong developmentofnovelparametersforpathogenidentificationinclinicalmetagenomicnextgenerationsequencing
AT wangruizhi developmentofnovelparametersforpathogenidentificationinclinicalmetagenomicnextgenerationsequencing