Cargando…

bvnGPS: a generalizable diagnostic model for acute bacterial and viral infection using integrative host transcriptomics and pretrained neural networks

MOTIVATION: The confusion of acute inflammation infected by virus and bacteria or noninfectious inflammation will lead to missing the best therapy occasion resulting in poor prognoses. The diagnostic model based on host gene expression has been widely used to diagnose acute infections, but the clini...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Qizhi, Zheng, Xubin, Xie, Jize, Wang, Ran, Li, Mengyao, Wong, Man-Hon, Leung, Kwong-Sak, Li, Shuai, Geng, Qingshan, Cheng, Lixin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9997702/
https://www.ncbi.nlm.nih.gov/pubmed/36857587
http://dx.doi.org/10.1093/bioinformatics/btad109
_version_ 1784903310316142592
author Li, Qizhi
Zheng, Xubin
Xie, Jize
Wang, Ran
Li, Mengyao
Wong, Man-Hon
Leung, Kwong-Sak
Li, Shuai
Geng, Qingshan
Cheng, Lixin
author_facet Li, Qizhi
Zheng, Xubin
Xie, Jize
Wang, Ran
Li, Mengyao
Wong, Man-Hon
Leung, Kwong-Sak
Li, Shuai
Geng, Qingshan
Cheng, Lixin
author_sort Li, Qizhi
collection PubMed
description MOTIVATION: The confusion of acute inflammation infected by virus and bacteria or noninfectious inflammation will lead to missing the best therapy occasion resulting in poor prognoses. The diagnostic model based on host gene expression has been widely used to diagnose acute infections, but the clinical usage was hindered by the capability across different samples and cohorts due to the small sample size for signature training and discovery. RESULTS: Here, we construct a large-scale dataset integrating multiple host transcriptomic data and analyze it using a sophisticated strategy which removes batch effect and extracts the common information from different cohorts based on the relative expression alteration of gene pairs. We assemble 2680 samples across 16 cohorts and separately build gene pair signature (GPS) for bacterial, viral, and noninfected patients. The three GPSs are further assembled into an antibiotic decision model (bacterial–viral–noninfected GPS, bvnGPS) using multiclass neural networks, which is able to determine whether a patient is bacterial infected, viral infected, or noninfected. bvnGPS can distinguish bacterial infection with area under the receiver operating characteristic curve (AUC) of 0.953 (95% confidence interval, 0.948–0.958) and viral infection with AUC of 0.956 (0.951–0.961) in the test set (N = 760). In the validation set (N = 147), bvnGPS also shows strong performance by attaining an AUC of 0.988 (0.978–0.998) on bacterial-versus-other and an AUC of 0.994 (0.984–1.000) on viral-versus-other. bvnGPS has the potential to be used in clinical practice and the proposed procedure provides insight into data integration, feature selection and multiclass classification for host transcriptomics data. AVAILABILITY AND IMPLEMENTATION: The codes implementing bvnGPS are available at https://github.com/Ritchiegit/bvnGPS. The construction of iPAGE algorithm and the training of neural network was conducted on Python 3.7 with Scikit-learn 0.24.1 and PyTorch 1.7. The visualization of the results was implemented on R 4.2, Python 3.7, and Matplotlib 3.3.4.
format Online
Article
Text
id pubmed-9997702
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-99977022023-03-10 bvnGPS: a generalizable diagnostic model for acute bacterial and viral infection using integrative host transcriptomics and pretrained neural networks Li, Qizhi Zheng, Xubin Xie, Jize Wang, Ran Li, Mengyao Wong, Man-Hon Leung, Kwong-Sak Li, Shuai Geng, Qingshan Cheng, Lixin Bioinformatics Original Paper MOTIVATION: The confusion of acute inflammation infected by virus and bacteria or noninfectious inflammation will lead to missing the best therapy occasion resulting in poor prognoses. The diagnostic model based on host gene expression has been widely used to diagnose acute infections, but the clinical usage was hindered by the capability across different samples and cohorts due to the small sample size for signature training and discovery. RESULTS: Here, we construct a large-scale dataset integrating multiple host transcriptomic data and analyze it using a sophisticated strategy which removes batch effect and extracts the common information from different cohorts based on the relative expression alteration of gene pairs. We assemble 2680 samples across 16 cohorts and separately build gene pair signature (GPS) for bacterial, viral, and noninfected patients. The three GPSs are further assembled into an antibiotic decision model (bacterial–viral–noninfected GPS, bvnGPS) using multiclass neural networks, which is able to determine whether a patient is bacterial infected, viral infected, or noninfected. bvnGPS can distinguish bacterial infection with area under the receiver operating characteristic curve (AUC) of 0.953 (95% confidence interval, 0.948–0.958) and viral infection with AUC of 0.956 (0.951–0.961) in the test set (N = 760). In the validation set (N = 147), bvnGPS also shows strong performance by attaining an AUC of 0.988 (0.978–0.998) on bacterial-versus-other and an AUC of 0.994 (0.984–1.000) on viral-versus-other. bvnGPS has the potential to be used in clinical practice and the proposed procedure provides insight into data integration, feature selection and multiclass classification for host transcriptomics data. AVAILABILITY AND IMPLEMENTATION: The codes implementing bvnGPS are available at https://github.com/Ritchiegit/bvnGPS. The construction of iPAGE algorithm and the training of neural network was conducted on Python 3.7 with Scikit-learn 0.24.1 and PyTorch 1.7. The visualization of the results was implemented on R 4.2, Python 3.7, and Matplotlib 3.3.4. Oxford University Press 2023-03-01 /pmc/articles/PMC9997702/ /pubmed/36857587 http://dx.doi.org/10.1093/bioinformatics/btad109 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Li, Qizhi
Zheng, Xubin
Xie, Jize
Wang, Ran
Li, Mengyao
Wong, Man-Hon
Leung, Kwong-Sak
Li, Shuai
Geng, Qingshan
Cheng, Lixin
bvnGPS: a generalizable diagnostic model for acute bacterial and viral infection using integrative host transcriptomics and pretrained neural networks
title bvnGPS: a generalizable diagnostic model for acute bacterial and viral infection using integrative host transcriptomics and pretrained neural networks
title_full bvnGPS: a generalizable diagnostic model for acute bacterial and viral infection using integrative host transcriptomics and pretrained neural networks
title_fullStr bvnGPS: a generalizable diagnostic model for acute bacterial and viral infection using integrative host transcriptomics and pretrained neural networks
title_full_unstemmed bvnGPS: a generalizable diagnostic model for acute bacterial and viral infection using integrative host transcriptomics and pretrained neural networks
title_short bvnGPS: a generalizable diagnostic model for acute bacterial and viral infection using integrative host transcriptomics and pretrained neural networks
title_sort bvngps: a generalizable diagnostic model for acute bacterial and viral infection using integrative host transcriptomics and pretrained neural networks
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9997702/
https://www.ncbi.nlm.nih.gov/pubmed/36857587
http://dx.doi.org/10.1093/bioinformatics/btad109
work_keys_str_mv AT liqizhi bvngpsageneralizablediagnosticmodelforacutebacterialandviralinfectionusingintegrativehosttranscriptomicsandpretrainedneuralnetworks
AT zhengxubin bvngpsageneralizablediagnosticmodelforacutebacterialandviralinfectionusingintegrativehosttranscriptomicsandpretrainedneuralnetworks
AT xiejize bvngpsageneralizablediagnosticmodelforacutebacterialandviralinfectionusingintegrativehosttranscriptomicsandpretrainedneuralnetworks
AT wangran bvngpsageneralizablediagnosticmodelforacutebacterialandviralinfectionusingintegrativehosttranscriptomicsandpretrainedneuralnetworks
AT limengyao bvngpsageneralizablediagnosticmodelforacutebacterialandviralinfectionusingintegrativehosttranscriptomicsandpretrainedneuralnetworks
AT wongmanhon bvngpsageneralizablediagnosticmodelforacutebacterialandviralinfectionusingintegrativehosttranscriptomicsandpretrainedneuralnetworks
AT leungkwongsak bvngpsageneralizablediagnosticmodelforacutebacterialandviralinfectionusingintegrativehosttranscriptomicsandpretrainedneuralnetworks
AT lishuai bvngpsageneralizablediagnosticmodelforacutebacterialandviralinfectionusingintegrativehosttranscriptomicsandpretrainedneuralnetworks
AT gengqingshan bvngpsageneralizablediagnosticmodelforacutebacterialandviralinfectionusingintegrativehosttranscriptomicsandpretrainedneuralnetworks
AT chenglixin bvngpsageneralizablediagnosticmodelforacutebacterialandviralinfectionusingintegrativehosttranscriptomicsandpretrainedneuralnetworks