Cargando…

Systematic comparison of published host gene expression signatures for bacterial/viral discrimination

BACKGROUND: Measuring host gene expression is a promising diagnostic strategy to discriminate bacterial and viral infections. Multiple signatures of varying size, complexity, and target populations have been described. However, there is little information to indicate how the performance of various p...

Descripción completa

Detalles Bibliográficos
Autores principales: Bodkin, Nicholas, Ross, Melissa, McClain, Micah T., Ko, Emily R., Woods, Christopher W., Ginsburg, Geoffrey S., Henao, Ricardo, Tsalik, Ephraim L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8858657/
https://www.ncbi.nlm.nih.gov/pubmed/35184750
http://dx.doi.org/10.1186/s13073-022-01025-x
_version_ 1784654285854736384
author Bodkin, Nicholas
Ross, Melissa
McClain, Micah T.
Ko, Emily R.
Woods, Christopher W.
Ginsburg, Geoffrey S.
Henao, Ricardo
Tsalik, Ephraim L.
author_facet Bodkin, Nicholas
Ross, Melissa
McClain, Micah T.
Ko, Emily R.
Woods, Christopher W.
Ginsburg, Geoffrey S.
Henao, Ricardo
Tsalik, Ephraim L.
author_sort Bodkin, Nicholas
collection PubMed
description BACKGROUND: Measuring host gene expression is a promising diagnostic strategy to discriminate bacterial and viral infections. Multiple signatures of varying size, complexity, and target populations have been described. However, there is little information to indicate how the performance of various published signatures compare to one another. METHODS: This systematic comparison of host gene expression signatures evaluated the performance of 28 signatures, validating them in 4589 subjects from 51 publicly available datasets. Thirteen COVID-specific datasets with 1416 subjects were included in a separate analysis. Individual signature performance was evaluated using the area under the receiving operating characteristic curve (AUC) value. Overall signature performance was evaluated using median AUCs and accuracies. RESULTS: Signature performance varied widely, with median AUCs ranging from 0.55 to 0.96 for bacterial classification and 0.69–0.97 for viral classification. Signature size varied (1–398 genes), with smaller signatures generally performing more poorly (P < 0.04). Viral infection was easier to diagnose than bacterial infection (84% vs. 79% overall accuracy, respectively; P < .001). Host gene expression classifiers performed more poorly in some pediatric populations (3 months–1 year and 2–11 years) compared to the adult population for both bacterial infection (73% and 70% vs. 82%, respectively; P < .001) and viral infection (80% and 79% vs. 88%, respectively; P < .001). We did not observe classification differences based on illness severity as defined by ICU admission for bacterial or viral infections. The median AUC across all signatures for COVID-19 classification was 0.80 compared to 0.83 for viral classification in the same datasets. CONCLUSIONS: In this systematic comparison of 28 host gene expression signatures, we observed differences based on a signature’s size and characteristics of the validation population, including age and infection type. However, populations used for signature discovery did not impact performance, underscoring the redundancy among many of these signatures. Furthermore, differential performance in specific populations may only be observable through this type of large-scale validation. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-022-01025-x.
format Online
Article
Text
id pubmed-8858657
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-88586572022-02-22 Systematic comparison of published host gene expression signatures for bacterial/viral discrimination Bodkin, Nicholas Ross, Melissa McClain, Micah T. Ko, Emily R. Woods, Christopher W. Ginsburg, Geoffrey S. Henao, Ricardo Tsalik, Ephraim L. Genome Med Research BACKGROUND: Measuring host gene expression is a promising diagnostic strategy to discriminate bacterial and viral infections. Multiple signatures of varying size, complexity, and target populations have been described. However, there is little information to indicate how the performance of various published signatures compare to one another. METHODS: This systematic comparison of host gene expression signatures evaluated the performance of 28 signatures, validating them in 4589 subjects from 51 publicly available datasets. Thirteen COVID-specific datasets with 1416 subjects were included in a separate analysis. Individual signature performance was evaluated using the area under the receiving operating characteristic curve (AUC) value. Overall signature performance was evaluated using median AUCs and accuracies. RESULTS: Signature performance varied widely, with median AUCs ranging from 0.55 to 0.96 for bacterial classification and 0.69–0.97 for viral classification. Signature size varied (1–398 genes), with smaller signatures generally performing more poorly (P < 0.04). Viral infection was easier to diagnose than bacterial infection (84% vs. 79% overall accuracy, respectively; P < .001). Host gene expression classifiers performed more poorly in some pediatric populations (3 months–1 year and 2–11 years) compared to the adult population for both bacterial infection (73% and 70% vs. 82%, respectively; P < .001) and viral infection (80% and 79% vs. 88%, respectively; P < .001). We did not observe classification differences based on illness severity as defined by ICU admission for bacterial or viral infections. The median AUC across all signatures for COVID-19 classification was 0.80 compared to 0.83 for viral classification in the same datasets. CONCLUSIONS: In this systematic comparison of 28 host gene expression signatures, we observed differences based on a signature’s size and characteristics of the validation population, including age and infection type. However, populations used for signature discovery did not impact performance, underscoring the redundancy among many of these signatures. Furthermore, differential performance in specific populations may only be observable through this type of large-scale validation. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-022-01025-x. BioMed Central 2022-02-21 /pmc/articles/PMC8858657/ /pubmed/35184750 http://dx.doi.org/10.1186/s13073-022-01025-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Bodkin, Nicholas
Ross, Melissa
McClain, Micah T.
Ko, Emily R.
Woods, Christopher W.
Ginsburg, Geoffrey S.
Henao, Ricardo
Tsalik, Ephraim L.
Systematic comparison of published host gene expression signatures for bacterial/viral discrimination
title Systematic comparison of published host gene expression signatures for bacterial/viral discrimination
title_full Systematic comparison of published host gene expression signatures for bacterial/viral discrimination
title_fullStr Systematic comparison of published host gene expression signatures for bacterial/viral discrimination
title_full_unstemmed Systematic comparison of published host gene expression signatures for bacterial/viral discrimination
title_short Systematic comparison of published host gene expression signatures for bacterial/viral discrimination
title_sort systematic comparison of published host gene expression signatures for bacterial/viral discrimination
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8858657/
https://www.ncbi.nlm.nih.gov/pubmed/35184750
http://dx.doi.org/10.1186/s13073-022-01025-x
work_keys_str_mv AT bodkinnicholas systematiccomparisonofpublishedhostgeneexpressionsignaturesforbacterialviraldiscrimination
AT rossmelissa systematiccomparisonofpublishedhostgeneexpressionsignaturesforbacterialviraldiscrimination
AT mcclainmicaht systematiccomparisonofpublishedhostgeneexpressionsignaturesforbacterialviraldiscrimination
AT koemilyr systematiccomparisonofpublishedhostgeneexpressionsignaturesforbacterialviraldiscrimination
AT woodschristopherw systematiccomparisonofpublishedhostgeneexpressionsignaturesforbacterialviraldiscrimination
AT ginsburggeoffreys systematiccomparisonofpublishedhostgeneexpressionsignaturesforbacterialviraldiscrimination
AT henaoricardo systematiccomparisonofpublishedhostgeneexpressionsignaturesforbacterialviraldiscrimination
AT tsalikephraiml systematiccomparisonofpublishedhostgeneexpressionsignaturesforbacterialviraldiscrimination