Cargando…
A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples
BACKGROUND: Viral acute respiratory illnesses (viral ARIs) contribute significantly to human morbidity and mortality worldwide, but their successful treatment requires timely diagnosis of viral etiology, which is complicated by overlap in clinical presentation with the non-viral ARIs. Multiple pande...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10463681/ https://www.ncbi.nlm.nih.gov/pubmed/37641125 http://dx.doi.org/10.1186/s13073-023-01216-0 |
_version_ | 1785098289039802368 |
---|---|
author | Pandya, Rushika He, Yudong D. Sweeney, Timothy E. Hasin-Brumshtein, Yehudit Khatri, Purvesh |
author_facet | Pandya, Rushika He, Yudong D. Sweeney, Timothy E. Hasin-Brumshtein, Yehudit Khatri, Purvesh |
author_sort | Pandya, Rushika |
collection | PubMed |
description | BACKGROUND: Viral acute respiratory illnesses (viral ARIs) contribute significantly to human morbidity and mortality worldwide, but their successful treatment requires timely diagnosis of viral etiology, which is complicated by overlap in clinical presentation with the non-viral ARIs. Multiple pandemics in the twenty-first century to date have further highlighted the unmet need for effective monitoring of clinically relevant emerging viruses. Recent studies have identified conserved host response to viral infections in the blood. METHODS: We hypothesize that a similarly conserved host response in nasal samples can be utilized for diagnosis and to rule out viral infection in symptomatic patients when current diagnostic tests are negative. Using a multi-cohort analysis framework, we analyzed 1555 nasal samples across 10 independent cohorts dividing them into training and validation. RESULTS: Using six of the datasets for training, we identified 119 genes that are consistently differentially expressed in viral ARI patients (N = 236) compared to healthy controls (N = 146) and further down-selected 33 genes for classifier development. The resulting locked logistic regression-based classifier using the 33-mRNAs had AUC of 0.94 and 0.89 in the six training and four validation datasets, respectively. Furthermore, we found that although trained on healthy controls only, in the four validation datasets, the 33-mRNA classifier distinguished viral ARI from both healthy or non-viral ARI samples with > 80% specificity and sensitivity, irrespective of age, viral type, and viral load. Single-cell RNA-sequencing data showed that the 33-mRNA signature is dominated by macrophages and neutrophils in nasal samples. CONCLUSION: This proof-of-concept signature has potential to be adapted as a clinical point-of-care test (‘RespVerity’) to improve the diagnosis of viral ARIs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-023-01216-0. |
format | Online Article Text |
id | pubmed-10463681 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-104636812023-08-30 A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples Pandya, Rushika He, Yudong D. Sweeney, Timothy E. Hasin-Brumshtein, Yehudit Khatri, Purvesh Genome Med Research BACKGROUND: Viral acute respiratory illnesses (viral ARIs) contribute significantly to human morbidity and mortality worldwide, but their successful treatment requires timely diagnosis of viral etiology, which is complicated by overlap in clinical presentation with the non-viral ARIs. Multiple pandemics in the twenty-first century to date have further highlighted the unmet need for effective monitoring of clinically relevant emerging viruses. Recent studies have identified conserved host response to viral infections in the blood. METHODS: We hypothesize that a similarly conserved host response in nasal samples can be utilized for diagnosis and to rule out viral infection in symptomatic patients when current diagnostic tests are negative. Using a multi-cohort analysis framework, we analyzed 1555 nasal samples across 10 independent cohorts dividing them into training and validation. RESULTS: Using six of the datasets for training, we identified 119 genes that are consistently differentially expressed in viral ARI patients (N = 236) compared to healthy controls (N = 146) and further down-selected 33 genes for classifier development. The resulting locked logistic regression-based classifier using the 33-mRNAs had AUC of 0.94 and 0.89 in the six training and four validation datasets, respectively. Furthermore, we found that although trained on healthy controls only, in the four validation datasets, the 33-mRNA classifier distinguished viral ARI from both healthy or non-viral ARI samples with > 80% specificity and sensitivity, irrespective of age, viral type, and viral load. Single-cell RNA-sequencing data showed that the 33-mRNA signature is dominated by macrophages and neutrophils in nasal samples. CONCLUSION: This proof-of-concept signature has potential to be adapted as a clinical point-of-care test (‘RespVerity’) to improve the diagnosis of viral ARIs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-023-01216-0. BioMed Central 2023-08-28 /pmc/articles/PMC10463681/ /pubmed/37641125 http://dx.doi.org/10.1186/s13073-023-01216-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Pandya, Rushika He, Yudong D. Sweeney, Timothy E. Hasin-Brumshtein, Yehudit Khatri, Purvesh A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples |
title | A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples |
title_full | A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples |
title_fullStr | A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples |
title_full_unstemmed | A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples |
title_short | A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples |
title_sort | machine learning classifier using 33 host immune response mrnas accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10463681/ https://www.ncbi.nlm.nih.gov/pubmed/37641125 http://dx.doi.org/10.1186/s13073-023-01216-0 |
work_keys_str_mv | AT pandyarushika amachinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples AT heyudongd amachinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples AT sweeneytimothye amachinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples AT hasinbrumshteinyehudit amachinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples AT khatripurvesh amachinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples AT pandyarushika machinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples AT heyudongd machinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples AT sweeneytimothye machinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples AT hasinbrumshteinyehudit machinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples AT khatripurvesh machinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples |