Cargando…

A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples

BACKGROUND: Viral acute respiratory illnesses (viral ARIs) contribute significantly to human morbidity and mortality worldwide, but their successful treatment requires timely diagnosis of viral etiology, which is complicated by overlap in clinical presentation with the non-viral ARIs. Multiple pande...

Descripción completa

Detalles Bibliográficos
Autores principales: Pandya, Rushika, He, Yudong D., Sweeney, Timothy E., Hasin-Brumshtein, Yehudit, Khatri, Purvesh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10463681/
https://www.ncbi.nlm.nih.gov/pubmed/37641125
http://dx.doi.org/10.1186/s13073-023-01216-0
_version_ 1785098289039802368
author Pandya, Rushika
He, Yudong D.
Sweeney, Timothy E.
Hasin-Brumshtein, Yehudit
Khatri, Purvesh
author_facet Pandya, Rushika
He, Yudong D.
Sweeney, Timothy E.
Hasin-Brumshtein, Yehudit
Khatri, Purvesh
author_sort Pandya, Rushika
collection PubMed
description BACKGROUND: Viral acute respiratory illnesses (viral ARIs) contribute significantly to human morbidity and mortality worldwide, but their successful treatment requires timely diagnosis of viral etiology, which is complicated by overlap in clinical presentation with the non-viral ARIs. Multiple pandemics in the twenty-first century to date have further highlighted the unmet need for effective monitoring of clinically relevant emerging viruses. Recent studies have identified conserved host response to viral infections in the blood. METHODS: We hypothesize that a similarly conserved host response in nasal samples can be utilized for diagnosis and to rule out viral infection in symptomatic patients when current diagnostic tests are negative. Using a multi-cohort analysis framework, we analyzed 1555 nasal samples across 10 independent cohorts dividing them into training and validation. RESULTS: Using six of the datasets for training, we identified 119 genes that are consistently differentially expressed in viral ARI patients (N = 236) compared to healthy controls (N = 146) and further down-selected 33 genes for classifier development. The resulting locked logistic regression-based classifier using the 33-mRNAs had AUC of 0.94 and 0.89 in the six training and four validation datasets, respectively. Furthermore, we found that although trained on healthy controls only, in the four validation datasets, the 33-mRNA classifier distinguished viral ARI from both healthy or non-viral ARI samples with > 80% specificity and sensitivity, irrespective of age, viral type, and viral load. Single-cell RNA-sequencing data showed that the 33-mRNA signature is dominated by macrophages and neutrophils in nasal samples. CONCLUSION: This proof-of-concept signature has potential to be adapted as a clinical point-of-care test (‘RespVerity’) to improve the diagnosis of viral ARIs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-023-01216-0.
format Online
Article
Text
id pubmed-10463681
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-104636812023-08-30 A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples Pandya, Rushika He, Yudong D. Sweeney, Timothy E. Hasin-Brumshtein, Yehudit Khatri, Purvesh Genome Med Research BACKGROUND: Viral acute respiratory illnesses (viral ARIs) contribute significantly to human morbidity and mortality worldwide, but their successful treatment requires timely diagnosis of viral etiology, which is complicated by overlap in clinical presentation with the non-viral ARIs. Multiple pandemics in the twenty-first century to date have further highlighted the unmet need for effective monitoring of clinically relevant emerging viruses. Recent studies have identified conserved host response to viral infections in the blood. METHODS: We hypothesize that a similarly conserved host response in nasal samples can be utilized for diagnosis and to rule out viral infection in symptomatic patients when current diagnostic tests are negative. Using a multi-cohort analysis framework, we analyzed 1555 nasal samples across 10 independent cohorts dividing them into training and validation. RESULTS: Using six of the datasets for training, we identified 119 genes that are consistently differentially expressed in viral ARI patients (N = 236) compared to healthy controls (N = 146) and further down-selected 33 genes for classifier development. The resulting locked logistic regression-based classifier using the 33-mRNAs had AUC of 0.94 and 0.89 in the six training and four validation datasets, respectively. Furthermore, we found that although trained on healthy controls only, in the four validation datasets, the 33-mRNA classifier distinguished viral ARI from both healthy or non-viral ARI samples with > 80% specificity and sensitivity, irrespective of age, viral type, and viral load. Single-cell RNA-sequencing data showed that the 33-mRNA signature is dominated by macrophages and neutrophils in nasal samples. CONCLUSION: This proof-of-concept signature has potential to be adapted as a clinical point-of-care test (‘RespVerity’) to improve the diagnosis of viral ARIs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-023-01216-0. BioMed Central 2023-08-28 /pmc/articles/PMC10463681/ /pubmed/37641125 http://dx.doi.org/10.1186/s13073-023-01216-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Pandya, Rushika
He, Yudong D.
Sweeney, Timothy E.
Hasin-Brumshtein, Yehudit
Khatri, Purvesh
A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples
title A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples
title_full A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples
title_fullStr A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples
title_full_unstemmed A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples
title_short A machine learning classifier using 33 host immune response mRNAs accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples
title_sort machine learning classifier using 33 host immune response mrnas accurately distinguishes viral and non-viral acute respiratory illnesses in nasal swab samples
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10463681/
https://www.ncbi.nlm.nih.gov/pubmed/37641125
http://dx.doi.org/10.1186/s13073-023-01216-0
work_keys_str_mv AT pandyarushika amachinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples
AT heyudongd amachinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples
AT sweeneytimothye amachinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples
AT hasinbrumshteinyehudit amachinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples
AT khatripurvesh amachinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples
AT pandyarushika machinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples
AT heyudongd machinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples
AT sweeneytimothye machinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples
AT hasinbrumshteinyehudit machinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples
AT khatripurvesh machinelearningclassifierusing33hostimmuneresponsemrnasaccuratelydistinguishesviralandnonviralacuterespiratoryillnessesinnasalswabsamples