Cargando…
A machine learning study of COVID-19 serology and molecular tests and predictions
Serology and molecular tests are the two most commonly used methods for rapid COVID-19 infection testing. The two types of tests have different mechanisms to detect infection, by measuring the presence of viral SARS-CoV-2 RNA (molecular test) or detecting the presence of antibodies triggered by the...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9583626/ https://www.ncbi.nlm.nih.gov/pubmed/36281350 http://dx.doi.org/10.1016/j.smhl.2022.100331 |
_version_ | 1784813112936890368 |
---|---|
author | Elkin, Magdalyn E. Zhu, Xingquan |
author_facet | Elkin, Magdalyn E. Zhu, Xingquan |
author_sort | Elkin, Magdalyn E. |
collection | PubMed |
description | Serology and molecular tests are the two most commonly used methods for rapid COVID-19 infection testing. The two types of tests have different mechanisms to detect infection, by measuring the presence of viral SARS-CoV-2 RNA (molecular test) or detecting the presence of antibodies triggered by the SARS-CoV-2 virus (serology test). A handful of studies have shown that symptoms, combined with demographic and/or diagnosis features, can be helpful for the prediction of COVID-19 test outcomes. However, due to nature of the test, serology and molecular tests vary significantly. There is no existing study on the correlation between serology and molecular tests, and what type of symptoms are the key factors indicating the COVID-19 positive tests. In this study, we propose a machine learning based approach to study serology and molecular tests, and use features to predict test outcomes. A total of 2,467 donors, each tested using one or multiple types of COVID-19 tests, are collected as our testbed. By cross checking test types and results, we study correlation between serology and molecular tests. For test outcome prediction, we label 2,467 donors as positive or negative, by using their serology or molecular test results, and create symptom features to represent each donor for learning. Because COVID-19 produces a wide range of symptoms and the data collection process is essentially error prone, we group similar symptoms into bins. This decreases the feature space and sparsity. Using binned symptoms, combined with demographic features, we train five classification algorithms to predict COVID-19 test results. Experiments show that XGBoost achieves the best performance with 76.85% accuracy and 81.4% AUC scores, demonstrating that symptoms are indeed helpful for predicting COVID-19 test outcomes. Our study investigates the relationship between serology and molecular tests, identifies meaningful symptom features associated with COVID-19 infection, and also provides a way for rapid screening and cost effective detection of COVID-19 infection. |
format | Online Article Text |
id | pubmed-9583626 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95836262022-10-20 A machine learning study of COVID-19 serology and molecular tests and predictions Elkin, Magdalyn E. Zhu, Xingquan Smart Health (Amst) Article Serology and molecular tests are the two most commonly used methods for rapid COVID-19 infection testing. The two types of tests have different mechanisms to detect infection, by measuring the presence of viral SARS-CoV-2 RNA (molecular test) or detecting the presence of antibodies triggered by the SARS-CoV-2 virus (serology test). A handful of studies have shown that symptoms, combined with demographic and/or diagnosis features, can be helpful for the prediction of COVID-19 test outcomes. However, due to nature of the test, serology and molecular tests vary significantly. There is no existing study on the correlation between serology and molecular tests, and what type of symptoms are the key factors indicating the COVID-19 positive tests. In this study, we propose a machine learning based approach to study serology and molecular tests, and use features to predict test outcomes. A total of 2,467 donors, each tested using one or multiple types of COVID-19 tests, are collected as our testbed. By cross checking test types and results, we study correlation between serology and molecular tests. For test outcome prediction, we label 2,467 donors as positive or negative, by using their serology or molecular test results, and create symptom features to represent each donor for learning. Because COVID-19 produces a wide range of symptoms and the data collection process is essentially error prone, we group similar symptoms into bins. This decreases the feature space and sparsity. Using binned symptoms, combined with demographic features, we train five classification algorithms to predict COVID-19 test results. Experiments show that XGBoost achieves the best performance with 76.85% accuracy and 81.4% AUC scores, demonstrating that symptoms are indeed helpful for predicting COVID-19 test outcomes. Our study investigates the relationship between serology and molecular tests, identifies meaningful symptom features associated with COVID-19 infection, and also provides a way for rapid screening and cost effective detection of COVID-19 infection. Elsevier Inc. 2022-12 2022-10-20 /pmc/articles/PMC9583626/ /pubmed/36281350 http://dx.doi.org/10.1016/j.smhl.2022.100331 Text en © 2022 Elsevier Inc. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Elkin, Magdalyn E. Zhu, Xingquan A machine learning study of COVID-19 serology and molecular tests and predictions |
title | A machine learning study of COVID-19 serology and molecular tests and predictions |
title_full | A machine learning study of COVID-19 serology and molecular tests and predictions |
title_fullStr | A machine learning study of COVID-19 serology and molecular tests and predictions |
title_full_unstemmed | A machine learning study of COVID-19 serology and molecular tests and predictions |
title_short | A machine learning study of COVID-19 serology and molecular tests and predictions |
title_sort | machine learning study of covid-19 serology and molecular tests and predictions |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9583626/ https://www.ncbi.nlm.nih.gov/pubmed/36281350 http://dx.doi.org/10.1016/j.smhl.2022.100331 |
work_keys_str_mv | AT elkinmagdalyne amachinelearningstudyofcovid19serologyandmoleculartestsandpredictions AT zhuxingquan amachinelearningstudyofcovid19serologyandmoleculartestsandpredictions AT elkinmagdalyne machinelearningstudyofcovid19serologyandmoleculartestsandpredictions AT zhuxingquan machinelearningstudyofcovid19serologyandmoleculartestsandpredictions |