Cargando…

Reliability of crowdsourced data and patient-reported outcome measures in cough-based COVID-19 screening

Mass community testing is a critical means for monitoring the spread of the COVID-19 pandemic. Polymerase chain reaction (PCR) is the gold standard for detecting the causative coronavirus 2 (SARS-CoV-2) but the test is invasive, test centers may not be readily available, and the wait for laboratory...

Descripción completa

Detalles Bibliográficos
Autores principales:	Xiong, Hao, Berkovsky, Shlomo, Kâafar, Mohamed Ali, Jaffe, Adam, Coiera, Enrico, Sharan, Roneel V.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9764298/ https://www.ncbi.nlm.nih.gov/pubmed/36539519 http://dx.doi.org/10.1038/s41598-022-26492-5

_version_	1784853244823994368
author	Xiong, Hao Berkovsky, Shlomo Kâafar, Mohamed Ali Jaffe, Adam Coiera, Enrico Sharan, Roneel V.
author_facet	Xiong, Hao Berkovsky, Shlomo Kâafar, Mohamed Ali Jaffe, Adam Coiera, Enrico Sharan, Roneel V.
author_sort	Xiong, Hao
collection	PubMed
description	Mass community testing is a critical means for monitoring the spread of the COVID-19 pandemic. Polymerase chain reaction (PCR) is the gold standard for detecting the causative coronavirus 2 (SARS-CoV-2) but the test is invasive, test centers may not be readily available, and the wait for laboratory results can take several days. Various machine learning based alternatives to PCR screening for SARS-CoV-2 have been proposed, including cough sound analysis. Cough classification models appear to be a robust means to predict infective status, but collecting reliable PCR confirmed data for their development is challenging and recent work using unverified crowdsourced data is seen as a viable alternative. In this study, we report experiments that assess cough classification models trained (i) using data from PCR-confirmed COVID subjects and (ii) using data of individuals self-reporting their infective status. We compare performance using PCR-confirmed data. Models trained on PCR-confirmed data perform better than those trained on patient-reported data. Models using PCR-confirmed data also exploit more stable predictive features and converge faster. Crowd-sourced cough data is less reliable than PCR-confirmed data for developing predictive models for COVID-19, and raises concerns about the utility of patient reported outcome data in developing other clinical predictive models when better gold-standard data are available.
format	Online Article Text
id	pubmed-9764298
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-97642982022-12-20 Reliability of crowdsourced data and patient-reported outcome measures in cough-based COVID-19 screening Xiong, Hao Berkovsky, Shlomo Kâafar, Mohamed Ali Jaffe, Adam Coiera, Enrico Sharan, Roneel V. Sci Rep Article Mass community testing is a critical means for monitoring the spread of the COVID-19 pandemic. Polymerase chain reaction (PCR) is the gold standard for detecting the causative coronavirus 2 (SARS-CoV-2) but the test is invasive, test centers may not be readily available, and the wait for laboratory results can take several days. Various machine learning based alternatives to PCR screening for SARS-CoV-2 have been proposed, including cough sound analysis. Cough classification models appear to be a robust means to predict infective status, but collecting reliable PCR confirmed data for their development is challenging and recent work using unverified crowdsourced data is seen as a viable alternative. In this study, we report experiments that assess cough classification models trained (i) using data from PCR-confirmed COVID subjects and (ii) using data of individuals self-reporting their infective status. We compare performance using PCR-confirmed data. Models trained on PCR-confirmed data perform better than those trained on patient-reported data. Models using PCR-confirmed data also exploit more stable predictive features and converge faster. Crowd-sourced cough data is less reliable than PCR-confirmed data for developing predictive models for COVID-19, and raises concerns about the utility of patient reported outcome data in developing other clinical predictive models when better gold-standard data are available. Nature Publishing Group UK 2022-12-20 /pmc/articles/PMC9764298/ /pubmed/36539519 http://dx.doi.org/10.1038/s41598-022-26492-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Xiong, Hao Berkovsky, Shlomo Kâafar, Mohamed Ali Jaffe, Adam Coiera, Enrico Sharan, Roneel V. Reliability of crowdsourced data and patient-reported outcome measures in cough-based COVID-19 screening
title	Reliability of crowdsourced data and patient-reported outcome measures in cough-based COVID-19 screening
title_full	Reliability of crowdsourced data and patient-reported outcome measures in cough-based COVID-19 screening
title_fullStr	Reliability of crowdsourced data and patient-reported outcome measures in cough-based COVID-19 screening
title_full_unstemmed	Reliability of crowdsourced data and patient-reported outcome measures in cough-based COVID-19 screening
title_short	Reliability of crowdsourced data and patient-reported outcome measures in cough-based COVID-19 screening
title_sort	reliability of crowdsourced data and patient-reported outcome measures in cough-based covid-19 screening
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9764298/ https://www.ncbi.nlm.nih.gov/pubmed/36539519 http://dx.doi.org/10.1038/s41598-022-26492-5
work_keys_str_mv	AT xionghao reliabilityofcrowdsourceddataandpatientreportedoutcomemeasuresincoughbasedcovid19screening AT berkovskyshlomo reliabilityofcrowdsourceddataandpatientreportedoutcomemeasuresincoughbasedcovid19screening AT kaafarmohamedali reliabilityofcrowdsourceddataandpatientreportedoutcomemeasuresincoughbasedcovid19screening AT jaffeadam reliabilityofcrowdsourceddataandpatientreportedoutcomemeasuresincoughbasedcovid19screening AT coieraenrico reliabilityofcrowdsourceddataandpatientreportedoutcomemeasuresincoughbasedcovid19screening AT sharanroneelv reliabilityofcrowdsourceddataandpatientreportedoutcomemeasuresincoughbasedcovid19screening

Reliability of crowdsourced data and patient-reported outcome measures in cough-based COVID-19 screening

Ejemplares similares