Cargando…
Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures
Alongside the currently used nasal swab testing, the COVID-19 pandemic situation would gain noticeable advantages from low-cost tests that are available at any-time, anywhere, at a large-scale, and with real time answers. A novel approach for COVID-19 assessment is adopted here, discriminating negat...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier B.V.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9328841/ https://www.ncbi.nlm.nih.gov/pubmed/35915642 http://dx.doi.org/10.1016/j.knosys.2022.109539 |
_version_ | 1784757807971565568 |
---|---|
author | Costantini, Giovanni Dr., Valerio Cesarini Robotti, Carlo Benazzo, Marco Pietrantonio, Filomena Di Girolamo, Stefano Pisani, Antonio Canzi, Pietro Mauramati, Simone Bertino, Giulia Cassaniti, Irene Baldanti, Fausto Saggio, Giovanni |
author_facet | Costantini, Giovanni Dr., Valerio Cesarini Robotti, Carlo Benazzo, Marco Pietrantonio, Filomena Di Girolamo, Stefano Pisani, Antonio Canzi, Pietro Mauramati, Simone Bertino, Giulia Cassaniti, Irene Baldanti, Fausto Saggio, Giovanni |
author_sort | Costantini, Giovanni |
collection | PubMed |
description | Alongside the currently used nasal swab testing, the COVID-19 pandemic situation would gain noticeable advantages from low-cost tests that are available at any-time, anywhere, at a large-scale, and with real time answers. A novel approach for COVID-19 assessment is adopted here, discriminating negative subjects versus positive or recovered subjects. The scope is to identify potential discriminating features, highlight mid and short-term effects of COVID on the voice and compare two custom algorithms. A pool of 310 subjects took part in the study; recordings were collected in a low-noise, controlled setting employing three different vocal tasks. Binary classifications followed, using two different custom algorithms. The first was based on the coupling of boosting and bagging, with an AdaBoost classifier using Random Forest learners. A feature selection process was employed for the training, identifying a subset of features acting as clinically relevant biomarkers. The other approach was centered on two custom CNN architectures applied to mel-Spectrograms, with a custom knowledge-based data augmentation. Performances, evaluated on an independent test set, were comparable: Adaboost and CNN differentiated COVID-19 positive from negative with accuracies of 100% and 95% respectively, and recovered from negative individuals with accuracies of 86.1% and 75% respectively. This study highlights the possibility to identify COVID-19 positive subjects, foreseeing a tool for on-site screening, while also considering recovered subjects and the effects of COVID-19 on the voice. The two proposed novel architectures allow for the identification of biomarkers and demonstrate the ongoing relevance of traditional ML versus deep learning in speech analysis. |
format | Online Article Text |
id | pubmed-9328841 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-93288412022-07-28 Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures Costantini, Giovanni Dr., Valerio Cesarini Robotti, Carlo Benazzo, Marco Pietrantonio, Filomena Di Girolamo, Stefano Pisani, Antonio Canzi, Pietro Mauramati, Simone Bertino, Giulia Cassaniti, Irene Baldanti, Fausto Saggio, Giovanni Knowl Based Syst Article Alongside the currently used nasal swab testing, the COVID-19 pandemic situation would gain noticeable advantages from low-cost tests that are available at any-time, anywhere, at a large-scale, and with real time answers. A novel approach for COVID-19 assessment is adopted here, discriminating negative subjects versus positive or recovered subjects. The scope is to identify potential discriminating features, highlight mid and short-term effects of COVID on the voice and compare two custom algorithms. A pool of 310 subjects took part in the study; recordings were collected in a low-noise, controlled setting employing three different vocal tasks. Binary classifications followed, using two different custom algorithms. The first was based on the coupling of boosting and bagging, with an AdaBoost classifier using Random Forest learners. A feature selection process was employed for the training, identifying a subset of features acting as clinically relevant biomarkers. The other approach was centered on two custom CNN architectures applied to mel-Spectrograms, with a custom knowledge-based data augmentation. Performances, evaluated on an independent test set, were comparable: Adaboost and CNN differentiated COVID-19 positive from negative with accuracies of 100% and 95% respectively, and recovered from negative individuals with accuracies of 86.1% and 75% respectively. This study highlights the possibility to identify COVID-19 positive subjects, foreseeing a tool for on-site screening, while also considering recovered subjects and the effects of COVID-19 on the voice. The two proposed novel architectures allow for the identification of biomarkers and demonstrate the ongoing relevance of traditional ML versus deep learning in speech analysis. Elsevier B.V. 2022-10-11 2022-07-28 /pmc/articles/PMC9328841/ /pubmed/35915642 http://dx.doi.org/10.1016/j.knosys.2022.109539 Text en © 2022 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Costantini, Giovanni Dr., Valerio Cesarini Robotti, Carlo Benazzo, Marco Pietrantonio, Filomena Di Girolamo, Stefano Pisani, Antonio Canzi, Pietro Mauramati, Simone Bertino, Giulia Cassaniti, Irene Baldanti, Fausto Saggio, Giovanni Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures |
title | Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures |
title_full | Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures |
title_fullStr | Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures |
title_full_unstemmed | Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures |
title_short | Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures |
title_sort | deep learning and machine learning-based voice analysis for the detection of covid-19: a proposal and comparison of architectures |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9328841/ https://www.ncbi.nlm.nih.gov/pubmed/35915642 http://dx.doi.org/10.1016/j.knosys.2022.109539 |
work_keys_str_mv | AT costantinigiovanni deeplearningandmachinelearningbasedvoiceanalysisforthedetectionofcovid19aproposalandcomparisonofarchitectures AT drvaleriocesarini deeplearningandmachinelearningbasedvoiceanalysisforthedetectionofcovid19aproposalandcomparisonofarchitectures AT robotticarlo deeplearningandmachinelearningbasedvoiceanalysisforthedetectionofcovid19aproposalandcomparisonofarchitectures AT benazzomarco deeplearningandmachinelearningbasedvoiceanalysisforthedetectionofcovid19aproposalandcomparisonofarchitectures AT pietrantoniofilomena deeplearningandmachinelearningbasedvoiceanalysisforthedetectionofcovid19aproposalandcomparisonofarchitectures AT digirolamostefano deeplearningandmachinelearningbasedvoiceanalysisforthedetectionofcovid19aproposalandcomparisonofarchitectures AT pisaniantonio deeplearningandmachinelearningbasedvoiceanalysisforthedetectionofcovid19aproposalandcomparisonofarchitectures AT canzipietro deeplearningandmachinelearningbasedvoiceanalysisforthedetectionofcovid19aproposalandcomparisonofarchitectures AT mauramatisimone deeplearningandmachinelearningbasedvoiceanalysisforthedetectionofcovid19aproposalandcomparisonofarchitectures AT bertinogiulia deeplearningandmachinelearningbasedvoiceanalysisforthedetectionofcovid19aproposalandcomparisonofarchitectures AT cassanitiirene deeplearningandmachinelearningbasedvoiceanalysisforthedetectionofcovid19aproposalandcomparisonofarchitectures AT baldantifausto deeplearningandmachinelearningbasedvoiceanalysisforthedetectionofcovid19aproposalandcomparisonofarchitectures AT saggiogiovanni deeplearningandmachinelearningbasedvoiceanalysisforthedetectionofcovid19aproposalandcomparisonofarchitectures |