Development of a graphical user interface for automatic separation of human voice from Doppler ultrasound audio in diving experiments

Bibliographic Details
Main Authors: Azarang, Arian, Blogg, S. Lesley, Currens, Joshua, Lance, Rachel M., Moon, Richard E., Lindholm, Peter, Papadopoulou, Virginie
Format: Online Article Text
Language: English
Published: Public Library of Science 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10414643/
https://www.ncbi.nlm.nih.gov/pubmed/37561745
http://dx.doi.org/10.1371/journal.pone.0283953
_version_ 1785087385469452288
author Azarang, Arian
Blogg, S. Lesley
Currens, Joshua
Lance, Rachel M.
Moon, Richard E.
Lindholm, Peter
Papadopoulou, Virginie
author_facet Azarang, Arian
Blogg, S. Lesley
Currens, Joshua
Lance, Rachel M.
Moon, Richard E.
Lindholm, Peter
Papadopoulou, Virginie
author_sort Azarang, Arian
collection PubMed
description Doppler ultrasound (DU) is used in decompression research to detect venous gas emboli in the precordium or subclavian vein as a marker of decompression stress. This is of relevance to scuba divers, compressed air workers and astronauts for preventing decompression sickness (DCS), which can be caused by these bubbles upon or after a sudden reduction in ambient pressure. Doppler ultrasound data are graded by expert raters on the Kisman-Masurel or Spencer scales, which are associated with DCS risk. Meta-analyses, as well as efforts to computer-automate DU grading, necessitate access to large databases of well-curated and graded data. Leveraging previously collected data is especially important due to the difficulty of repeating the large-scale, extreme military pressure exposures that were conducted in the 1970s-90s in austere environments. Historically, DU data (Non-speech) were often captured on cassettes as single-channel audio with superimposed human speech describing the experiment (Speech). Digitizing and separating these audio files is currently a lengthy, manual task. In this paper, we develop a graphical user interface (GUI) to perform automatic speech recognition and aid in Non-speech and Speech separation. This constitutes the first study incorporating speech processing technology in the field of diving research. If successful, it has the potential to significantly accelerate the reuse of previously acquired datasets. The recognition task incorporates the Google speech recognizer to detect the presence of human voice activity together with the corresponding timestamps. The detected human speech is then separated from the audio Doppler ultrasound within the developed GUI. Several experiments were conducted on recently digitized audio Doppler recordings to corroborate the effectiveness of the developed GUI in the recognition and separation tasks, and the results are compared to manual labels of the Speech timestamps. Two metrics are used to evaluate performance: the average absolute difference between the reference and detected Speech starting points, and the percentage of detected Speech over the total duration of the reference Speech. The results show the efficacy of the developed GUI in Speech/Non-speech component separation.
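For reference, the minimal Python sketch below illustrates the pipeline described above: obtaining word-level Speech timestamps from a Google speech recognizer, blanking those intervals out of a single-channel Doppler recording to recover the Non-speech component, and computing the two reported metrics. This is an illustrative sketch only, not the authors' implementation; the record does not specify which Google API or audio library was used, and the google-cloud-speech and soundfile libraries, the padding value, and the helper names (detect_speech_segments, remove_speech, onset_error, speech_coverage) are assumptions made here for illustration.

# Illustrative sketch (not the authors' code): get word-level timestamps from
# Google Cloud Speech-to-Text, zero out those intervals to leave the Doppler
# (Non-speech) signal, and compute the two evaluation metrics named in the abstract.
import numpy as np
import soundfile as sf
from google.cloud import speech

def detect_speech_segments(wav_path, sample_rate=16000, language="en-US"):
    # Return (start, end) times in seconds for each recognized word.
    # Synchronous recognize() handles short clips; long tapes would need
    # long_running_recognize() with the audio staged in Cloud Storage.
    client = speech.SpeechClient()
    with open(wav_path, "rb") as f:
        audio = speech.RecognitionAudio(content=f.read())
    config = speech.RecognitionConfig(
        encoding=speech.RecognitionConfig.AudioEncoding.LINEAR16,
        sample_rate_hertz=sample_rate,
        language_code=language,
        enable_word_time_offsets=True,   # request word-level timestamps
    )
    response = client.recognize(config=config, audio=audio)
    segments = []
    for result in response.results:
        for word in result.alternatives[0].words:
            # start_time/end_time are datetime.timedelta objects in recent
            # versions of the google-cloud-speech client.
            segments.append((word.start_time.total_seconds(),
                             word.end_time.total_seconds()))
    return segments

def remove_speech(wav_path, segments, pad=0.25):
    # Zero out detected Speech intervals (with a small padding, in seconds)
    # so that only the Doppler ultrasound (Non-speech) audio remains.
    data, fs = sf.read(wav_path)
    nonspeech = data.copy()
    for start, end in segments:
        a = max(0, int((start - pad) * fs))
        b = min(len(data), int((end + pad) * fs))
        nonspeech[a:b] = 0.0
    return nonspeech, fs

def onset_error(reference_starts, detected_starts):
    # Average absolute difference (s) between paired reference and detected
    # Speech starting points.
    return float(np.mean(np.abs(np.asarray(reference_starts) -
                                np.asarray(detected_starts))))

def speech_coverage(reference_segments, detected_segments):
    # Percentage of the total reference Speech duration that overlaps with
    # detected Speech segments.
    ref_total = sum(e - s for s, e in reference_segments)
    covered = sum(max(0.0, min(re, de) - max(rs, ds))
                  for rs, re in reference_segments
                  for ds, de in detected_segments)
    return 100.0 * covered / ref_total if ref_total else 0.0

In this sketch, merging adjacent word timestamps into utterance-level segments and choosing the padding are left as design choices; the article's GUI handles the separation step itself.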
format Online
Article
Text
id pubmed-10414643
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-10414643 2023-08-11 Development of a graphical user interface for automatic separation of human voice from Doppler ultrasound audio in diving experiments Azarang, Arian Blogg, S. Lesley Currens, Joshua Lance, Rachel M. Moon, Richard E. Lindholm, Peter Papadopoulou, Virginie PLoS One Research Article Doppler ultrasound (DU) is used in decompression research to detect venous gas emboli in the precordium or subclavian vein as a marker of decompression stress. This is of relevance to scuba divers, compressed air workers and astronauts for preventing decompression sickness (DCS), which can be caused by these bubbles upon or after a sudden reduction in ambient pressure. Doppler ultrasound data are graded by expert raters on the Kisman-Masurel or Spencer scales, which are associated with DCS risk. Meta-analyses, as well as efforts to computer-automate DU grading, necessitate access to large databases of well-curated and graded data. Leveraging previously collected data is especially important due to the difficulty of repeating the large-scale, extreme military pressure exposures that were conducted in the 1970s-90s in austere environments. Historically, DU data (Non-speech) were often captured on cassettes as single-channel audio with superimposed human speech describing the experiment (Speech). Digitizing and separating these audio files is currently a lengthy, manual task. In this paper, we develop a graphical user interface (GUI) to perform automatic speech recognition and aid in Non-speech and Speech separation. This constitutes the first study incorporating speech processing technology in the field of diving research. If successful, it has the potential to significantly accelerate the reuse of previously acquired datasets. The recognition task incorporates the Google speech recognizer to detect the presence of human voice activity together with the corresponding timestamps. The detected human speech is then separated from the audio Doppler ultrasound within the developed GUI. Several experiments were conducted on recently digitized audio Doppler recordings to corroborate the effectiveness of the developed GUI in the recognition and separation tasks, and the results are compared to manual labels of the Speech timestamps. Two metrics are used to evaluate performance: the average absolute difference between the reference and detected Speech starting points, and the percentage of detected Speech over the total duration of the reference Speech. The results show the efficacy of the developed GUI in Speech/Non-speech component separation. Public Library of Science 2023-08-10 /pmc/articles/PMC10414643/ /pubmed/37561745 http://dx.doi.org/10.1371/journal.pone.0283953 Text en © 2023 Azarang et al https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Azarang, Arian
Blogg, S. Lesley
Currens, Joshua
Lance, Rachel M.
Moon, Richard E.
Lindholm, Peter
Papadopoulou, Virginie
Development of a graphical user interface for automatic separation of human voice from Doppler ultrasound audio in diving experiments
title Development of a graphical user interface for automatic separation of human voice from Doppler ultrasound audio in diving experiments
title_full Development of a graphical user interface for automatic separation of human voice from Doppler ultrasound audio in diving experiments
title_fullStr Development of a graphical user interface for automatic separation of human voice from Doppler ultrasound audio in diving experiments
title_full_unstemmed Development of a graphical user interface for automatic separation of human voice from Doppler ultrasound audio in diving experiments
title_short Development of a graphical user interface for automatic separation of human voice from Doppler ultrasound audio in diving experiments
title_sort development of a graphical user interface for automatic separation of human voice from doppler ultrasound audio in diving experiments
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10414643/
https://www.ncbi.nlm.nih.gov/pubmed/37561745
http://dx.doi.org/10.1371/journal.pone.0283953
work_keys_str_mv AT azarangarian developmentofagraphicaluserinterfaceforautomaticseparationofhumanvoicefromdopplerultrasoundaudioindivingexperiments
AT bloggslesley developmentofagraphicaluserinterfaceforautomaticseparationofhumanvoicefromdopplerultrasoundaudioindivingexperiments
AT currensjoshua developmentofagraphicaluserinterfaceforautomaticseparationofhumanvoicefromdopplerultrasoundaudioindivingexperiments
AT lancerachelm developmentofagraphicaluserinterfaceforautomaticseparationofhumanvoicefromdopplerultrasoundaudioindivingexperiments
AT moonricharde developmentofagraphicaluserinterfaceforautomaticseparationofhumanvoicefromdopplerultrasoundaudioindivingexperiments
AT lindholmpeter developmentofagraphicaluserinterfaceforautomaticseparationofhumanvoicefromdopplerultrasoundaudioindivingexperiments
AT papadopoulouvirginie developmentofagraphicaluserinterfaceforautomaticseparationofhumanvoicefromdopplerultrasoundaudioindivingexperiments