Cargando…

Acoustic compression in Zoom audio does not compromise voice recognition performance

Human voice recognition over telephone channels typically yields lower accuracy when compared to audio recorded in a studio environment with higher quality. Here, we investigated the extent to which audio in video conferencing, subject to various lossy compression mechanisms, affects human voice rec...

Descripción completa

Detalles Bibliográficos
Autores principales: Perepelytsia, Valeriia, Dellwo, Volker
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10618539/
https://www.ncbi.nlm.nih.gov/pubmed/37907749
http://dx.doi.org/10.1038/s41598-023-45971-x
_version_ 1785129797934907392
author Perepelytsia, Valeriia
Dellwo, Volker
author_facet Perepelytsia, Valeriia
Dellwo, Volker
author_sort Perepelytsia, Valeriia
collection PubMed
description Human voice recognition over telephone channels typically yields lower accuracy when compared to audio recorded in a studio environment with higher quality. Here, we investigated the extent to which audio in video conferencing, subject to various lossy compression mechanisms, affects human voice recognition performance. Voice recognition performance was tested in an old–new recognition task under three audio conditions (telephone, Zoom, studio) across all matched (familiarization and test with same audio condition) and mismatched combinations (familiarization and test with different audio conditions). Participants were familiarized with female voices presented in either studio-quality (N = 22), Zoom-quality (N = 21), or telephone-quality (N = 20) stimuli. Subsequently, all listeners performed an identical voice recognition test containing a balanced stimulus set from all three conditions. Results revealed that voice recognition performance (dʹ) in Zoom audio was not significantly different to studio audio but both in Zoom and studio audio listeners performed significantly better compared to telephone audio. This suggests that signal processing of the speech codec used by Zoom provides equally relevant information in terms of voice recognition compared to studio audio. Interestingly, listeners familiarized with voices via Zoom audio showed a trend towards a better recognition performance in the test (p = 0.056) compared to listeners familiarized with studio audio. We discuss future directions according to which a possible advantage of Zoom audio for voice recognition might be related to some of the speech coding mechanisms used by Zoom.
format Online
Article
Text
id pubmed-10618539
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106185392023-11-02 Acoustic compression in Zoom audio does not compromise voice recognition performance Perepelytsia, Valeriia Dellwo, Volker Sci Rep Article Human voice recognition over telephone channels typically yields lower accuracy when compared to audio recorded in a studio environment with higher quality. Here, we investigated the extent to which audio in video conferencing, subject to various lossy compression mechanisms, affects human voice recognition performance. Voice recognition performance was tested in an old–new recognition task under three audio conditions (telephone, Zoom, studio) across all matched (familiarization and test with same audio condition) and mismatched combinations (familiarization and test with different audio conditions). Participants were familiarized with female voices presented in either studio-quality (N = 22), Zoom-quality (N = 21), or telephone-quality (N = 20) stimuli. Subsequently, all listeners performed an identical voice recognition test containing a balanced stimulus set from all three conditions. Results revealed that voice recognition performance (dʹ) in Zoom audio was not significantly different to studio audio but both in Zoom and studio audio listeners performed significantly better compared to telephone audio. This suggests that signal processing of the speech codec used by Zoom provides equally relevant information in terms of voice recognition compared to studio audio. Interestingly, listeners familiarized with voices via Zoom audio showed a trend towards a better recognition performance in the test (p = 0.056) compared to listeners familiarized with studio audio. We discuss future directions according to which a possible advantage of Zoom audio for voice recognition might be related to some of the speech coding mechanisms used by Zoom. Nature Publishing Group UK 2023-10-31 /pmc/articles/PMC10618539/ /pubmed/37907749 http://dx.doi.org/10.1038/s41598-023-45971-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Perepelytsia, Valeriia
Dellwo, Volker
Acoustic compression in Zoom audio does not compromise voice recognition performance
title Acoustic compression in Zoom audio does not compromise voice recognition performance
title_full Acoustic compression in Zoom audio does not compromise voice recognition performance
title_fullStr Acoustic compression in Zoom audio does not compromise voice recognition performance
title_full_unstemmed Acoustic compression in Zoom audio does not compromise voice recognition performance
title_short Acoustic compression in Zoom audio does not compromise voice recognition performance
title_sort acoustic compression in zoom audio does not compromise voice recognition performance
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10618539/
https://www.ncbi.nlm.nih.gov/pubmed/37907749
http://dx.doi.org/10.1038/s41598-023-45971-x
work_keys_str_mv AT perepelytsiavaleriia acousticcompressioninzoomaudiodoesnotcompromisevoicerecognitionperformance
AT dellwovolker acousticcompressioninzoomaudiodoesnotcompromisevoicerecognitionperformance