Cargando…

The effect of speech pathology on automatic speaker verification: a large-scale study

Navigating the challenges of data-driven speech processing, one of the primary hurdles is accessing reliable pathological speech data. While public datasets appear to offer solutions, they come with inherent risks of potential unintended exposure of patient health information via re-identification a...

Descripción completa

Detalles Bibliográficos
Autores principales: Tayebi Arasteh, Soroosh, Weise, Tobias, Schuster, Maria, Noeth, Elmar, Maier, Andreas, Yang, Seung Hee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665418/
https://www.ncbi.nlm.nih.gov/pubmed/37993490
http://dx.doi.org/10.1038/s41598-023-47711-7
_version_ 1785148866291564544
author Tayebi Arasteh, Soroosh
Weise, Tobias
Schuster, Maria
Noeth, Elmar
Maier, Andreas
Yang, Seung Hee
author_facet Tayebi Arasteh, Soroosh
Weise, Tobias
Schuster, Maria
Noeth, Elmar
Maier, Andreas
Yang, Seung Hee
author_sort Tayebi Arasteh, Soroosh
collection PubMed
description Navigating the challenges of data-driven speech processing, one of the primary hurdles is accessing reliable pathological speech data. While public datasets appear to offer solutions, they come with inherent risks of potential unintended exposure of patient health information via re-identification attacks. Using a comprehensive real-world pathological speech corpus, with over n[Formula: see text] 3800 test subjects spanning various age groups and speech disorders, we employed a deep-learning-driven automatic speaker verification (ASV) approach. This resulted in a notable mean equal error rate (EER) of [Formula: see text] , outstripping traditional benchmarks. Our comprehensive assessments demonstrate that pathological speech overall faces heightened privacy breach risks compared to healthy speech. Specifically, adults with dysphonia are at heightened re-identification risks, whereas conditions like dysarthria yield results comparable to those of healthy speakers. Crucially, speech intelligibility does not influence the ASV system’s performance metrics. In pediatric cases, particularly those with cleft lip and palate, the recording environment plays a decisive role in re-identification. Merging data across pathological types led to a marked EER decrease, suggesting the potential benefits of pathological diversity in ASV, accompanied by a logarithmic boost in ASV effectiveness. In essence, this research sheds light on the dynamics between pathological speech and speaker verification, emphasizing its crucial role in safeguarding patient confidentiality in our increasingly digitized healthcare era.
format Online
Article
Text
id pubmed-10665418
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106654182023-11-22 The effect of speech pathology on automatic speaker verification: a large-scale study Tayebi Arasteh, Soroosh Weise, Tobias Schuster, Maria Noeth, Elmar Maier, Andreas Yang, Seung Hee Sci Rep Article Navigating the challenges of data-driven speech processing, one of the primary hurdles is accessing reliable pathological speech data. While public datasets appear to offer solutions, they come with inherent risks of potential unintended exposure of patient health information via re-identification attacks. Using a comprehensive real-world pathological speech corpus, with over n[Formula: see text] 3800 test subjects spanning various age groups and speech disorders, we employed a deep-learning-driven automatic speaker verification (ASV) approach. This resulted in a notable mean equal error rate (EER) of [Formula: see text] , outstripping traditional benchmarks. Our comprehensive assessments demonstrate that pathological speech overall faces heightened privacy breach risks compared to healthy speech. Specifically, adults with dysphonia are at heightened re-identification risks, whereas conditions like dysarthria yield results comparable to those of healthy speakers. Crucially, speech intelligibility does not influence the ASV system’s performance metrics. In pediatric cases, particularly those with cleft lip and palate, the recording environment plays a decisive role in re-identification. Merging data across pathological types led to a marked EER decrease, suggesting the potential benefits of pathological diversity in ASV, accompanied by a logarithmic boost in ASV effectiveness. In essence, this research sheds light on the dynamics between pathological speech and speaker verification, emphasizing its crucial role in safeguarding patient confidentiality in our increasingly digitized healthcare era. Nature Publishing Group UK 2023-11-22 /pmc/articles/PMC10665418/ /pubmed/37993490 http://dx.doi.org/10.1038/s41598-023-47711-7 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Tayebi Arasteh, Soroosh
Weise, Tobias
Schuster, Maria
Noeth, Elmar
Maier, Andreas
Yang, Seung Hee
The effect of speech pathology on automatic speaker verification: a large-scale study
title The effect of speech pathology on automatic speaker verification: a large-scale study
title_full The effect of speech pathology on automatic speaker verification: a large-scale study
title_fullStr The effect of speech pathology on automatic speaker verification: a large-scale study
title_full_unstemmed The effect of speech pathology on automatic speaker verification: a large-scale study
title_short The effect of speech pathology on automatic speaker verification: a large-scale study
title_sort effect of speech pathology on automatic speaker verification: a large-scale study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665418/
https://www.ncbi.nlm.nih.gov/pubmed/37993490
http://dx.doi.org/10.1038/s41598-023-47711-7
work_keys_str_mv AT tayebiarastehsoroosh theeffectofspeechpathologyonautomaticspeakerverificationalargescalestudy
AT weisetobias theeffectofspeechpathologyonautomaticspeakerverificationalargescalestudy
AT schustermaria theeffectofspeechpathologyonautomaticspeakerverificationalargescalestudy
AT noethelmar theeffectofspeechpathologyonautomaticspeakerverificationalargescalestudy
AT maierandreas theeffectofspeechpathologyonautomaticspeakerverificationalargescalestudy
AT yangseunghee theeffectofspeechpathologyonautomaticspeakerverificationalargescalestudy
AT tayebiarastehsoroosh effectofspeechpathologyonautomaticspeakerverificationalargescalestudy
AT weisetobias effectofspeechpathologyonautomaticspeakerverificationalargescalestudy
AT schustermaria effectofspeechpathologyonautomaticspeakerverificationalargescalestudy
AT noethelmar effectofspeechpathologyonautomaticspeakerverificationalargescalestudy
AT maierandreas effectofspeechpathologyonautomaticspeakerverificationalargescalestudy
AT yangseunghee effectofspeechpathologyonautomaticspeakerverificationalargescalestudy