On the speaker discriminatory power asymmetry regarding acoustic-phonetic parameters and the impact of speaking style

This study aimed to assess what we refer to as the speaker discriminatory power asymmetry and its forensic implications in comparisons performed in different speaking styles: spontaneous dialogues vs. interviews. We also addressed the impact of data sampling on the speaker's discriminatory perf...

Bibliographic Details
Main Authors: Cavalcanti, Julio Cesar; Eriksson, Anders; Barbosa, Plinio A.
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2023
Subjects: Psychology
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10150585/
https://www.ncbi.nlm.nih.gov/pubmed/37138997
http://dx.doi.org/10.3389/fpsyg.2023.1101187
author Cavalcanti, Julio Cesar
Eriksson, Anders
Barbosa, Plinio A.
collection PubMed
description This study aimed to assess what we refer to as the speaker discriminatory power asymmetry and its forensic implications in comparisons performed across different speaking styles: spontaneous dialogues vs. interviews. We also addressed the impact of data sampling on speaker discriminatory performance for different acoustic-phonetic estimates. The participants were 20 male speakers of Brazilian Portuguese from the same dialectal area. The speech material consisted of spontaneous telephone conversations between familiar individuals and interviews conducted between each participant and the researcher. Nine acoustic-phonetic parameters were chosen for the comparisons, spanning temporal, melodic, and spectral acoustic-phonetic estimates. An analysis based on the combination of different parameters was also conducted. Two speaker discriminatory metrics were examined: Cost Log-likelihood-ratio (Cllr) and Equal Error Rate (EER) values. A general speaker discriminatory trend was suggested when assessing the parameters individually. Parameters pertaining to the temporal acoustic-phonetic class showed the weakest speaker contrasting power, as evidenced by their relatively higher Cllr and EER values. Moreover, among the acoustic parameters assessed, spectral parameters, mainly the high formant frequencies F3 and F4, performed best in terms of speaker discrimination, showing the lowest EER and Cllr scores. The results appear to suggest a speaker discriminatory power asymmetry among parameters from different acoustic-phonetic classes, in which temporal parameters tended to present lower discriminatory power. The speaking style mismatch also seemed to considerably impact the speaker comparison task by undermining the overall discriminatory performance. A statistical model based on the combination of different acoustic-phonetic estimates was found to perform best in this case. Finally, data sampling proved to be of crucial relevance for the reliability of discriminatory power assessment.
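The two metrics named in the description can be illustrated with a minimal sketch. It uses the commonly cited definitions of Cllr (computed from likelihood ratios) and EER (the error rate at the threshold where miss and false-alarm rates meet). The record does not say how the authors computed either value, so the function names, the threshold sweep, and the toy inputs below are assumptions for illustration only, not the study's procedure.

```python
import math


def cllr(same_speaker_lrs, different_speaker_lrs):
    """Log-likelihood-ratio cost, computed from likelihood ratios (not log-LRs).

    Lower is better; a system that always outputs LR = 1 scores exactly 1.0.
    """
    # Penalty grows when same-speaker comparisons receive small LRs.
    ss_penalty = sum(math.log2(1 + 1 / lr) for lr in same_speaker_lrs) / len(same_speaker_lrs)
    # Penalty grows when different-speaker comparisons receive large LRs.
    ds_penalty = sum(math.log2(1 + lr) for lr in different_speaker_lrs) / len(different_speaker_lrs)
    return 0.5 * (ss_penalty + ds_penalty)


def eer(same_speaker_scores, different_speaker_scores):
    """Approximate the Equal Error Rate by sweeping thresholds over observed scores.

    Hypothetical helper: returns the mean of the miss and false-alarm rates
    at the threshold where the two rates are closest.
    """
    best_gap, best_rate = float("inf"), None
    thresholds = sorted(set(same_speaker_scores) | set(different_speaker_scores))
    for threshold in thresholds:
        # Miss: a same-speaker pair scored below the threshold.
        miss = sum(s < threshold for s in same_speaker_scores) / len(same_speaker_scores)
        # False alarm: a different-speaker pair scored at or above the threshold.
        false_alarm = sum(s >= threshold for s in different_speaker_scores) / len(different_speaker_scores)
        gap = abs(miss - false_alarm)
        if gap < best_gap:
            best_gap, best_rate = gap, (miss + false_alarm) / 2
    return best_rate


# Toy values, invented for illustration only.
uninformative = cllr([1.0], [1.0])
separated = eer([2, 3], [0, 1])
overlapping = eer([1, 3], [0, 2])
```

Under these definitions, lower Cllr and lower EER both indicate better speaker discrimination, which matches how the description ranks the temporal and spectral parameters.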
format Online
Article
Text
id pubmed-10150585
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-10150585 2023-05-02 On the speaker discriminatory power asymmetry regarding acoustic-phonetic parameters and the impact of speaking style. Cavalcanti, Julio Cesar; Eriksson, Anders; Barbosa, Plinio A. Front Psychol. Psychology. Frontiers Media S.A. 2023-04-17. /pmc/articles/PMC10150585/ /pubmed/37138997 http://dx.doi.org/10.3389/fpsyg.2023.1101187 Text en. Copyright © 2023 Cavalcanti, Eriksson and Barbosa. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title On the speaker discriminatory power asymmetry regarding acoustic-phonetic parameters and the impact of speaking style
topic Psychology