Cargando…

Origins of scale invariance in vocalization sequences and speech

To communicate effectively animals need to detect temporal vocalization cues that vary over several orders of magnitude in their amplitude and frequency content. This large range of temporal cues is evident in the power-law scale-invariant relationship between the power of temporal fluctuations in s...

Descripción completa

Detalles Bibliográficos
Autores principales: Khatami, Fatemeh, Wöhr, Markus, Read, Heather L., Escabí, Monty A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5919684/
https://www.ncbi.nlm.nih.gov/pubmed/29659561
http://dx.doi.org/10.1371/journal.pcbi.1005996
_version_ 1783317684065140736
author Khatami, Fatemeh
Wöhr, Markus
Read, Heather L.
Escabí, Monty A.
author_facet Khatami, Fatemeh
Wöhr, Markus
Read, Heather L.
Escabí, Monty A.
author_sort Khatami, Fatemeh
collection PubMed
description To communicate effectively animals need to detect temporal vocalization cues that vary over several orders of magnitude in their amplitude and frequency content. This large range of temporal cues is evident in the power-law scale-invariant relationship between the power of temporal fluctuations in sounds and the sound modulation frequency (f). Though various forms of scale invariance have been described for natural sounds, the origins and implications of scale invariant phenomenon remain unknown. Using animal vocalization sequences, including continuous human speech, and a stochastic model of temporal amplitude fluctuations we demonstrate that temporal acoustic edges are the primary acoustic cue accounting for the scale invariant phenomenon. The modulation spectrum of vocalization sequences and the model both exhibit a dual regime lowpass structure with a flat region at low modulation frequencies and scale invariant 1/f(2) trend for high modulation frequencies. Moreover, we find a time-frequency tradeoff between the average vocalization duration of each vocalization sequence and the cutoff frequency beyond which scale invariant behavior is observed. These results indicate that temporal edges are universal features responsible for scale invariance in vocalized sounds. This is significant since temporal acoustic edges are salient perceptually and the auditory system could exploit such statistical regularities to minimize redundancies and generate compact neural representations of vocalized sounds.
format Online
Article
Text
id pubmed-5919684
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-59196842018-05-11 Origins of scale invariance in vocalization sequences and speech Khatami, Fatemeh Wöhr, Markus Read, Heather L. Escabí, Monty A. PLoS Comput Biol Research Article To communicate effectively animals need to detect temporal vocalization cues that vary over several orders of magnitude in their amplitude and frequency content. This large range of temporal cues is evident in the power-law scale-invariant relationship between the power of temporal fluctuations in sounds and the sound modulation frequency (f). Though various forms of scale invariance have been described for natural sounds, the origins and implications of scale invariant phenomenon remain unknown. Using animal vocalization sequences, including continuous human speech, and a stochastic model of temporal amplitude fluctuations we demonstrate that temporal acoustic edges are the primary acoustic cue accounting for the scale invariant phenomenon. The modulation spectrum of vocalization sequences and the model both exhibit a dual regime lowpass structure with a flat region at low modulation frequencies and scale invariant 1/f(2) trend for high modulation frequencies. Moreover, we find a time-frequency tradeoff between the average vocalization duration of each vocalization sequence and the cutoff frequency beyond which scale invariant behavior is observed. These results indicate that temporal edges are universal features responsible for scale invariance in vocalized sounds. This is significant since temporal acoustic edges are salient perceptually and the auditory system could exploit such statistical regularities to minimize redundancies and generate compact neural representations of vocalized sounds. Public Library of Science 2018-04-16 /pmc/articles/PMC5919684/ /pubmed/29659561 http://dx.doi.org/10.1371/journal.pcbi.1005996 Text en © 2018 Khatami et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Khatami, Fatemeh
Wöhr, Markus
Read, Heather L.
Escabí, Monty A.
Origins of scale invariance in vocalization sequences and speech
title Origins of scale invariance in vocalization sequences and speech
title_full Origins of scale invariance in vocalization sequences and speech
title_fullStr Origins of scale invariance in vocalization sequences and speech
title_full_unstemmed Origins of scale invariance in vocalization sequences and speech
title_short Origins of scale invariance in vocalization sequences and speech
title_sort origins of scale invariance in vocalization sequences and speech
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5919684/
https://www.ncbi.nlm.nih.gov/pubmed/29659561
http://dx.doi.org/10.1371/journal.pcbi.1005996
work_keys_str_mv AT khatamifatemeh originsofscaleinvarianceinvocalizationsequencesandspeech
AT wohrmarkus originsofscaleinvarianceinvocalizationsequencesandspeech
AT readheatherl originsofscaleinvarianceinvocalizationsequencesandspeech
AT escabimontya originsofscaleinvarianceinvocalizationsequencesandspeech