
Scaling Laws for Phonotactic Complexity in Spoken English Language Data

Two prominent statistical laws in language and other complex systems are Zipf’s law and Heaps’ law. We investigate the extent to which these two laws apply to the linguistic domain of phonotactics—that is, to sequences of sounds. We analyze phonotactic sequences with different lengths within words a...


Bibliographic Details
Main authors: Baumann, Andreas; Kaźmierski, Kamil; Matzinger, Theresa
Format: Online Article Text
Language: English
Published: SAGE Publications 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8406375/
https://www.ncbi.nlm.nih.gov/pubmed/32744167
http://dx.doi.org/10.1177/0023830920944445
collection PubMed
description Two prominent statistical laws in language and other complex systems are Zipf’s law and Heaps’ law. We investigate the extent to which these two laws apply to the linguistic domain of phonotactics—that is, to sequences of sounds. We analyze phonotactic sequences with different lengths within words and across word boundaries taken from a corpus of spoken English (Buckeye). We demonstrate that the expected relationship between the two scaling laws can only be attested when boundary spanning phonotactic sequences are also taken into account. Furthermore, it is shown that Zipf’s law exhibits both high goodness-of-fit and a high scaling coefficient if sequences of more than two sounds are considered. Our results support the notion that phonotactic cognition employs information about boundary spanning phonotactic sequences.
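The two laws named in the description can be illustrated with a small sketch. It assumes the usual textbook forms, Zipf's law as f(r) ∝ r^(-alpha) over frequency ranks and Heaps' law as V(N) ∝ N^beta over corpus size, and it assumes that the "expected relationship" refers to the commonly cited asymptotic link beta ≈ 1/alpha for alpha > 1; the record itself does not spell this out. All function names and the toy transcription are hypothetical, and the plain least-squares fit on log-log axes is a simplification, not the authors' estimation procedure.

```python
import math
from collections import Counter


def ngrams(phones, n):
    """Return all contiguous n-grams of a phone sequence as tuples."""
    return [tuple(phones[i:i + n]) for i in range(len(phones) - n + 1)]


def within_word_ngrams(words, n):
    """n-grams that stay inside a single word."""
    return [g for w in words for g in ngrams(w, n)]


def utterance_ngrams(words, n):
    """n-grams over the whole utterance, including boundary-spanning ones."""
    flat = [p for w in words for p in w]
    return ngrams(flat, n)


def loglog_slope(xs, ys):
    """Least-squares slope of log(y) against log(x); needs >= 2 distinct x."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    mx = sum(lx) / len(lx)
    my = sum(ly) / len(ly)
    num = sum((a - mx) * (b - my) for a, b in zip(lx, ly))
    den = sum((a - mx) ** 2 for a in lx)
    return num / den


def zipf_exponent(tokens):
    """Estimate alpha in f(r) ~ r^(-alpha) from the rank-frequency list."""
    freqs = sorted(Counter(tokens).values(), reverse=True)
    return -loglog_slope(range(1, len(freqs) + 1), freqs)


def heaps_curve(tokens):
    """Number of distinct types seen after each token."""
    seen, curve = set(), []
    for t in tokens:
        seen.add(t)
        curve.append(len(seen))
    return curve


def heaps_exponent(tokens):
    """Estimate beta in V(N) ~ N^beta from the type-token growth curve."""
    curve = heaps_curve(tokens)
    return loglog_slope(range(1, len(curve) + 1), curve)


# Toy utterance (hypothetical transcription, not Buckeye data).
words = [["k", "ae", "t"], ["s", "ih", "t"], ["k", "ae", "n"]]
inside = within_word_ngrams(words, 2)
spanning = utterance_ngrams(words, 2)
print(len(inside), len(spanning))
```

On real data one would compare `zipf_exponent` and `heaps_exponent` for the within-word and the utterance-level n-gram streams separately; the toy input here is far too small for the estimates to be meaningful.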
id pubmed-8406375
institution National Center for Biotechnology Information
record_format MEDLINE/PubMed
spelling pubmed-8406375 2021-09-01 Lang Speech Articles SAGE Publications 2020-08-01 2021-09 /pmc/articles/PMC8406375/ /pubmed/32744167 http://dx.doi.org/10.1177/0023830920944445 Text en © The Author(s) 2020 https://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
topic Articles