Cargando…

Statistical universals of language: mathematical chance vs. human choice

This volume explores the universal mathematical properties underlying big language data and possible reasons why such properties exist, revealing how we may be unconsciously mathematical in our language use. These properties are statistical and thus different from linguistic universals that contribu...

Descripción completa

Detalles Bibliográficos
Autor principal: Tanaka-Ishii, Kumiko
Lenguaje:eng
Publicado: Springer 2021
Materias:
Acceso en línea:https://dx.doi.org/10.1007/978-3-030-59377-3
http://cds.cern.ch/record/2763334
_version_ 1780970903261151232
author Tanaka-Ishii, Kumiko
author_facet Tanaka-Ishii, Kumiko
author_sort Tanaka-Ishii, Kumiko
collection CERN
description This volume explores the universal mathematical properties underlying big language data and possible reasons why such properties exist, revealing how we may be unconsciously mathematical in our language use. These properties are statistical and thus different from linguistic universals that contribute to describing the variation of human languages, and they can only be identified over a large accumulation of usages. The book provides an overview of state-of-the art findings on these statistical universals and reconsiders the nature of language accordingly, with Zipf's law as a well-known example. The main focus of the book further lies in explaining the property of long memory, which was discovered and studied more recently by borrowing concepts from complex systems theory. The statistical universals not only possibly lie as the precursor of language system formation, but they also highlight the qualities of language that remain weak points in today's machine learning. In summary, this book provides an overview of language's global properties. It will be of interest to anyone engaged in fields related to language and computing or statistical analysis methods, with an emphasis on researchers and students in computational linguistics and natural language processing. While the book does apply mathematical concepts, all possible effort has been made to speak to a non-mathematical audience as well by communicating mathematical content intuitively, with concise examples taken from real texts.
id cern-2763334
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2021
publisher Springer
record_format invenio
spelling cern-27633342021-04-21T16:38:34Zdoi:10.1007/978-3-030-59377-3http://cds.cern.ch/record/2763334engTanaka-Ishii, KumikoStatistical universals of language: mathematical chance vs. human choiceMathematical Physics and MathematicsThis volume explores the universal mathematical properties underlying big language data and possible reasons why such properties exist, revealing how we may be unconsciously mathematical in our language use. These properties are statistical and thus different from linguistic universals that contribute to describing the variation of human languages, and they can only be identified over a large accumulation of usages. The book provides an overview of state-of-the art findings on these statistical universals and reconsiders the nature of language accordingly, with Zipf's law as a well-known example. The main focus of the book further lies in explaining the property of long memory, which was discovered and studied more recently by borrowing concepts from complex systems theory. The statistical universals not only possibly lie as the precursor of language system formation, but they also highlight the qualities of language that remain weak points in today's machine learning. In summary, this book provides an overview of language's global properties. It will be of interest to anyone engaged in fields related to language and computing or statistical analysis methods, with an emphasis on researchers and students in computational linguistics and natural language processing. While the book does apply mathematical concepts, all possible effort has been made to speak to a non-mathematical audience as well by communicating mathematical content intuitively, with concise examples taken from real texts.Springeroai:cds.cern.ch:27633342021
spellingShingle Mathematical Physics and Mathematics
Tanaka-Ishii, Kumiko
Statistical universals of language: mathematical chance vs. human choice
title Statistical universals of language: mathematical chance vs. human choice
title_full Statistical universals of language: mathematical chance vs. human choice
title_fullStr Statistical universals of language: mathematical chance vs. human choice
title_full_unstemmed Statistical universals of language: mathematical chance vs. human choice
title_short Statistical universals of language: mathematical chance vs. human choice
title_sort statistical universals of language: mathematical chance vs. human choice
topic Mathematical Physics and Mathematics
url https://dx.doi.org/10.1007/978-3-030-59377-3
http://cds.cern.ch/record/2763334
work_keys_str_mv AT tanakaishiikumiko statisticaluniversalsoflanguagemathematicalchancevshumanchoice