Cargando…

Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings

Stretched words like ‘heellllp’ or ‘heyyyyy’ are a regular feature of spoken language, often used to emphasize or exaggerate the underlying meaning of the root word. While stretched words are rarely found in formal written language and dictionaries, they are prevalent within social media. In this pa...

Descripción completa

Detalles Bibliográficos
Autores principales: Gray, Tyler J., Danforth, Christopher M., Dodds, Peter Sheridan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7252599/
https://www.ncbi.nlm.nih.gov/pubmed/32459802
http://dx.doi.org/10.1371/journal.pone.0232938
_version_ 1783539178421616640
author Gray, Tyler J.
Danforth, Christopher M.
Dodds, Peter Sheridan
author_facet Gray, Tyler J.
Danforth, Christopher M.
Dodds, Peter Sheridan
author_sort Gray, Tyler J.
collection PubMed
description Stretched words like ‘heellllp’ or ‘heyyyyy’ are a regular feature of spoken language, often used to emphasize or exaggerate the underlying meaning of the root word. While stretched words are rarely found in formal written language and dictionaries, they are prevalent within social media. In this paper, we examine the frequency distributions of ‘stretchable words’ found in roughly 100 billion tweets authored over an 8 year period. We introduce two central parameters, ‘balance’ and ‘stretch’, that capture their main characteristics, and explore their dynamics by creating visual tools we call ‘balance plots’ and ‘spelling trees’. We discuss how the tools and methods we develop here could be used to study the statistical patterns of mistypings and misspellings and be used as a basis for other linguistic research involving stretchable words, along with the potential applications in augmenting dictionaries, improving language processing, and in any area where sequence construction matters, such as genetics.
format Online
Article
Text
id pubmed-7252599
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-72525992020-06-08 Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings Gray, Tyler J. Danforth, Christopher M. Dodds, Peter Sheridan PLoS One Research Article Stretched words like ‘heellllp’ or ‘heyyyyy’ are a regular feature of spoken language, often used to emphasize or exaggerate the underlying meaning of the root word. While stretched words are rarely found in formal written language and dictionaries, they are prevalent within social media. In this paper, we examine the frequency distributions of ‘stretchable words’ found in roughly 100 billion tweets authored over an 8 year period. We introduce two central parameters, ‘balance’ and ‘stretch’, that capture their main characteristics, and explore their dynamics by creating visual tools we call ‘balance plots’ and ‘spelling trees’. We discuss how the tools and methods we develop here could be used to study the statistical patterns of mistypings and misspellings and be used as a basis for other linguistic research involving stretchable words, along with the potential applications in augmenting dictionaries, improving language processing, and in any area where sequence construction matters, such as genetics. Public Library of Science 2020-05-27 /pmc/articles/PMC7252599/ /pubmed/32459802 http://dx.doi.org/10.1371/journal.pone.0232938 Text en © 2020 Gray et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Gray, Tyler J.
Danforth, Christopher M.
Dodds, Peter Sheridan
Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings
title Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings
title_full Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings
title_fullStr Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings
title_full_unstemmed Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings
title_short Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings
title_sort hahahahaha, duuuuude, yeeessss!: a two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7252599/
https://www.ncbi.nlm.nih.gov/pubmed/32459802
http://dx.doi.org/10.1371/journal.pone.0232938
work_keys_str_mv AT graytylerj hahahahahaduuuuudeyeeessssatwoparametercharacterizationofstretchablewordsandthedynamicsofmistypingsandmisspellings
AT danforthchristopherm hahahahahaduuuuudeyeeessssatwoparametercharacterizationofstretchablewordsandthedynamicsofmistypingsandmisspellings
AT doddspetersheridan hahahahahaduuuuudeyeeessssatwoparametercharacterizationofstretchablewordsandthedynamicsofmistypingsandmisspellings