Cargando…
Implicit Standardization in a Minority Language Community: Real-Time Syntactic Change among Hasidic Yiddish Writers
The recent turn to “big data” from social media corpora has enabled sociolinguists to investigate patterns of language variation and change at unprecedented scales. However, research in this paradigm has been slow to address variable phenomena in minority languages, where data scarcity and the absen...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7861311/ https://www.ncbi.nlm.nih.gov/pubmed/33733153 http://dx.doi.org/10.3389/frai.2020.00035 |
_version_ | 1783647059156402176 |
---|---|
author | Bleaman, Isaac L. |
author_facet | Bleaman, Isaac L. |
author_sort | Bleaman, Isaac L. |
collection | PubMed |
description | The recent turn to “big data” from social media corpora has enabled sociolinguists to investigate patterns of language variation and change at unprecedented scales. However, research in this paradigm has been slow to address variable phenomena in minority languages, where data scarcity and the absence of computational tools (e.g., taggers, parsers) often present significant barriers to entry. This article analyzes socio-syntactic variation in one minority language variety, Hasidic Yiddish, focusing on a variable for which tokens can be identified in raw text using purely morphological criteria. In non-finite particle verbs, the overt tense marker tsu (cf. English to, German zu) is variably realized either between the preverbal particle and verb (e.g., oyf-tsu-es-n up-to-eat-INF ‘to eat up’; the conservative variant) or before both elements (tsu oyf-es-n to up-eat-INF; the innovative variant). Nearly 38,000 tokens of non-finite particle verbs were extracted from the popular Hasidic Yiddish discussion forum Kave Shtiebel (the ‘coffee room’; kaveshtiebel.com). A mixed-effects regression analysis reveals that despite a forum-wide favoring effect for the innovative variant, users favor the conservative variant the longer their accounts remain open and active. This process of rapid implicit standardization is supported by ethnographic evidence highlighting the spread of language norms among Hasidic writers on the internet, most of whom did not have the opportunity to express themselves in written Yiddish prior to the advent of social media. |
format | Online Article Text |
id | pubmed-7861311 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-78613112021-03-16 Implicit Standardization in a Minority Language Community: Real-Time Syntactic Change among Hasidic Yiddish Writers Bleaman, Isaac L. Front Artif Intell Artificial Intelligence The recent turn to “big data” from social media corpora has enabled sociolinguists to investigate patterns of language variation and change at unprecedented scales. However, research in this paradigm has been slow to address variable phenomena in minority languages, where data scarcity and the absence of computational tools (e.g., taggers, parsers) often present significant barriers to entry. This article analyzes socio-syntactic variation in one minority language variety, Hasidic Yiddish, focusing on a variable for which tokens can be identified in raw text using purely morphological criteria. In non-finite particle verbs, the overt tense marker tsu (cf. English to, German zu) is variably realized either between the preverbal particle and verb (e.g., oyf-tsu-es-n up-to-eat-INF ‘to eat up’; the conservative variant) or before both elements (tsu oyf-es-n to up-eat-INF; the innovative variant). Nearly 38,000 tokens of non-finite particle verbs were extracted from the popular Hasidic Yiddish discussion forum Kave Shtiebel (the ‘coffee room’; kaveshtiebel.com). A mixed-effects regression analysis reveals that despite a forum-wide favoring effect for the innovative variant, users favor the conservative variant the longer their accounts remain open and active. This process of rapid implicit standardization is supported by ethnographic evidence highlighting the spread of language norms among Hasidic writers on the internet, most of whom did not have the opportunity to express themselves in written Yiddish prior to the advent of social media. Frontiers Media S.A. 2020-05-29 /pmc/articles/PMC7861311/ /pubmed/33733153 http://dx.doi.org/10.3389/frai.2020.00035 Text en Copyright © 2020 Bleaman. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Artificial Intelligence Bleaman, Isaac L. Implicit Standardization in a Minority Language Community: Real-Time Syntactic Change among Hasidic Yiddish Writers |
title | Implicit Standardization in a Minority Language Community: Real-Time Syntactic Change among Hasidic Yiddish Writers |
title_full | Implicit Standardization in a Minority Language Community: Real-Time Syntactic Change among Hasidic Yiddish Writers |
title_fullStr | Implicit Standardization in a Minority Language Community: Real-Time Syntactic Change among Hasidic Yiddish Writers |
title_full_unstemmed | Implicit Standardization in a Minority Language Community: Real-Time Syntactic Change among Hasidic Yiddish Writers |
title_short | Implicit Standardization in a Minority Language Community: Real-Time Syntactic Change among Hasidic Yiddish Writers |
title_sort | implicit standardization in a minority language community: real-time syntactic change among hasidic yiddish writers |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7861311/ https://www.ncbi.nlm.nih.gov/pubmed/33733153 http://dx.doi.org/10.3389/frai.2020.00035 |
work_keys_str_mv | AT bleamanisaacl implicitstandardizationinaminoritylanguagecommunityrealtimesyntacticchangeamonghasidicyiddishwriters |