Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges

The aim of this work is to identify and analyze a set of challenges that are likely to be encountered when one embarks on fieldwork in linguistic communities that feature small, young, and/or non-standard languages with a goal to elicit big sets of rich data. For each challenge, we (i) explain its n...

Bibliographic Details
Main authors: Leivada, Evelina, D’Alessandro, Roberta, Grohmann, Kleanthes K.
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6382742/
https://www.ncbi.nlm.nih.gov/pubmed/30837922
http://dx.doi.org/10.3389/fpsyg.2019.00313
author Leivada, Evelina
D’Alessandro, Roberta
Grohmann, Kleanthes K.
collection PubMed
description The aim of this work is to identify and analyze a set of challenges that are likely to be encountered when one embarks on fieldwork in linguistic communities that feature small, young, and/or non-standard languages with a goal to elicit big sets of rich data. For each challenge, we (i) explain its nature and implications, (ii) offer one or more examples of how it is manifested in actual linguistic communities, and (iii) where possible, offer recommendations for addressing it effectively. Our list of challenges involves static characteristics (e.g., absence of orthographic conventions and how it affects data collection), dynamic processes (e.g., speed of language change in small languages and how it affects longitudinal collection of big amounts of data), and interactive relations between non-dynamic features that are nevertheless subject to potentially rapid change (e.g., absence of standardized assessment tools or estimates for psycholinguistic variables). The identified challenges represent the domains of data collection and handling, participant recruitment, and experimental design. Among other issues, we discuss population limits and degree of power, inter- and intraspeaker variation, absence of metalanguage and its implications for the process of eliciting acceptability judgments, and challenges that arise from absence of local funding, conflicting regulations in relation to privacy issues, and exporting large samples of data across countries. Finally, the ten experimental challenges presented are relevant to languages from a broad typological spectrum, encompassing both spoken and sign, extant and nearly extinct languages.
format Online
Article
Text
id pubmed-6382742
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-6382742 2019-03-05 Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges Leivada, Evelina; D’Alessandro, Roberta; Grohmann, Kleanthes K. Front Psychol Psychology Frontiers Media S.A. 2019-02-14 /pmc/articles/PMC6382742/ /pubmed/30837922 http://dx.doi.org/10.3389/fpsyg.2019.00313 Text en Copyright © 2019 Leivada, D’Alessandro and Grohmann. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges
topic Psychology