Cargando…
Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges
The aim of this work is to identify and analyze a set of challenges that are likely to be encountered when one embarks on fieldwork in linguistic communities that feature small, young, and/or non-standard languages with a goal to elicit big sets of rich data. For each challenge, we (i) explain its n...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6382742/ https://www.ncbi.nlm.nih.gov/pubmed/30837922 http://dx.doi.org/10.3389/fpsyg.2019.00313 |
_version_ | 1783396706547662848 |
---|---|
author | Leivada, Evelina D’Alessandro, Roberta Grohmann, Kleanthes K. |
author_facet | Leivada, Evelina D’Alessandro, Roberta Grohmann, Kleanthes K. |
author_sort | Leivada, Evelina |
collection | PubMed |
description | The aim of this work is to identify and analyze a set of challenges that are likely to be encountered when one embarks on fieldwork in linguistic communities that feature small, young, and/or non-standard languages with a goal to elicit big sets of rich data. For each challenge, we (i) explain its nature and implications, (ii) offer one or more examples of how it is manifested in actual linguistic communities, and (iii) where possible, offer recommendations for addressing it effectively. Our list of challenges involves static characteristics (e.g., absence of orthographic conventions and how it affects data collection), dynamic processes (e.g., speed of language change in small languages and how it affects longitudinal collection of big amounts of data), and interactive relations between non-dynamic features that are nevertheless subject to potentially rapid change (e.g., absence of standardized assessment tools or estimates for psycholinguistic variables). The identified challenges represent the domains of data collection and handling, participant recruitment, and experimental design. Among other issues, we discuss population limits and degree of power, inter- and intraspeaker variation, absence of metalanguage and its implications for the process of eliciting acceptability judgments, and challenges that arise from absence of local funding, conflicting regulations in relation to privacy issues, and exporting large samples of data across countries. Finally, the ten experimental challenges presented are relevant to languages from a broad typological spectrum, encompassing both spoken and sign, extant and nearly extinct languages. |
format | Online Article Text |
id | pubmed-6382742 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-63827422019-03-05 Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges Leivada, Evelina D’Alessandro, Roberta Grohmann, Kleanthes K. Front Psychol Psychology The aim of this work is to identify and analyze a set of challenges that are likely to be encountered when one embarks on fieldwork in linguistic communities that feature small, young, and/or non-standard languages with a goal to elicit big sets of rich data. For each challenge, we (i) explain its nature and implications, (ii) offer one or more examples of how it is manifested in actual linguistic communities, and (iii) where possible, offer recommendations for addressing it effectively. Our list of challenges involves static characteristics (e.g., absence of orthographic conventions and how it affects data collection), dynamic processes (e.g., speed of language change in small languages and how it affects longitudinal collection of big amounts of data), and interactive relations between non-dynamic features that are nevertheless subject to potentially rapid change (e.g., absence of standardized assessment tools or estimates for psycholinguistic variables). The identified challenges represent the domains of data collection and handling, participant recruitment, and experimental design. Among other issues, we discuss population limits and degree of power, inter- and intraspeaker variation, absence of metalanguage and its implications for the process of eliciting acceptability judgments, and challenges that arise from absence of local funding, conflicting regulations in relation to privacy issues, and exporting large samples of data across countries. Finally, the ten experimental challenges presented are relevant to languages from a broad typological spectrum, encompassing both spoken and sign, extant and nearly extinct languages. Frontiers Media S.A. 2019-02-14 /pmc/articles/PMC6382742/ /pubmed/30837922 http://dx.doi.org/10.3389/fpsyg.2019.00313 Text en Copyright © 2019 Leivada, D’Alessandro and Grohmann. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Psychology Leivada, Evelina D’Alessandro, Roberta Grohmann, Kleanthes K. Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges |
title | Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges |
title_full | Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges |
title_fullStr | Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges |
title_full_unstemmed | Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges |
title_short | Eliciting Big Data From Small, Young, or Non-standard Languages: 10 Experimental Challenges |
title_sort | eliciting big data from small, young, or non-standard languages: 10 experimental challenges |
topic | Psychology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6382742/ https://www.ncbi.nlm.nih.gov/pubmed/30837922 http://dx.doi.org/10.3389/fpsyg.2019.00313 |
work_keys_str_mv | AT leivadaevelina elicitingbigdatafromsmallyoungornonstandardlanguages10experimentalchallenges AT dalessandroroberta elicitingbigdatafromsmallyoungornonstandardlanguages10experimentalchallenges AT grohmannkleanthesk elicitingbigdatafromsmallyoungornonstandardlanguages10experimentalchallenges |