
LexiRumah: An online lexical database of the Lesser Sunda Islands

The Lesser Sunda Islands in eastern Indonesia cover a longitudinal distance of some 600 kilometres. They are the westernmost place where languages of the Austronesian family come into contact with a family of Papuan languages and constitute an area of high linguistic diversity. Despite its diversity...


Bibliographic Details
Main authors: Kaiping, Gereon A., Klamer, Marian
Format: Online Article Text
Language: English
Published: Public Library of Science 2018
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6192618/
https://www.ncbi.nlm.nih.gov/pubmed/30332446
http://dx.doi.org/10.1371/journal.pone.0205250
author Kaiping, Gereon A.
Klamer, Marian
collection PubMed
description The Lesser Sunda Islands in eastern Indonesia cover a longitudinal distance of some 600 kilometres. They are the westernmost place where languages of the Austronesian family come into contact with a family of Papuan languages and constitute an area of high linguistic diversity. Despite its diversity, the Lesser Sundas are little studied and for most of the region, written historical records, as well as archaeological and ethnographic data are lacking. In such circumstances the study of relationships between languages through their lexicon is a unique tool for making inferences about human (pre-)history and tracing population movements. However, the lack of a collective body of lexical data has severely limited our understanding of the history of the languages and peoples in the Lesser Sundas. The LexiRumah database fills this gap by assembling lexicons of Lesser Sunda languages from published and unpublished sources, and making those lexicons available online in a consistent format. This database makes it possible for researchers to explore the linguistic data collated from different primary sources, to formulate hypotheses on how the languages of the two families might be internally related and to compare competing hypotheses about subgroupings and language contact in the region. In this article, we present observations from aggregating lexical data from sources of different type and quality, including fieldwork, and generalize our lessons learned towards practical guidelines for creating a consistent database of comparable lexical items, derived from the design and development of LexiRumah. Databases like this are instrumental in developing theories of language evolution and change in understudied regions where small-scale, pre-industrial, pre-literate societies are the majority. It is therefore vital to follow reliable design choices when creating such databases, as described in this paper.
format Online
Article
Text
id pubmed-6192618
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-6192618 2018-11-05 LexiRumah: An online lexical database of the Lesser Sunda Islands Kaiping, Gereon A. Klamer, Marian PLoS One Research Article Public Library of Science 2018-10-17 /pmc/articles/PMC6192618/ /pubmed/30332446 http://dx.doi.org/10.1371/journal.pone.0205250 Text en © 2018 Kaiping, Klamer http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title LexiRumah: An online lexical database of the Lesser Sunda Islands
topic Research Article