Cargando…
LexiRumah: An online lexical database of the Lesser Sunda Islands
The Lesser Sunda Islands in eastern Indonesia cover a longitudinal distance of some 600 kilometres. They are the westernmost place where languages of the Austronesian family come into contact with a family of Papuan languages and constitute an area of high linguistic diversity. Despite its diversity...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6192618/ https://www.ncbi.nlm.nih.gov/pubmed/30332446 http://dx.doi.org/10.1371/journal.pone.0205250 |
_version_ | 1783363931131084800 |
---|---|
author | Kaiping, Gereon A. Klamer, Marian |
author_facet | Kaiping, Gereon A. Klamer, Marian |
author_sort | Kaiping, Gereon A. |
collection | PubMed |
description | The Lesser Sunda Islands in eastern Indonesia cover a longitudinal distance of some 600 kilometres. They are the westernmost place where languages of the Austronesian family come into contact with a family of Papuan languages and constitute an area of high linguistic diversity. Despite its diversity, the Lesser Sundas are little studied and for most of the region, written historical records, as well as archaeological and ethnographic data are lacking. In such circumstances the study of relationships between languages through their lexicon is a unique tool for making inferences about human (pre-)history and tracing population movements. However, the lack of a collective body of lexical data has severely limited our understanding of the history of the languages and peoples in the Lesser Sundas. The LexiRumah database fills this gap by assembling lexicons of Lesser Sunda languages from published and unpublished sources, and making those lexicons available online in a consistent format. This database makes it possible for researchers to explore the linguistic data collated from different primary sources, to formulate hypotheses on how the languages of the two families might be internally related and to compare competing hypotheses about subgroupings and language contact in the region. In this article, we present observations from aggregating lexical data from sources of different type and quality, including fieldwork, and generalize our lessons learned towards practical guidelines for creating a consistent database of comparable lexical items, derived from the design and development of LexiRumah. Databases like this are instrumental in developing theories of language evolution and change in understudied regions where small-scale, pre-industrial, pre-literate societies are the majority. It is therefore vital to follow reliable design choices when creating such databases, as described in this paper. |
format | Online Article Text |
id | pubmed-6192618 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-61926182018-11-05 LexiRumah: An online lexical database of the Lesser Sunda Islands Kaiping, Gereon A. Klamer, Marian PLoS One Research Article The Lesser Sunda Islands in eastern Indonesia cover a longitudinal distance of some 600 kilometres. They are the westernmost place where languages of the Austronesian family come into contact with a family of Papuan languages and constitute an area of high linguistic diversity. Despite its diversity, the Lesser Sundas are little studied and for most of the region, written historical records, as well as archaeological and ethnographic data are lacking. In such circumstances the study of relationships between languages through their lexicon is a unique tool for making inferences about human (pre-)history and tracing population movements. However, the lack of a collective body of lexical data has severely limited our understanding of the history of the languages and peoples in the Lesser Sundas. The LexiRumah database fills this gap by assembling lexicons of Lesser Sunda languages from published and unpublished sources, and making those lexicons available online in a consistent format. This database makes it possible for researchers to explore the linguistic data collated from different primary sources, to formulate hypotheses on how the languages of the two families might be internally related and to compare competing hypotheses about subgroupings and language contact in the region. In this article, we present observations from aggregating lexical data from sources of different type and quality, including fieldwork, and generalize our lessons learned towards practical guidelines for creating a consistent database of comparable lexical items, derived from the design and development of LexiRumah. Databases like this are instrumental in developing theories of language evolution and change in understudied regions where small-scale, pre-industrial, pre-literate societies are the majority. It is therefore vital to follow reliable design choices when creating such databases, as described in this paper. Public Library of Science 2018-10-17 /pmc/articles/PMC6192618/ /pubmed/30332446 http://dx.doi.org/10.1371/journal.pone.0205250 Text en © 2018 Kaiping, Klamer http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Kaiping, Gereon A. Klamer, Marian LexiRumah: An online lexical database of the Lesser Sunda Islands |
title | LexiRumah: An online lexical database of the Lesser Sunda Islands |
title_full | LexiRumah: An online lexical database of the Lesser Sunda Islands |
title_fullStr | LexiRumah: An online lexical database of the Lesser Sunda Islands |
title_full_unstemmed | LexiRumah: An online lexical database of the Lesser Sunda Islands |
title_short | LexiRumah: An online lexical database of the Lesser Sunda Islands |
title_sort | lexirumah: an online lexical database of the lesser sunda islands |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6192618/ https://www.ncbi.nlm.nih.gov/pubmed/30332446 http://dx.doi.org/10.1371/journal.pone.0205250 |
work_keys_str_mv | AT kaipinggereona lexirumahanonlinelexicaldatabaseofthelessersundaislands AT klamermarian lexirumahanonlinelexicaldatabaseofthelessersundaislands |