Cargando…

Population Size Predicts Lexical Diversity, but so Does the Mean Sea Level – Why It Is Important to Correctly Account for the Structure of Temporal Data

In order to demonstrate why it is important to correctly account for the (serial dependent) structure of temporal data, we document an apparently spectacular relationship between population size and lexical diversity: for five out of seven investigated languages, there is a strong relationship betwe...

Descripción completa

Detalles Bibliográficos
Autores principales: Koplenig, Alexander, Müller-Spitzer, Carolin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4777502/
https://www.ncbi.nlm.nih.gov/pubmed/26938719
http://dx.doi.org/10.1371/journal.pone.0150771
_version_ 1782419313473355776
author Koplenig, Alexander
Müller-Spitzer, Carolin
author_facet Koplenig, Alexander
Müller-Spitzer, Carolin
author_sort Koplenig, Alexander
collection PubMed
description In order to demonstrate why it is important to correctly account for the (serial dependent) structure of temporal data, we document an apparently spectacular relationship between population size and lexical diversity: for five out of seven investigated languages, there is a strong relationship between population size and lexical diversity of the primary language in this country. We show that this relationship is the result of a misspecified model that does not consider the temporal aspect of the data by presenting a similar but nonsensical relationship between the global annual mean sea level and lexical diversity. Given the fact that in the recent past, several studies were published that present surprising links between different economic, cultural, political and (socio-)demographical variables on the one hand and cultural or linguistic characteristics on the other hand, but seem to suffer from exactly this problem, we explain the cause of the misspecification and show that it has profound consequences. We demonstrate how simple transformation of the time series can often solve problems of this type and argue that the evaluation of the plausibility of a relationship is important in this context. We hope that our paper will help both researchers and reviewers to understand why it is important to use special models for the analysis of data with a natural temporal ordering.
format Online
Article
Text
id pubmed-4777502
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-47775022016-03-10 Population Size Predicts Lexical Diversity, but so Does the Mean Sea Level – Why It Is Important to Correctly Account for the Structure of Temporal Data Koplenig, Alexander Müller-Spitzer, Carolin PLoS One Research Article In order to demonstrate why it is important to correctly account for the (serial dependent) structure of temporal data, we document an apparently spectacular relationship between population size and lexical diversity: for five out of seven investigated languages, there is a strong relationship between population size and lexical diversity of the primary language in this country. We show that this relationship is the result of a misspecified model that does not consider the temporal aspect of the data by presenting a similar but nonsensical relationship between the global annual mean sea level and lexical diversity. Given the fact that in the recent past, several studies were published that present surprising links between different economic, cultural, political and (socio-)demographical variables on the one hand and cultural or linguistic characteristics on the other hand, but seem to suffer from exactly this problem, we explain the cause of the misspecification and show that it has profound consequences. We demonstrate how simple transformation of the time series can often solve problems of this type and argue that the evaluation of the plausibility of a relationship is important in this context. We hope that our paper will help both researchers and reviewers to understand why it is important to use special models for the analysis of data with a natural temporal ordering. Public Library of Science 2016-03-03 /pmc/articles/PMC4777502/ /pubmed/26938719 http://dx.doi.org/10.1371/journal.pone.0150771 Text en © 2016 Koplenig, Müller-Spitzer http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Koplenig, Alexander
Müller-Spitzer, Carolin
Population Size Predicts Lexical Diversity, but so Does the Mean Sea Level – Why It Is Important to Correctly Account for the Structure of Temporal Data
title Population Size Predicts Lexical Diversity, but so Does the Mean Sea Level – Why It Is Important to Correctly Account for the Structure of Temporal Data
title_full Population Size Predicts Lexical Diversity, but so Does the Mean Sea Level – Why It Is Important to Correctly Account for the Structure of Temporal Data
title_fullStr Population Size Predicts Lexical Diversity, but so Does the Mean Sea Level – Why It Is Important to Correctly Account for the Structure of Temporal Data
title_full_unstemmed Population Size Predicts Lexical Diversity, but so Does the Mean Sea Level – Why It Is Important to Correctly Account for the Structure of Temporal Data
title_short Population Size Predicts Lexical Diversity, but so Does the Mean Sea Level – Why It Is Important to Correctly Account for the Structure of Temporal Data
title_sort population size predicts lexical diversity, but so does the mean sea level – why it is important to correctly account for the structure of temporal data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4777502/
https://www.ncbi.nlm.nih.gov/pubmed/26938719
http://dx.doi.org/10.1371/journal.pone.0150771
work_keys_str_mv AT koplenigalexander populationsizepredictslexicaldiversitybutsodoesthemeansealevelwhyitisimportanttocorrectlyaccountforthestructureoftemporaldata
AT mullerspitzercarolin populationsizepredictslexicaldiversitybutsodoesthemeansealevelwhyitisimportanttocorrectlyaccountforthestructureoftemporaldata