Cargando…
An inconvenient dataset: bias and inappropriate inference with the multilevel model
The multilevel model has become a staple of social research. I textually and formally explicate sample design features that, I contend, are required for unbiased estimation of macro-level multilevel model parameters and the use of tools for statistical inference, such as standard errors. After detai...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Netherlands
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3965847/ https://www.ncbi.nlm.nih.gov/pubmed/24683276 http://dx.doi.org/10.1007/s11135-013-9865-x |
_version_ | 1782308846832713728 |
---|---|
author | Lucas, Samuel R. |
author_facet | Lucas, Samuel R. |
author_sort | Lucas, Samuel R. |
collection | PubMed |
description | The multilevel model has become a staple of social research. I textually and formally explicate sample design features that, I contend, are required for unbiased estimation of macro-level multilevel model parameters and the use of tools for statistical inference, such as standard errors. After detailing the limited and conflicting guidance on sample design in the multilevel model didactic literature, illustrative nationally-representative datasets and published examples that violate the posited requirements are identified. Because the didactic literature is either silent on sample design requirements or in disagreement with the constraints posited here, two Monte Carlo simulations are conducted to clarify the issues. The results indicate that bias follows use of samples that fail to satisfy the requirements outlined; notably, the bias is poorly-behaved, such that estimates provide neither upper nor lower bounds for the population parameter. Further, hypothesis tests are unjustified. Thus, published multilevel model analyses using many workhorse datasets, including NELS, AdHealth, NLSY, GSS, PSID, and SIPP, often unwittingly convey substantive results and theoretical conclusions that lack foundation. Future research using the multilevel model should be limited to cases that satisfy the sample requirements described. |
format | Online Article Text |
id | pubmed-3965847 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Springer Netherlands |
record_format | MEDLINE/PubMed |
spelling | pubmed-39658472014-03-28 An inconvenient dataset: bias and inappropriate inference with the multilevel model Lucas, Samuel R. Qual Quant Article The multilevel model has become a staple of social research. I textually and formally explicate sample design features that, I contend, are required for unbiased estimation of macro-level multilevel model parameters and the use of tools for statistical inference, such as standard errors. After detailing the limited and conflicting guidance on sample design in the multilevel model didactic literature, illustrative nationally-representative datasets and published examples that violate the posited requirements are identified. Because the didactic literature is either silent on sample design requirements or in disagreement with the constraints posited here, two Monte Carlo simulations are conducted to clarify the issues. The results indicate that bias follows use of samples that fail to satisfy the requirements outlined; notably, the bias is poorly-behaved, such that estimates provide neither upper nor lower bounds for the population parameter. Further, hypothesis tests are unjustified. Thus, published multilevel model analyses using many workhorse datasets, including NELS, AdHealth, NLSY, GSS, PSID, and SIPP, often unwittingly convey substantive results and theoretical conclusions that lack foundation. Future research using the multilevel model should be limited to cases that satisfy the sample requirements described. Springer Netherlands 2013-06-06 2014 /pmc/articles/PMC3965847/ /pubmed/24683276 http://dx.doi.org/10.1007/s11135-013-9865-x Text en © The Author(s) 2013 https://creativecommons.org/licenses/by/2.0/ Open AccessThis article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited. |
spellingShingle | Article Lucas, Samuel R. An inconvenient dataset: bias and inappropriate inference with the multilevel model |
title | An inconvenient dataset: bias and inappropriate inference with the multilevel model |
title_full | An inconvenient dataset: bias and inappropriate inference with the multilevel model |
title_fullStr | An inconvenient dataset: bias and inappropriate inference with the multilevel model |
title_full_unstemmed | An inconvenient dataset: bias and inappropriate inference with the multilevel model |
title_short | An inconvenient dataset: bias and inappropriate inference with the multilevel model |
title_sort | inconvenient dataset: bias and inappropriate inference with the multilevel model |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3965847/ https://www.ncbi.nlm.nih.gov/pubmed/24683276 http://dx.doi.org/10.1007/s11135-013-9865-x |
work_keys_str_mv | AT lucassamuelr aninconvenientdatasetbiasandinappropriateinferencewiththemultilevelmodel AT lucassamuelr inconvenientdatasetbiasandinappropriateinferencewiththemultilevelmodel |