Cargando…
Risky Business: Factor Analysis of Survey Data – Assessing the Probability of Incorrect Dimensionalisation
This paper undertakes a systematic assessment of the extent to which factor analysis the correct number of latent dimensions (factors) when applied to ordered-categorical survey items (so-called Likert items). We simulate 2400 data sets of uni-dimensional Likert items that vary systematically over a...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4366083/ https://www.ncbi.nlm.nih.gov/pubmed/25789992 http://dx.doi.org/10.1371/journal.pone.0118900 |
_version_ | 1782362313302474752 |
---|---|
author | van der Eijk, Cees Rose, Jonathan |
author_facet | van der Eijk, Cees Rose, Jonathan |
author_sort | van der Eijk, Cees |
collection | PubMed |
description | This paper undertakes a systematic assessment of the extent to which factor analysis the correct number of latent dimensions (factors) when applied to ordered-categorical survey items (so-called Likert items). We simulate 2400 data sets of uni-dimensional Likert items that vary systematically over a range of conditions such as the underlying population distribution, the number of items, the level of random error, and characteristics of items and item-sets. Each of these datasets is factor analysed in a variety of ways that are frequently used in the extant literature, or that are recommended in current methodological texts. These include exploratory factor retention heuristics such as Kaiser’s criterion, Parallel Analysis and a non-graphical scree test, and (for exploratory and confirmatory analyses) evaluations of model fit. These analyses are conducted on the basis of Pearson and polychoric correlations. We find that, irrespective of the particular mode of analysis, factor analysis applied to ordered-categorical survey data very often leads to over-dimensionalisation. The magnitude of this risk depends on the specific way in which factor analysis is conducted, the number of items, the properties of the set of items, and the underlying population distribution. The paper concludes with a discussion of the consequences of over-dimensionalisation, and a brief mention of alternative modes of analysis that are much less prone to such problems. |
format | Online Article Text |
id | pubmed-4366083 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-43660832015-03-23 Risky Business: Factor Analysis of Survey Data – Assessing the Probability of Incorrect Dimensionalisation van der Eijk, Cees Rose, Jonathan PLoS One Research Article This paper undertakes a systematic assessment of the extent to which factor analysis the correct number of latent dimensions (factors) when applied to ordered-categorical survey items (so-called Likert items). We simulate 2400 data sets of uni-dimensional Likert items that vary systematically over a range of conditions such as the underlying population distribution, the number of items, the level of random error, and characteristics of items and item-sets. Each of these datasets is factor analysed in a variety of ways that are frequently used in the extant literature, or that are recommended in current methodological texts. These include exploratory factor retention heuristics such as Kaiser’s criterion, Parallel Analysis and a non-graphical scree test, and (for exploratory and confirmatory analyses) evaluations of model fit. These analyses are conducted on the basis of Pearson and polychoric correlations. We find that, irrespective of the particular mode of analysis, factor analysis applied to ordered-categorical survey data very often leads to over-dimensionalisation. The magnitude of this risk depends on the specific way in which factor analysis is conducted, the number of items, the properties of the set of items, and the underlying population distribution. The paper concludes with a discussion of the consequences of over-dimensionalisation, and a brief mention of alternative modes of analysis that are much less prone to such problems. Public Library of Science 2015-03-19 /pmc/articles/PMC4366083/ /pubmed/25789992 http://dx.doi.org/10.1371/journal.pone.0118900 Text en © 2015 van der Eijk, Rose http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article van der Eijk, Cees Rose, Jonathan Risky Business: Factor Analysis of Survey Data – Assessing the Probability of Incorrect Dimensionalisation |
title | Risky Business: Factor Analysis of Survey Data – Assessing the Probability of Incorrect Dimensionalisation |
title_full | Risky Business: Factor Analysis of Survey Data – Assessing the Probability of Incorrect Dimensionalisation |
title_fullStr | Risky Business: Factor Analysis of Survey Data – Assessing the Probability of Incorrect Dimensionalisation |
title_full_unstemmed | Risky Business: Factor Analysis of Survey Data – Assessing the Probability of Incorrect Dimensionalisation |
title_short | Risky Business: Factor Analysis of Survey Data – Assessing the Probability of Incorrect Dimensionalisation |
title_sort | risky business: factor analysis of survey data – assessing the probability of incorrect dimensionalisation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4366083/ https://www.ncbi.nlm.nih.gov/pubmed/25789992 http://dx.doi.org/10.1371/journal.pone.0118900 |
work_keys_str_mv | AT vandereijkcees riskybusinessfactoranalysisofsurveydataassessingtheprobabilityofincorrectdimensionalisation AT rosejonathan riskybusinessfactoranalysisofsurveydataassessingtheprobabilityofincorrectdimensionalisation |