Cargando…
Factor Retention in Exploratory Factor Analysis With Missing Data
Determining the number of factors in exploratory factor analysis is arguably the most crucial decision a researcher faces when conducting the analysis. While several simulation studies exist that compare various so-called factor retention criteria under different data conditions, little is known abo...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9014734/ https://www.ncbi.nlm.nih.gov/pubmed/35444335 http://dx.doi.org/10.1177/00131644211022031 |
_version_ | 1784688244969963520 |
---|---|
author | Goretzko, David |
author_facet | Goretzko, David |
author_sort | Goretzko, David |
collection | PubMed |
description | Determining the number of factors in exploratory factor analysis is arguably the most crucial decision a researcher faces when conducting the analysis. While several simulation studies exist that compare various so-called factor retention criteria under different data conditions, little is known about the impact of missing data on this process. Hence, in this study, we evaluated the performance of different factor retention criteria—the Factor Forest, parallel analysis based on a principal component analysis as well as parallel analysis based on the common factor model and the comparison data approach—in combination with different missing data methods, namely an expectation-maximization algorithm called Amelia, predictive mean matching, and random forest imputation within the multiple imputations by chained equations (MICE) framework as well as pairwise deletion with regard to their accuracy in determining the number of factors when data are missing. Data were simulated for different sample sizes, numbers of factors, numbers of manifest variables (indicators), between-factor correlations, missing data mechanisms and proportions of missing values. In the majority of conditions and for all factor retention criteria except the comparison data approach, the missing data mechanism had little impact on the accuracy and pairwise deletion performed comparably well as the more sophisticated imputation methods. In some conditions, especially small-sample cases and when comparison data were used to determine the number of factors, random forest imputation was preferable to other missing data methods, though. Accordingly, depending on data characteristics and the selected factor retention criterion, choosing an appropriate missing data method is crucial to obtain a valid estimate of the number of factors to extract. |
format | Online Article Text |
id | pubmed-9014734 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-90147342022-04-19 Factor Retention in Exploratory Factor Analysis With Missing Data Goretzko, David Educ Psychol Meas Article Determining the number of factors in exploratory factor analysis is arguably the most crucial decision a researcher faces when conducting the analysis. While several simulation studies exist that compare various so-called factor retention criteria under different data conditions, little is known about the impact of missing data on this process. Hence, in this study, we evaluated the performance of different factor retention criteria—the Factor Forest, parallel analysis based on a principal component analysis as well as parallel analysis based on the common factor model and the comparison data approach—in combination with different missing data methods, namely an expectation-maximization algorithm called Amelia, predictive mean matching, and random forest imputation within the multiple imputations by chained equations (MICE) framework as well as pairwise deletion with regard to their accuracy in determining the number of factors when data are missing. Data were simulated for different sample sizes, numbers of factors, numbers of manifest variables (indicators), between-factor correlations, missing data mechanisms and proportions of missing values. In the majority of conditions and for all factor retention criteria except the comparison data approach, the missing data mechanism had little impact on the accuracy and pairwise deletion performed comparably well as the more sophisticated imputation methods. In some conditions, especially small-sample cases and when comparison data were used to determine the number of factors, random forest imputation was preferable to other missing data methods, though. Accordingly, depending on data characteristics and the selected factor retention criterion, choosing an appropriate missing data method is crucial to obtain a valid estimate of the number of factors to extract. SAGE Publications 2021-06-11 2022-06 /pmc/articles/PMC9014734/ /pubmed/35444335 http://dx.doi.org/10.1177/00131644211022031 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Article Goretzko, David Factor Retention in Exploratory Factor Analysis With Missing Data |
title | Factor Retention in Exploratory Factor Analysis With Missing Data |
title_full | Factor Retention in Exploratory Factor Analysis With Missing Data |
title_fullStr | Factor Retention in Exploratory Factor Analysis With Missing Data |
title_full_unstemmed | Factor Retention in Exploratory Factor Analysis With Missing Data |
title_short | Factor Retention in Exploratory Factor Analysis With Missing Data |
title_sort | factor retention in exploratory factor analysis with missing data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9014734/ https://www.ncbi.nlm.nih.gov/pubmed/35444335 http://dx.doi.org/10.1177/00131644211022031 |
work_keys_str_mv | AT goretzkodavid factorretentioninexploratoryfactoranalysiswithmissingdata |