Cargando…

Factor Retention in Exploratory Factor Analysis With Missing Data

Determining the number of factors in exploratory factor analysis is arguably the most crucial decision a researcher faces when conducting the analysis. While several simulation studies exist that compare various so-called factor retention criteria under different data conditions, little is known abo...

Descripción completa

Detalles Bibliográficos
Autor principal: Goretzko, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9014734/
https://www.ncbi.nlm.nih.gov/pubmed/35444335
http://dx.doi.org/10.1177/00131644211022031
_version_ 1784688244969963520
author Goretzko, David
author_facet Goretzko, David
author_sort Goretzko, David
collection PubMed
description Determining the number of factors in exploratory factor analysis is arguably the most crucial decision a researcher faces when conducting the analysis. While several simulation studies exist that compare various so-called factor retention criteria under different data conditions, little is known about the impact of missing data on this process. Hence, in this study, we evaluated the performance of different factor retention criteria—the Factor Forest, parallel analysis based on a principal component analysis as well as parallel analysis based on the common factor model and the comparison data approach—in combination with different missing data methods, namely an expectation-maximization algorithm called Amelia, predictive mean matching, and random forest imputation within the multiple imputations by chained equations (MICE) framework as well as pairwise deletion with regard to their accuracy in determining the number of factors when data are missing. Data were simulated for different sample sizes, numbers of factors, numbers of manifest variables (indicators), between-factor correlations, missing data mechanisms and proportions of missing values. In the majority of conditions and for all factor retention criteria except the comparison data approach, the missing data mechanism had little impact on the accuracy and pairwise deletion performed comparably well as the more sophisticated imputation methods. In some conditions, especially small-sample cases and when comparison data were used to determine the number of factors, random forest imputation was preferable to other missing data methods, though. Accordingly, depending on data characteristics and the selected factor retention criterion, choosing an appropriate missing data method is crucial to obtain a valid estimate of the number of factors to extract.
format Online
Article
Text
id pubmed-9014734
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-90147342022-04-19 Factor Retention in Exploratory Factor Analysis With Missing Data Goretzko, David Educ Psychol Meas Article Determining the number of factors in exploratory factor analysis is arguably the most crucial decision a researcher faces when conducting the analysis. While several simulation studies exist that compare various so-called factor retention criteria under different data conditions, little is known about the impact of missing data on this process. Hence, in this study, we evaluated the performance of different factor retention criteria—the Factor Forest, parallel analysis based on a principal component analysis as well as parallel analysis based on the common factor model and the comparison data approach—in combination with different missing data methods, namely an expectation-maximization algorithm called Amelia, predictive mean matching, and random forest imputation within the multiple imputations by chained equations (MICE) framework as well as pairwise deletion with regard to their accuracy in determining the number of factors when data are missing. Data were simulated for different sample sizes, numbers of factors, numbers of manifest variables (indicators), between-factor correlations, missing data mechanisms and proportions of missing values. In the majority of conditions and for all factor retention criteria except the comparison data approach, the missing data mechanism had little impact on the accuracy and pairwise deletion performed comparably well as the more sophisticated imputation methods. In some conditions, especially small-sample cases and when comparison data were used to determine the number of factors, random forest imputation was preferable to other missing data methods, though. Accordingly, depending on data characteristics and the selected factor retention criterion, choosing an appropriate missing data method is crucial to obtain a valid estimate of the number of factors to extract. SAGE Publications 2021-06-11 2022-06 /pmc/articles/PMC9014734/ /pubmed/35444335 http://dx.doi.org/10.1177/00131644211022031 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Article
Goretzko, David
Factor Retention in Exploratory Factor Analysis With Missing Data
title Factor Retention in Exploratory Factor Analysis With Missing Data
title_full Factor Retention in Exploratory Factor Analysis With Missing Data
title_fullStr Factor Retention in Exploratory Factor Analysis With Missing Data
title_full_unstemmed Factor Retention in Exploratory Factor Analysis With Missing Data
title_short Factor Retention in Exploratory Factor Analysis With Missing Data
title_sort factor retention in exploratory factor analysis with missing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9014734/
https://www.ncbi.nlm.nih.gov/pubmed/35444335
http://dx.doi.org/10.1177/00131644211022031
work_keys_str_mv AT goretzkodavid factorretentioninexploratoryfactoranalysiswithmissingdata