Cargando…

Factor Retention Using Machine Learning With Ordinal Data

Determining the number of factors in exploratory factor analysis is probably the most crucial decision when conducting the analysis as it clearly influences the meaningfulness of the results (i.e., factorial validity). A new method called the Factor Forest that combines data simulation and machine l...

Descripción completa

Detalles Bibliográficos
Autores principales: Goretzko, David, Bühner, Markus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9265486/
https://www.ncbi.nlm.nih.gov/pubmed/35812814
http://dx.doi.org/10.1177/01466216221089345
_version_ 1784743226582761472
author Goretzko, David
Bühner, Markus
author_facet Goretzko, David
Bühner, Markus
author_sort Goretzko, David
collection PubMed
description Determining the number of factors in exploratory factor analysis is probably the most crucial decision when conducting the analysis as it clearly influences the meaningfulness of the results (i.e., factorial validity). A new method called the Factor Forest that combines data simulation and machine learning has been developed recently. This method based on simulated data reached very high accuracy for multivariate normal data, but it has not yet been tested with ordinal data. Hence, in this simulation study, we evaluated the Factor Forest with ordinal data based on different numbers of categories (2–6 categories) and compared it to common factor retention criteria. It showed higher overall accuracy for all types of ordinal data than all common factor retention criteria that were used for comparison (Parallel Analysis, Comparison Data, the Empirical Kaiser Criterion and the Kaiser Guttman Rule). The results indicate that the Factor Forest is applicable to ordinal data with at least five categories (typical scale in questionnaire research) in the majority of conditions and to binary or ordinal data based on items with less categories when the sample size is large.
format Online
Article
Text
id pubmed-9265486
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-92654862022-07-09 Factor Retention Using Machine Learning With Ordinal Data Goretzko, David Bühner, Markus Appl Psychol Meas Articles Determining the number of factors in exploratory factor analysis is probably the most crucial decision when conducting the analysis as it clearly influences the meaningfulness of the results (i.e., factorial validity). A new method called the Factor Forest that combines data simulation and machine learning has been developed recently. This method based on simulated data reached very high accuracy for multivariate normal data, but it has not yet been tested with ordinal data. Hence, in this simulation study, we evaluated the Factor Forest with ordinal data based on different numbers of categories (2–6 categories) and compared it to common factor retention criteria. It showed higher overall accuracy for all types of ordinal data than all common factor retention criteria that were used for comparison (Parallel Analysis, Comparison Data, the Empirical Kaiser Criterion and the Kaiser Guttman Rule). The results indicate that the Factor Forest is applicable to ordinal data with at least five categories (typical scale in questionnaire research) in the majority of conditions and to binary or ordinal data based on items with less categories when the sample size is large. SAGE Publications 2022-05-04 2022-07 /pmc/articles/PMC9265486/ /pubmed/35812814 http://dx.doi.org/10.1177/01466216221089345 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Articles
Goretzko, David
Bühner, Markus
Factor Retention Using Machine Learning With Ordinal Data
title Factor Retention Using Machine Learning With Ordinal Data
title_full Factor Retention Using Machine Learning With Ordinal Data
title_fullStr Factor Retention Using Machine Learning With Ordinal Data
title_full_unstemmed Factor Retention Using Machine Learning With Ordinal Data
title_short Factor Retention Using Machine Learning With Ordinal Data
title_sort factor retention using machine learning with ordinal data
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9265486/
https://www.ncbi.nlm.nih.gov/pubmed/35812814
http://dx.doi.org/10.1177/01466216221089345
work_keys_str_mv AT goretzkodavid factorretentionusingmachinelearningwithordinaldata
AT buhnermarkus factorretentionusingmachinelearningwithordinaldata