Cargando…
Factor Retention Using Machine Learning With Ordinal Data
Determining the number of factors in exploratory factor analysis is probably the most crucial decision when conducting the analysis as it clearly influences the meaningfulness of the results (i.e., factorial validity). A new method called the Factor Forest that combines data simulation and machine l...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9265486/ https://www.ncbi.nlm.nih.gov/pubmed/35812814 http://dx.doi.org/10.1177/01466216221089345 |
_version_ | 1784743226582761472 |
---|---|
author | Goretzko, David Bühner, Markus |
author_facet | Goretzko, David Bühner, Markus |
author_sort | Goretzko, David |
collection | PubMed |
description | Determining the number of factors in exploratory factor analysis is probably the most crucial decision when conducting the analysis as it clearly influences the meaningfulness of the results (i.e., factorial validity). A new method called the Factor Forest that combines data simulation and machine learning has been developed recently. This method based on simulated data reached very high accuracy for multivariate normal data, but it has not yet been tested with ordinal data. Hence, in this simulation study, we evaluated the Factor Forest with ordinal data based on different numbers of categories (2–6 categories) and compared it to common factor retention criteria. It showed higher overall accuracy for all types of ordinal data than all common factor retention criteria that were used for comparison (Parallel Analysis, Comparison Data, the Empirical Kaiser Criterion and the Kaiser Guttman Rule). The results indicate that the Factor Forest is applicable to ordinal data with at least five categories (typical scale in questionnaire research) in the majority of conditions and to binary or ordinal data based on items with less categories when the sample size is large. |
format | Online Article Text |
id | pubmed-9265486 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-92654862022-07-09 Factor Retention Using Machine Learning With Ordinal Data Goretzko, David Bühner, Markus Appl Psychol Meas Articles Determining the number of factors in exploratory factor analysis is probably the most crucial decision when conducting the analysis as it clearly influences the meaningfulness of the results (i.e., factorial validity). A new method called the Factor Forest that combines data simulation and machine learning has been developed recently. This method based on simulated data reached very high accuracy for multivariate normal data, but it has not yet been tested with ordinal data. Hence, in this simulation study, we evaluated the Factor Forest with ordinal data based on different numbers of categories (2–6 categories) and compared it to common factor retention criteria. It showed higher overall accuracy for all types of ordinal data than all common factor retention criteria that were used for comparison (Parallel Analysis, Comparison Data, the Empirical Kaiser Criterion and the Kaiser Guttman Rule). The results indicate that the Factor Forest is applicable to ordinal data with at least five categories (typical scale in questionnaire research) in the majority of conditions and to binary or ordinal data based on items with less categories when the sample size is large. SAGE Publications 2022-05-04 2022-07 /pmc/articles/PMC9265486/ /pubmed/35812814 http://dx.doi.org/10.1177/01466216221089345 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Articles Goretzko, David Bühner, Markus Factor Retention Using Machine Learning With Ordinal Data |
title | Factor Retention Using Machine Learning With Ordinal Data |
title_full | Factor Retention Using Machine Learning With Ordinal Data |
title_fullStr | Factor Retention Using Machine Learning With Ordinal Data |
title_full_unstemmed | Factor Retention Using Machine Learning With Ordinal Data |
title_short | Factor Retention Using Machine Learning With Ordinal Data |
title_sort | factor retention using machine learning with ordinal data |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9265486/ https://www.ncbi.nlm.nih.gov/pubmed/35812814 http://dx.doi.org/10.1177/01466216221089345 |
work_keys_str_mv | AT goretzkodavid factorretentionusingmachinelearningwithordinaldata AT buhnermarkus factorretentionusingmachinelearningwithordinaldata |