Cargando…

Scale length does matter: Recommendations for measurement invariance testing with categorical factor analysis and item response theory approaches

In social sciences, the study of group differences concerning latent constructs is ubiquitous. These constructs are generally measured by means of scales composed of ordinal items. In order to compare these constructs across groups, one crucial requirement is that they are measured equivalently or,...

Descripción completa

Detalles Bibliográficos
Autores principales: D’Urso, E. Damiano, De Roover, Kim, Vermunt, Jeroen K., Tijmstra, Jesper
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9579096/
https://www.ncbi.nlm.nih.gov/pubmed/34910286
http://dx.doi.org/10.3758/s13428-021-01690-7
_version_ 1784812110362968064
author D’Urso, E. Damiano
De Roover, Kim
Vermunt, Jeroen K.
Tijmstra, Jesper
author_facet D’Urso, E. Damiano
De Roover, Kim
Vermunt, Jeroen K.
Tijmstra, Jesper
author_sort D’Urso, E. Damiano
collection PubMed
description In social sciences, the study of group differences concerning latent constructs is ubiquitous. These constructs are generally measured by means of scales composed of ordinal items. In order to compare these constructs across groups, one crucial requirement is that they are measured equivalently or, in technical jargon, that measurement invariance (MI) holds across the groups. This study compared the performance of scale- and item-level approaches based on multiple group categorical confirmatory factor analysis (MG-CCFA) and multiple group item response theory (MG-IRT) in testing MI with ordinal data. In general, the results of the simulation studies showed that MG-CCFA-based approaches outperformed MG-IRT-based approaches when testing MI at the scale level, whereas, at the item level, the best performing approach depends on the tested parameter (i.e., loadings or thresholds). That is, when testing loadings equivalence, the likelihood ratio test provided the best trade-off between true-positive rate and false-positive rate, whereas, when testing thresholds equivalence, the χ(2) test outperformed the other testing strategies. In addition, the performance of MG-CCFA’s fit measures, such as RMSEA and CFI, seemed to depend largely on the length of the scale, especially when MI was tested at the item level. General caution is recommended when using these measures, especially when MI is tested for each item individually.
format Online
Article
Text
id pubmed-9579096
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-95790962022-10-20 Scale length does matter: Recommendations for measurement invariance testing with categorical factor analysis and item response theory approaches D’Urso, E. Damiano De Roover, Kim Vermunt, Jeroen K. Tijmstra, Jesper Behav Res Methods Article In social sciences, the study of group differences concerning latent constructs is ubiquitous. These constructs are generally measured by means of scales composed of ordinal items. In order to compare these constructs across groups, one crucial requirement is that they are measured equivalently or, in technical jargon, that measurement invariance (MI) holds across the groups. This study compared the performance of scale- and item-level approaches based on multiple group categorical confirmatory factor analysis (MG-CCFA) and multiple group item response theory (MG-IRT) in testing MI with ordinal data. In general, the results of the simulation studies showed that MG-CCFA-based approaches outperformed MG-IRT-based approaches when testing MI at the scale level, whereas, at the item level, the best performing approach depends on the tested parameter (i.e., loadings or thresholds). That is, when testing loadings equivalence, the likelihood ratio test provided the best trade-off between true-positive rate and false-positive rate, whereas, when testing thresholds equivalence, the χ(2) test outperformed the other testing strategies. In addition, the performance of MG-CCFA’s fit measures, such as RMSEA and CFI, seemed to depend largely on the length of the scale, especially when MI was tested at the item level. General caution is recommended when using these measures, especially when MI is tested for each item individually. Springer US 2021-12-15 2022 /pmc/articles/PMC9579096/ /pubmed/34910286 http://dx.doi.org/10.3758/s13428-021-01690-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
D’Urso, E. Damiano
De Roover, Kim
Vermunt, Jeroen K.
Tijmstra, Jesper
Scale length does matter: Recommendations for measurement invariance testing with categorical factor analysis and item response theory approaches
title Scale length does matter: Recommendations for measurement invariance testing with categorical factor analysis and item response theory approaches
title_full Scale length does matter: Recommendations for measurement invariance testing with categorical factor analysis and item response theory approaches
title_fullStr Scale length does matter: Recommendations for measurement invariance testing with categorical factor analysis and item response theory approaches
title_full_unstemmed Scale length does matter: Recommendations for measurement invariance testing with categorical factor analysis and item response theory approaches
title_short Scale length does matter: Recommendations for measurement invariance testing with categorical factor analysis and item response theory approaches
title_sort scale length does matter: recommendations for measurement invariance testing with categorical factor analysis and item response theory approaches
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9579096/
https://www.ncbi.nlm.nih.gov/pubmed/34910286
http://dx.doi.org/10.3758/s13428-021-01690-7
work_keys_str_mv AT dursoedamiano scalelengthdoesmatterrecommendationsformeasurementinvariancetestingwithcategoricalfactoranalysisanditemresponsetheoryapproaches
AT derooverkim scalelengthdoesmatterrecommendationsformeasurementinvariancetestingwithcategoricalfactoranalysisanditemresponsetheoryapproaches
AT vermuntjeroenk scalelengthdoesmatterrecommendationsformeasurementinvariancetestingwithcategoricalfactoranalysisanditemresponsetheoryapproaches
AT tijmstrajesper scalelengthdoesmatterrecommendationsformeasurementinvariancetestingwithcategoricalfactoranalysisanditemresponsetheoryapproaches