Cargando…

Applying Logistic Regression to Detect Differential Item Functioning in Multidimensional Data

Conventional differential item functioning (DIF) approaches such as logistic regression (LR) often assume unidimensionality of a scale and match participants in the reference and focal groups based on total scores. However, many educational and psychological assessments are multidimensional by desig...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Hui-Fang, Jin, Kuan-Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6072859/
https://www.ncbi.nlm.nih.gov/pubmed/30100891
http://dx.doi.org/10.3389/fpsyg.2018.01302
_version_ 1783344073796485120
author Chen, Hui-Fang
Jin, Kuan-Yu
author_facet Chen, Hui-Fang
Jin, Kuan-Yu
author_sort Chen, Hui-Fang
collection PubMed
description Conventional differential item functioning (DIF) approaches such as logistic regression (LR) often assume unidimensionality of a scale and match participants in the reference and focal groups based on total scores. However, many educational and psychological assessments are multidimensional by design, and a matching variable using total scores that does not reflect the test structure may not be good practice in multidimensional items for DIF detection. We propose the use of all subscores of a scale in LR and compare its performance with alternative matching methods, including the use of total score and individual subscores. We focused on uniform DIF situation in which 250, 500, or 1,000 participants in each group answered 21 items reflecting two dimensions, and the 21st item was the studied item. Five factors were manipulated in the study: (a) the test structure, (b) numbers of cross-loaded items, (c) group differences in latent abilities, (d) the magnitude of DIF, and (e) group sample size. The results showed that, when the studied item measured a single domain, the conventional LR incorporating total scores as a matching variable yielded inflated false positive rates (FPRs) when two groups differed in one latent ability. The situation worsened when one group had a higher ability in one domain and lower ability in another. The LR using a single subscore as the matching variable performed well in terms of FPRs and true positive rates (TPRs) when two groups did not differ in either one latent ability or differed in one latent ability. However, this approach yielded inflated FPRs when two groups differed in two latent abilities. The proposed LR using two subscores yielded well-controlled FPRs across all conditions and yielded the highest TPRs. When the studied item measured two domains, the use of either the total score or two subscores worked well in the control of FPRs and yielded similar TPRs across conditions, whereas the use of a single subscore resulted in inflated FPRs when two groups differed in one or two latent abilities. In conclusion, we recommend the use of multiple subscores to match subjects in DIF detection for multidimensional data.
format Online
Article
Text
id pubmed-6072859
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-60728592018-08-10 Applying Logistic Regression to Detect Differential Item Functioning in Multidimensional Data Chen, Hui-Fang Jin, Kuan-Yu Front Psychol Psychology Conventional differential item functioning (DIF) approaches such as logistic regression (LR) often assume unidimensionality of a scale and match participants in the reference and focal groups based on total scores. However, many educational and psychological assessments are multidimensional by design, and a matching variable using total scores that does not reflect the test structure may not be good practice in multidimensional items for DIF detection. We propose the use of all subscores of a scale in LR and compare its performance with alternative matching methods, including the use of total score and individual subscores. We focused on uniform DIF situation in which 250, 500, or 1,000 participants in each group answered 21 items reflecting two dimensions, and the 21st item was the studied item. Five factors were manipulated in the study: (a) the test structure, (b) numbers of cross-loaded items, (c) group differences in latent abilities, (d) the magnitude of DIF, and (e) group sample size. The results showed that, when the studied item measured a single domain, the conventional LR incorporating total scores as a matching variable yielded inflated false positive rates (FPRs) when two groups differed in one latent ability. The situation worsened when one group had a higher ability in one domain and lower ability in another. The LR using a single subscore as the matching variable performed well in terms of FPRs and true positive rates (TPRs) when two groups did not differ in either one latent ability or differed in one latent ability. However, this approach yielded inflated FPRs when two groups differed in two latent abilities. The proposed LR using two subscores yielded well-controlled FPRs across all conditions and yielded the highest TPRs. When the studied item measured two domains, the use of either the total score or two subscores worked well in the control of FPRs and yielded similar TPRs across conditions, whereas the use of a single subscore resulted in inflated FPRs when two groups differed in one or two latent abilities. In conclusion, we recommend the use of multiple subscores to match subjects in DIF detection for multidimensional data. Frontiers Media S.A. 2018-07-27 /pmc/articles/PMC6072859/ /pubmed/30100891 http://dx.doi.org/10.3389/fpsyg.2018.01302 Text en Copyright © 2018 Chen and Jin. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Chen, Hui-Fang
Jin, Kuan-Yu
Applying Logistic Regression to Detect Differential Item Functioning in Multidimensional Data
title Applying Logistic Regression to Detect Differential Item Functioning in Multidimensional Data
title_full Applying Logistic Regression to Detect Differential Item Functioning in Multidimensional Data
title_fullStr Applying Logistic Regression to Detect Differential Item Functioning in Multidimensional Data
title_full_unstemmed Applying Logistic Regression to Detect Differential Item Functioning in Multidimensional Data
title_short Applying Logistic Regression to Detect Differential Item Functioning in Multidimensional Data
title_sort applying logistic regression to detect differential item functioning in multidimensional data
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6072859/
https://www.ncbi.nlm.nih.gov/pubmed/30100891
http://dx.doi.org/10.3389/fpsyg.2018.01302
work_keys_str_mv AT chenhuifang applyinglogisticregressiontodetectdifferentialitemfunctioninginmultidimensionaldata
AT jinkuanyu applyinglogisticregressiontodetectdifferentialitemfunctioninginmultidimensionaldata