Cargando…

Detecting rater bias using a person-fit statistic: a Monte Carlo simulation study

INTRODUCTION: With the Standards voicing concern for the appropriateness of response processes, we need to explore strategies that would allow us to identify inappropriate rater response processes. Although certain statistics can be used to help detect rater bias, their use is complicated by either...

Descripción completa

Detalles Bibliográficos
Autores principales: Aubin, André-Sébastien, St-Onge, Christina, Renaud, Jean-Sébastien
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Bohn Stafleu van Loghum 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5889374/
https://www.ncbi.nlm.nih.gov/pubmed/29294255
http://dx.doi.org/10.1007/s40037-017-0391-8
_version_ 1783312681471574016
author Aubin, André-Sébastien
St-Onge, Christina
Renaud, Jean-Sébastien
author_facet Aubin, André-Sébastien
St-Onge, Christina
Renaud, Jean-Sébastien
author_sort Aubin, André-Sébastien
collection PubMed
description INTRODUCTION: With the Standards voicing concern for the appropriateness of response processes, we need to explore strategies that would allow us to identify inappropriate rater response processes. Although certain statistics can be used to help detect rater bias, their use is complicated by either a lack of data about their actual power to detect rater bias or the difficulty related to their application in the context of health professions education. This exploratory study aimed to establish the worthiness of pursuing the use of l (z) to detect rater bias. METHODS: We conducted a Monte Carlo simulation study to investigate the power of a specific detection statistic, that is: the standardized likelihood l (z) person-fit statistics (PFS). Our primary outcome was the detection rate of biased raters, namely: raters whom we manipulated into being either stringent (giving lower scores) or lenient (giving higher scores), using the l (z) statistic while controlling for the number of biased raters in a sample (6 levels) and the rate of bias per rater (6 levels). RESULTS: Overall, stringent raters (M = 0.84, SD = 0.23) were easier to detect than lenient raters (M = 0.31, SD = 0.28). More biased raters were easier to detect then less biased raters (60% bias: 62, SD = 0.37; 10% bias: 43, SD = 0.36). DISCUSSION: The PFS l (z) seems to offer an interesting potential to identify biased raters. We observed detection rates as high as 90% for stringent raters, for whom we manipulated more than half their checklist. Although we observed very interesting results, we cannot generalize these results to the use of PFS with estimated item/station parameters or real data. Such studies should be conducted to assess the feasibility of using PFS to identify rater bias.
format Online
Article
Text
id pubmed-5889374
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Bohn Stafleu van Loghum
record_format MEDLINE/PubMed
spelling pubmed-58893742018-04-12 Detecting rater bias using a person-fit statistic: a Monte Carlo simulation study Aubin, André-Sébastien St-Onge, Christina Renaud, Jean-Sébastien Perspect Med Educ Original Article INTRODUCTION: With the Standards voicing concern for the appropriateness of response processes, we need to explore strategies that would allow us to identify inappropriate rater response processes. Although certain statistics can be used to help detect rater bias, their use is complicated by either a lack of data about their actual power to detect rater bias or the difficulty related to their application in the context of health professions education. This exploratory study aimed to establish the worthiness of pursuing the use of l (z) to detect rater bias. METHODS: We conducted a Monte Carlo simulation study to investigate the power of a specific detection statistic, that is: the standardized likelihood l (z) person-fit statistics (PFS). Our primary outcome was the detection rate of biased raters, namely: raters whom we manipulated into being either stringent (giving lower scores) or lenient (giving higher scores), using the l (z) statistic while controlling for the number of biased raters in a sample (6 levels) and the rate of bias per rater (6 levels). RESULTS: Overall, stringent raters (M = 0.84, SD = 0.23) were easier to detect than lenient raters (M = 0.31, SD = 0.28). More biased raters were easier to detect then less biased raters (60% bias: 62, SD = 0.37; 10% bias: 43, SD = 0.36). DISCUSSION: The PFS l (z) seems to offer an interesting potential to identify biased raters. We observed detection rates as high as 90% for stringent raters, for whom we manipulated more than half their checklist. Although we observed very interesting results, we cannot generalize these results to the use of PFS with estimated item/station parameters or real data. Such studies should be conducted to assess the feasibility of using PFS to identify rater bias. Bohn Stafleu van Loghum 2018-01-02 2018-04 /pmc/articles/PMC5889374/ /pubmed/29294255 http://dx.doi.org/10.1007/s40037-017-0391-8 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Original Article
Aubin, André-Sébastien
St-Onge, Christina
Renaud, Jean-Sébastien
Detecting rater bias using a person-fit statistic: a Monte Carlo simulation study
title Detecting rater bias using a person-fit statistic: a Monte Carlo simulation study
title_full Detecting rater bias using a person-fit statistic: a Monte Carlo simulation study
title_fullStr Detecting rater bias using a person-fit statistic: a Monte Carlo simulation study
title_full_unstemmed Detecting rater bias using a person-fit statistic: a Monte Carlo simulation study
title_short Detecting rater bias using a person-fit statistic: a Monte Carlo simulation study
title_sort detecting rater bias using a person-fit statistic: a monte carlo simulation study
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5889374/
https://www.ncbi.nlm.nih.gov/pubmed/29294255
http://dx.doi.org/10.1007/s40037-017-0391-8
work_keys_str_mv AT aubinandresebastien detectingraterbiasusingapersonfitstatisticamontecarlosimulationstudy
AT stongechristina detectingraterbiasusingapersonfitstatisticamontecarlosimulationstudy
AT renaudjeansebastien detectingraterbiasusingapersonfitstatisticamontecarlosimulationstudy