Cargando…

A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment

We propose a novel approach to modelling rater effects in scoring-based assessment. The approach is based on a Bayesian hierarchical model and simulations from the posterior distribution. We apply it to large-scale essay assessment data over a period of 5 years. Empirical results suggest that the mo...

Descripción completa

Detalles Bibliográficos
Autores principales: Zupanc, Kaja, Štrumbelj, Erik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5882162/
https://www.ncbi.nlm.nih.gov/pubmed/29614129
http://dx.doi.org/10.1371/journal.pone.0195297
_version_ 1783311422748360704
author Zupanc, Kaja
Štrumbelj, Erik
author_facet Zupanc, Kaja
Štrumbelj, Erik
author_sort Zupanc, Kaja
collection PubMed
description We propose a novel approach to modelling rater effects in scoring-based assessment. The approach is based on a Bayesian hierarchical model and simulations from the posterior distribution. We apply it to large-scale essay assessment data over a period of 5 years. Empirical results suggest that the model provides a good fit for both the total scores and when applied to individual rubrics. We estimate the median impact of rater effects on the final grade to be ± 2 points on a 50 point scale, while 10% of essays would receive a score at least ± 5 different from their actual quality. Most of the impact is due to rater unreliability, not rater bias.
format Online
Article
Text
id pubmed-5882162
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-58821622018-04-13 A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment Zupanc, Kaja Štrumbelj, Erik PLoS One Research Article We propose a novel approach to modelling rater effects in scoring-based assessment. The approach is based on a Bayesian hierarchical model and simulations from the posterior distribution. We apply it to large-scale essay assessment data over a period of 5 years. Empirical results suggest that the model provides a good fit for both the total scores and when applied to individual rubrics. We estimate the median impact of rater effects on the final grade to be ± 2 points on a 50 point scale, while 10% of essays would receive a score at least ± 5 different from their actual quality. Most of the impact is due to rater unreliability, not rater bias. Public Library of Science 2018-04-03 /pmc/articles/PMC5882162/ /pubmed/29614129 http://dx.doi.org/10.1371/journal.pone.0195297 Text en © 2018 Zupanc, Štrumbelj http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zupanc, Kaja
Štrumbelj, Erik
A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment
title A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment
title_full A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment
title_fullStr A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment
title_full_unstemmed A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment
title_short A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment
title_sort bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5882162/
https://www.ncbi.nlm.nih.gov/pubmed/29614129
http://dx.doi.org/10.1371/journal.pone.0195297
work_keys_str_mv AT zupanckaja abayesianhierarchicallatenttraitmodelforestimatingraterbiasandreliabilityinlargescaleperformanceassessment
AT strumbeljerik abayesianhierarchicallatenttraitmodelforestimatingraterbiasandreliabilityinlargescaleperformanceassessment
AT zupanckaja bayesianhierarchicallatenttraitmodelforestimatingraterbiasandreliabilityinlargescaleperformanceassessment
AT strumbeljerik bayesianhierarchicallatenttraitmodelforestimatingraterbiasandreliabilityinlargescaleperformanceassessment