Cargando…

A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment

We propose a novel approach to modelling rater effects in scoring-based assessment. The approach is based on a Bayesian hierarchical model and simulations from the posterior distribution. We apply it to large-scale essay assessment data over a period of 5 years. Empirical results suggest that the mo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zupanc, Kaja, Štrumbelj, Erik
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2018
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5882162/ https://www.ncbi.nlm.nih.gov/pubmed/29614129 http://dx.doi.org/10.1371/journal.pone.0195297

Descripción
Sumario:	We propose a novel approach to modelling rater effects in scoring-based assessment. The approach is based on a Bayesian hierarchical model and simulations from the posterior distribution. We apply it to large-scale essay assessment data over a period of 5 years. Empirical results suggest that the model provides a good fit for both the total scores and when applied to individual rubrics. We estimate the median impact of rater effects on the final grade to be ± 2 points on a 50 point scale, while 10% of essays would receive a score at least ± 5 different from their actual quality. Most of the impact is due to rater unreliability, not rater bias.

A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment

Ejemplares similares