Cargando…
A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment
We propose a novel approach to modelling rater effects in scoring-based assessment. The approach is based on a Bayesian hierarchical model and simulations from the posterior distribution. We apply it to large-scale essay assessment data over a period of 5 years. Empirical results suggest that the mo...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5882162/ https://www.ncbi.nlm.nih.gov/pubmed/29614129 http://dx.doi.org/10.1371/journal.pone.0195297 |
_version_ | 1783311422748360704 |
---|---|
author | Zupanc, Kaja Štrumbelj, Erik |
author_facet | Zupanc, Kaja Štrumbelj, Erik |
author_sort | Zupanc, Kaja |
collection | PubMed |
description | We propose a novel approach to modelling rater effects in scoring-based assessment. The approach is based on a Bayesian hierarchical model and simulations from the posterior distribution. We apply it to large-scale essay assessment data over a period of 5 years. Empirical results suggest that the model provides a good fit for both the total scores and when applied to individual rubrics. We estimate the median impact of rater effects on the final grade to be ± 2 points on a 50 point scale, while 10% of essays would receive a score at least ± 5 different from their actual quality. Most of the impact is due to rater unreliability, not rater bias. |
format | Online Article Text |
id | pubmed-5882162 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-58821622018-04-13 A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment Zupanc, Kaja Štrumbelj, Erik PLoS One Research Article We propose a novel approach to modelling rater effects in scoring-based assessment. The approach is based on a Bayesian hierarchical model and simulations from the posterior distribution. We apply it to large-scale essay assessment data over a period of 5 years. Empirical results suggest that the model provides a good fit for both the total scores and when applied to individual rubrics. We estimate the median impact of rater effects on the final grade to be ± 2 points on a 50 point scale, while 10% of essays would receive a score at least ± 5 different from their actual quality. Most of the impact is due to rater unreliability, not rater bias. Public Library of Science 2018-04-03 /pmc/articles/PMC5882162/ /pubmed/29614129 http://dx.doi.org/10.1371/journal.pone.0195297 Text en © 2018 Zupanc, Štrumbelj http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Zupanc, Kaja Štrumbelj, Erik A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment |
title | A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment |
title_full | A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment |
title_fullStr | A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment |
title_full_unstemmed | A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment |
title_short | A Bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment |
title_sort | bayesian hierarchical latent trait model for estimating rater bias and reliability in large-scale performance assessment |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5882162/ https://www.ncbi.nlm.nih.gov/pubmed/29614129 http://dx.doi.org/10.1371/journal.pone.0195297 |
work_keys_str_mv | AT zupanckaja abayesianhierarchicallatenttraitmodelforestimatingraterbiasandreliabilityinlargescaleperformanceassessment AT strumbeljerik abayesianhierarchicallatenttraitmodelforestimatingraterbiasandreliabilityinlargescaleperformanceassessment AT zupanckaja bayesianhierarchicallatenttraitmodelforestimatingraterbiasandreliabilityinlargescaleperformanceassessment AT strumbeljerik bayesianhierarchicallatenttraitmodelforestimatingraterbiasandreliabilityinlargescaleperformanceassessment |