Cargando…

Inter-rater and test–retest reliability of quality assessments by novice student raters using the Jadad and Newcastle–Ottawa Scales

INTRODUCTION: Quality assessment of included studies is an important component of systematic reviews. OBJECTIVE: The authors investigated inter-rater and test–retest reliability for quality assessments conducted by inexperienced student raters. DESIGN: Student raters received a training session on q...

Descripción completa

Detalles Bibliográficos
Autores principales:	Oremus, Mark, Oremus, Carolina, Hall, Geoffrey B C, McKinnon, Margaret C
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BMJ Group 2012
Materias:	Evidence Based Practice
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4400798/ https://www.ncbi.nlm.nih.gov/pubmed/22855629 http://dx.doi.org/10.1136/bmjopen-2012-001368

_version_	1782367075005628416
author	Oremus, Mark Oremus, Carolina Hall, Geoffrey B C McKinnon, Margaret C
author_facet	Oremus, Mark Oremus, Carolina Hall, Geoffrey B C McKinnon, Margaret C
author_sort	Oremus, Mark
collection	PubMed
description	INTRODUCTION: Quality assessment of included studies is an important component of systematic reviews. OBJECTIVE: The authors investigated inter-rater and test–retest reliability for quality assessments conducted by inexperienced student raters. DESIGN: Student raters received a training session on quality assessment using the Jadad Scale for randomised controlled trials and the Newcastle–Ottawa Scale (NOS) for observational studies. Raters were randomly assigned into five pairs and they each independently rated the quality of 13–20 articles. These articles were drawn from a pool of 78 papers examining cognitive impairment following electroconvulsive therapy to treat major depressive disorder. The articles were randomly distributed to the raters. Two months later, each rater re-assessed the quality of half of their assigned articles. SETTING: McMaster Integrative Neuroscience Discovery and Study Program. PARTICIPANTS: 10 students taking McMaster Integrative Neuroscience Discovery and Study Program courses. MAIN OUTCOME MEASURES: The authors measured inter-rater reliability using κ and the intraclass correlation coefficient type 2,1 or ICC(2,1). The authors measured test–retest reliability using ICC(2,1). RESULTS: Inter-rater reliability varied by scale question. For the six-item Jadad Scale, question-specific κs ranged from 0.13 (95% CI −0.11 to 0.37) to 0.56 (95% CI 0.29 to 0.83). The ranges were −0.14 (95% CI −0.28 to 0.00) to 0.39 (95% CI −0.02 to 0.81) for the NOS cohort and −0.20 (95% CI −0.49 to 0.09) to 1.00 (95% CI 1.00 to 1.00) for the NOS case–control. For overall scores on the six-item Jadad Scale, ICC(2,1)s for inter-rater and test–retest reliability (accounting for systematic differences between raters) were 0.32 (95% CI 0.08 to 0.52) and 0.55 (95% CI 0.41 to 0.67), respectively. Corresponding ICC(2,1)s for the NOS cohort were −0.19 (95% CI −0.67 to 0.35) and 0.62 (95% CI 0.25 to 0.83), and for the NOS case–control, the ICC(2,1)s were 0.46 (95% CI −0.13 to 0.92) and 0.83 (95% CI 0.48 to 0.95). CONCLUSIONS: Inter-rater reliability was generally poor to fair and test–retest reliability was fair to excellent. A pilot rating phase following rater training may be one way to improve agreement.
format	Online Article Text
id	pubmed-4400798
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	BMJ Group
record_format	MEDLINE/PubMed
spelling	pubmed-44007982015-04-22 Inter-rater and test–retest reliability of quality assessments by novice student raters using the Jadad and Newcastle–Ottawa Scales Oremus, Mark Oremus, Carolina Hall, Geoffrey B C McKinnon, Margaret C BMJ Open Evidence Based Practice INTRODUCTION: Quality assessment of included studies is an important component of systematic reviews. OBJECTIVE: The authors investigated inter-rater and test–retest reliability for quality assessments conducted by inexperienced student raters. DESIGN: Student raters received a training session on quality assessment using the Jadad Scale for randomised controlled trials and the Newcastle–Ottawa Scale (NOS) for observational studies. Raters were randomly assigned into five pairs and they each independently rated the quality of 13–20 articles. These articles were drawn from a pool of 78 papers examining cognitive impairment following electroconvulsive therapy to treat major depressive disorder. The articles were randomly distributed to the raters. Two months later, each rater re-assessed the quality of half of their assigned articles. SETTING: McMaster Integrative Neuroscience Discovery and Study Program. PARTICIPANTS: 10 students taking McMaster Integrative Neuroscience Discovery and Study Program courses. MAIN OUTCOME MEASURES: The authors measured inter-rater reliability using κ and the intraclass correlation coefficient type 2,1 or ICC(2,1). The authors measured test–retest reliability using ICC(2,1). RESULTS: Inter-rater reliability varied by scale question. For the six-item Jadad Scale, question-specific κs ranged from 0.13 (95% CI −0.11 to 0.37) to 0.56 (95% CI 0.29 to 0.83). The ranges were −0.14 (95% CI −0.28 to 0.00) to 0.39 (95% CI −0.02 to 0.81) for the NOS cohort and −0.20 (95% CI −0.49 to 0.09) to 1.00 (95% CI 1.00 to 1.00) for the NOS case–control. For overall scores on the six-item Jadad Scale, ICC(2,1)s for inter-rater and test–retest reliability (accounting for systematic differences between raters) were 0.32 (95% CI 0.08 to 0.52) and 0.55 (95% CI 0.41 to 0.67), respectively. Corresponding ICC(2,1)s for the NOS cohort were −0.19 (95% CI −0.67 to 0.35) and 0.62 (95% CI 0.25 to 0.83), and for the NOS case–control, the ICC(2,1)s were 0.46 (95% CI −0.13 to 0.92) and 0.83 (95% CI 0.48 to 0.95). CONCLUSIONS: Inter-rater reliability was generally poor to fair and test–retest reliability was fair to excellent. A pilot rating phase following rater training may be one way to improve agreement. BMJ Group 2012-07-31 /pmc/articles/PMC4400798/ /pubmed/22855629 http://dx.doi.org/10.1136/bmjopen-2012-001368 Text en Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions This is an open-access article distributed under the terms of the Creative Commons Attribution Non-commercial License, which permits use, distribution, and reproduction in any medium, provided the original work is properly cited, the use is non commercial and is otherwise in compliance with the license. See: http://creativecommons.org/licenses/by-nc/2.0/ and http://creativecommons.org/licenses/by-nc/2.0/legalcode.
spellingShingle	Evidence Based Practice Oremus, Mark Oremus, Carolina Hall, Geoffrey B C McKinnon, Margaret C Inter-rater and test–retest reliability of quality assessments by novice student raters using the Jadad and Newcastle–Ottawa Scales
title	Inter-rater and test–retest reliability of quality assessments by novice student raters using the Jadad and Newcastle–Ottawa Scales
title_full	Inter-rater and test–retest reliability of quality assessments by novice student raters using the Jadad and Newcastle–Ottawa Scales
title_fullStr	Inter-rater and test–retest reliability of quality assessments by novice student raters using the Jadad and Newcastle–Ottawa Scales
title_full_unstemmed	Inter-rater and test–retest reliability of quality assessments by novice student raters using the Jadad and Newcastle–Ottawa Scales
title_short	Inter-rater and test–retest reliability of quality assessments by novice student raters using the Jadad and Newcastle–Ottawa Scales
title_sort	inter-rater and test–retest reliability of quality assessments by novice student raters using the jadad and newcastle–ottawa scales
topic	Evidence Based Practice
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4400798/ https://www.ncbi.nlm.nih.gov/pubmed/22855629 http://dx.doi.org/10.1136/bmjopen-2012-001368
work_keys_str_mv	AT oremusmark interraterandtestretestreliabilityofqualityassessmentsbynovicestudentratersusingthejadadandnewcastleottawascales AT oremuscarolina interraterandtestretestreliabilityofqualityassessmentsbynovicestudentratersusingthejadadandnewcastleottawascales AT hallgeoffreybc interraterandtestretestreliabilityofqualityassessmentsbynovicestudentratersusingthejadadandnewcastleottawascales AT mckinnonmargaretc interraterandtestretestreliabilityofqualityassessmentsbynovicestudentratersusingthejadadandnewcastleottawascales AT interraterandtestretestreliabilityofqualityassessmentsbynovicestudentratersusingthejadadandnewcastleottawascales

Inter-rater and test–retest reliability of quality assessments by novice student raters using the Jadad and Newcastle–Ottawa Scales

Ejemplares similares