Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy
PURPOSE: The Objective Structured Clinical Examination (OSCE) is a widely employed tool for measuring clinical competence. In the drive toward comprehensive assessment, OSCE stations and checklists may become increasingly complex. The objective of this study was to probe inter-observer reliability a...
Main authors: | Hurley, Katrina F.; Giffin, Nick A.; Stewart, Samuel A.; Bullock, Graham B. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Co-Action Publishing 2015 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4613902/ https://www.ncbi.nlm.nih.gov/pubmed/26490948 http://dx.doi.org/10.3402/meo.v20.29242 |
_version_ | 1782396336894640128 |
---|---|
author | Hurley, Katrina F. Giffin, Nick A. Stewart, Samuel A. Bullock, Graham B. |
author_facet | Hurley, Katrina F. Giffin, Nick A. Stewart, Samuel A. Bullock, Graham B. |
author_sort | Hurley, Katrina F. |
collection | PubMed |
description | PURPOSE: The Objective Structured Clinical Examination (OSCE) is a widely employed tool for measuring clinical competence. In the drive toward comprehensive assessment, OSCE stations and checklists may become increasingly complex. The objective of this study was to probe inter-observer reliability and observer accuracy as a function of OSCE checklist length. METHOD: Study participants included emergency physicians and senior residents in Emergency Medicine at Dalhousie University. Participants watched an identical series of four scripted, standardized videos enacting 10-min OSCE stations and completed the corresponding assessment checklists. Each participating observer was provided with a random combination of two 40-item and two 20-item checklists. A panel of physicians scored the scenarios through repeated video review to determine the ‘gold standard’ checklist scores. RESULTS: Fifty-seven observers completed 228 assessment checklists. Mean observer accuracy ranged from 73 to 93% (14.6–18.7/20), with an overall accuracy of 86% (17.2/20) and an inter-rater reliability range of 58–78%. After controlling for station and individual variation, the number of checklist items had no observed effect on overall accuracy (p=0.2305). Consistency in ratings was calculated using the intraclass correlation coefficient and showed no significant difference between the 20- and 40-item checklists (coefficients ranged from 0.432 to 0.781; p-values from 0.56 to 0.73). CONCLUSIONS: The addition of 20 checklist items to a core list of 20 items in an OSCE assessment checklist does not appear to impact observer accuracy or inter-rater reliability. |
format | Online Article Text |
id | pubmed-4613902 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Co-Action Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-4613902 2015-11-23 Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy Hurley, Katrina F. Giffin, Nick A. Stewart, Samuel A. Bullock, Graham B. Med Educ Online Research Article Co-Action Publishing 2015-10-20 /pmc/articles/PMC4613902/ /pubmed/26490948 http://dx.doi.org/10.3402/meo.v20.29242 Text en © 2015 Katrina F. Hurley et al. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 International License, allowing third parties to copy and redistribute the material in any medium or format and to remix, transform, and build upon the material for any purpose, even commercially, provided the original work is properly cited and states its license. |
spellingShingle | Research Article Hurley, Katrina F. Giffin, Nick A. Stewart, Samuel A. Bullock, Graham B. Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy |
title | Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4613902/ https://www.ncbi.nlm.nih.gov/pubmed/26490948 http://dx.doi.org/10.3402/meo.v20.29242 |
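
The abstract reports observer accuracy as a score out of 20 against a 'gold standard' checklist (for example, 17.2/20 = 86%). The record does not give the authors' scoring procedure, so the following is only a minimal sketch under a stated assumption: accuracy is taken to be the fraction of checklist items on which an observer's mark agrees with the gold standard. The function name `observer_accuracy` and the sample checklist data are hypothetical; only the figures 57, 4, 228, and 17.2/20 come from the abstract.

```python
from typing import Sequence


def observer_accuracy(observed: Sequence[bool], gold: Sequence[bool]) -> float:
    """Return the fraction of items where the observer agrees with the gold standard.

    Assumption (not stated in the record): each checklist item is a yes/no mark,
    and accuracy is simple item-level agreement.
    """
    if len(observed) != len(gold):
        raise ValueError("checklists must have the same number of items")
    if not gold:
        raise ValueError("checklist is empty")
    matches = sum(o == g for o, g in zip(observed, gold))
    return matches / len(gold)


# Hypothetical 20-item gold-standard checklist: 12 items done, 8 not done.
gold = [True] * 12 + [False] * 8

# Hypothetical observer who disagrees with the gold standard on three items.
observed = list(gold)
observed[0] = False
observed[5] = False
observed[15] = True

accuracy = observer_accuracy(observed, gold)  # 17 of 20 items agree

# Arithmetic taken from the abstract: 57 observers each scored 4 videos,
# and the overall mean score was 17.2 out of 20.
total_checklists = 57 * 4
overall_fraction = 17.2 / 20

print(f"hypothetical observer accuracy: {accuracy:.0%}")
print(f"total checklists: {total_checklists}")
print(f"overall accuracy from abstract: {overall_fraction:.0%}")
```

Item-level agreement is the simplest reading of "accuracy" here. The intraclass correlation coefficients mentioned in the abstract measure consistency between observers and would need a separate calculation.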