Cargando…

Interobserver variability studies in diagnostic imaging: a methodological systematic review

OBJECTIVES: To review the methodology of interobserver variability studies; including current practice and quality of conducting and reporting studies. METHODS: Interobserver variability studies between January 2019 and January 2020 were included; extracted data comprised of study characteristics, p...

Descripción completa

Detalles Bibliográficos
Autores principales: Quinn, Laura, Tryposkiadis, Konstantinos, Deeks, Jon, De Vet, Henrica C.W., Mallett, Sue, Mokkink, Lidwine B., Takwoingi, Yemisi, Taylor-Phillips, Sian, Sitch, Alice
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The British Institute of Radiology. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10392644/
https://www.ncbi.nlm.nih.gov/pubmed/37399082
http://dx.doi.org/10.1259/bjr.20220972
_version_ 1785083009696792576
author Quinn, Laura
Tryposkiadis, Konstantinos
Deeks, Jon
De Vet, Henrica C.W.
Mallett, Sue
Mokkink, Lidwine B.
Takwoingi, Yemisi
Taylor-Phillips, Sian
Sitch, Alice
author_facet Quinn, Laura
Tryposkiadis, Konstantinos
Deeks, Jon
De Vet, Henrica C.W.
Mallett, Sue
Mokkink, Lidwine B.
Takwoingi, Yemisi
Taylor-Phillips, Sian
Sitch, Alice
author_sort Quinn, Laura
collection PubMed
description OBJECTIVES: To review the methodology of interobserver variability studies; including current practice and quality of conducting and reporting studies. METHODS: Interobserver variability studies between January 2019 and January 2020 were included; extracted data comprised of study characteristics, populations, variability measures, key results, and conclusions. Risk of bias was assessed using the COSMIN tool for assessing reliability and measurement error. RESULTS: Seventy-nine full-text studies were included covering various imaging tests and clinical areas. The median number of patients was 47 (IQR:23–88), and observers were 4 (IQR:2–7), with sample size justified in 12 (15%) studies. Most studies used static images (n = 75, 95%), where all observers interpreted images for all patients (n = 67, 85%). Intraclass correlation coefficients (ICC) (n = 41, 52%), Kappa (κ) statistics (n = 31, 39%) and percentage agreement (n = 15, 19%) were most commonly used. Interpretation of variability estimates often did not correspond with study conclusions. The COSMIN risk of bias tool gave a very good/adequate rating for 52 studies (66%) including any studies that used variability measures listed in the tool. For studies using static images, some study design standards were not applicable and did not contribute to the overall rating. CONCLUSIONS: Interobserver variability studies have diverse study designs and methods, the impact of which requires further evaluation. Sample size for patients and observers was often small without justification. Most studies report ICC and κ values, which did not always coincide with the study conclusion. High ratings were assigned to many studies using the COSMIN risk of bias tool, with certain standards scored ‘not applicable’ when static images were used. ADVANCES IN KNOWLEDGE: The sample size for both patients and observers was often small without justification. For most studies, observers interpreted static images and did not evaluate the process of acquiring the imaging test, meaning it was not possible to assess many COSMIN risk of bias standards for studies with this design. Most studies reported intraclass correlation coefficient and κ statistics; study conclusions often did not correspond with results.
format Online
Article
Text
id pubmed-10392644
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The British Institute of Radiology.
record_format MEDLINE/PubMed
spelling pubmed-103926442023-08-02 Interobserver variability studies in diagnostic imaging: a methodological systematic review Quinn, Laura Tryposkiadis, Konstantinos Deeks, Jon De Vet, Henrica C.W. Mallett, Sue Mokkink, Lidwine B. Takwoingi, Yemisi Taylor-Phillips, Sian Sitch, Alice Br J Radiol Systematic Review OBJECTIVES: To review the methodology of interobserver variability studies; including current practice and quality of conducting and reporting studies. METHODS: Interobserver variability studies between January 2019 and January 2020 were included; extracted data comprised of study characteristics, populations, variability measures, key results, and conclusions. Risk of bias was assessed using the COSMIN tool for assessing reliability and measurement error. RESULTS: Seventy-nine full-text studies were included covering various imaging tests and clinical areas. The median number of patients was 47 (IQR:23–88), and observers were 4 (IQR:2–7), with sample size justified in 12 (15%) studies. Most studies used static images (n = 75, 95%), where all observers interpreted images for all patients (n = 67, 85%). Intraclass correlation coefficients (ICC) (n = 41, 52%), Kappa (κ) statistics (n = 31, 39%) and percentage agreement (n = 15, 19%) were most commonly used. Interpretation of variability estimates often did not correspond with study conclusions. The COSMIN risk of bias tool gave a very good/adequate rating for 52 studies (66%) including any studies that used variability measures listed in the tool. For studies using static images, some study design standards were not applicable and did not contribute to the overall rating. CONCLUSIONS: Interobserver variability studies have diverse study designs and methods, the impact of which requires further evaluation. Sample size for patients and observers was often small without justification. Most studies report ICC and κ values, which did not always coincide with the study conclusion. High ratings were assigned to many studies using the COSMIN risk of bias tool, with certain standards scored ‘not applicable’ when static images were used. ADVANCES IN KNOWLEDGE: The sample size for both patients and observers was often small without justification. For most studies, observers interpreted static images and did not evaluate the process of acquiring the imaging test, meaning it was not possible to assess many COSMIN risk of bias standards for studies with this design. Most studies reported intraclass correlation coefficient and κ statistics; study conclusions often did not correspond with results. The British Institute of Radiology. 2023-08 2023-06-29 /pmc/articles/PMC10392644/ /pubmed/37399082 http://dx.doi.org/10.1259/bjr.20220972 Text en © 2023 The Authors. Published by the British Institute of Radiology https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 Unported License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution and reproduction in any medium, provided the original author and source are credited.
spellingShingle Systematic Review
Quinn, Laura
Tryposkiadis, Konstantinos
Deeks, Jon
De Vet, Henrica C.W.
Mallett, Sue
Mokkink, Lidwine B.
Takwoingi, Yemisi
Taylor-Phillips, Sian
Sitch, Alice
Interobserver variability studies in diagnostic imaging: a methodological systematic review
title Interobserver variability studies in diagnostic imaging: a methodological systematic review
title_full Interobserver variability studies in diagnostic imaging: a methodological systematic review
title_fullStr Interobserver variability studies in diagnostic imaging: a methodological systematic review
title_full_unstemmed Interobserver variability studies in diagnostic imaging: a methodological systematic review
title_short Interobserver variability studies in diagnostic imaging: a methodological systematic review
title_sort interobserver variability studies in diagnostic imaging: a methodological systematic review
topic Systematic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10392644/
https://www.ncbi.nlm.nih.gov/pubmed/37399082
http://dx.doi.org/10.1259/bjr.20220972
work_keys_str_mv AT quinnlaura interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview
AT tryposkiadiskonstantinos interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview
AT deeksjon interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview
AT devethenricacw interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview
AT mallettsue interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview
AT mokkinklidwineb interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview
AT takwoingiyemisi interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview
AT taylorphillipssian interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview
AT sitchalice interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview