Cargando…
Interobserver variability studies in diagnostic imaging: a methodological systematic review
OBJECTIVES: To review the methodology of interobserver variability studies; including current practice and quality of conducting and reporting studies. METHODS: Interobserver variability studies between January 2019 and January 2020 were included; extracted data comprised of study characteristics, p...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The British Institute of Radiology.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10392644/ https://www.ncbi.nlm.nih.gov/pubmed/37399082 http://dx.doi.org/10.1259/bjr.20220972 |
_version_ | 1785083009696792576 |
---|---|
author | Quinn, Laura Tryposkiadis, Konstantinos Deeks, Jon De Vet, Henrica C.W. Mallett, Sue Mokkink, Lidwine B. Takwoingi, Yemisi Taylor-Phillips, Sian Sitch, Alice |
author_facet | Quinn, Laura Tryposkiadis, Konstantinos Deeks, Jon De Vet, Henrica C.W. Mallett, Sue Mokkink, Lidwine B. Takwoingi, Yemisi Taylor-Phillips, Sian Sitch, Alice |
author_sort | Quinn, Laura |
collection | PubMed |
description | OBJECTIVES: To review the methodology of interobserver variability studies; including current practice and quality of conducting and reporting studies. METHODS: Interobserver variability studies between January 2019 and January 2020 were included; extracted data comprised of study characteristics, populations, variability measures, key results, and conclusions. Risk of bias was assessed using the COSMIN tool for assessing reliability and measurement error. RESULTS: Seventy-nine full-text studies were included covering various imaging tests and clinical areas. The median number of patients was 47 (IQR:23–88), and observers were 4 (IQR:2–7), with sample size justified in 12 (15%) studies. Most studies used static images (n = 75, 95%), where all observers interpreted images for all patients (n = 67, 85%). Intraclass correlation coefficients (ICC) (n = 41, 52%), Kappa (κ) statistics (n = 31, 39%) and percentage agreement (n = 15, 19%) were most commonly used. Interpretation of variability estimates often did not correspond with study conclusions. The COSMIN risk of bias tool gave a very good/adequate rating for 52 studies (66%) including any studies that used variability measures listed in the tool. For studies using static images, some study design standards were not applicable and did not contribute to the overall rating. CONCLUSIONS: Interobserver variability studies have diverse study designs and methods, the impact of which requires further evaluation. Sample size for patients and observers was often small without justification. Most studies report ICC and κ values, which did not always coincide with the study conclusion. High ratings were assigned to many studies using the COSMIN risk of bias tool, with certain standards scored ‘not applicable’ when static images were used. ADVANCES IN KNOWLEDGE: The sample size for both patients and observers was often small without justification. For most studies, observers interpreted static images and did not evaluate the process of acquiring the imaging test, meaning it was not possible to assess many COSMIN risk of bias standards for studies with this design. Most studies reported intraclass correlation coefficient and κ statistics; study conclusions often did not correspond with results. |
format | Online Article Text |
id | pubmed-10392644 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | The British Institute of Radiology. |
record_format | MEDLINE/PubMed |
spelling | pubmed-103926442023-08-02 Interobserver variability studies in diagnostic imaging: a methodological systematic review Quinn, Laura Tryposkiadis, Konstantinos Deeks, Jon De Vet, Henrica C.W. Mallett, Sue Mokkink, Lidwine B. Takwoingi, Yemisi Taylor-Phillips, Sian Sitch, Alice Br J Radiol Systematic Review OBJECTIVES: To review the methodology of interobserver variability studies; including current practice and quality of conducting and reporting studies. METHODS: Interobserver variability studies between January 2019 and January 2020 were included; extracted data comprised of study characteristics, populations, variability measures, key results, and conclusions. Risk of bias was assessed using the COSMIN tool for assessing reliability and measurement error. RESULTS: Seventy-nine full-text studies were included covering various imaging tests and clinical areas. The median number of patients was 47 (IQR:23–88), and observers were 4 (IQR:2–7), with sample size justified in 12 (15%) studies. Most studies used static images (n = 75, 95%), where all observers interpreted images for all patients (n = 67, 85%). Intraclass correlation coefficients (ICC) (n = 41, 52%), Kappa (κ) statistics (n = 31, 39%) and percentage agreement (n = 15, 19%) were most commonly used. Interpretation of variability estimates often did not correspond with study conclusions. The COSMIN risk of bias tool gave a very good/adequate rating for 52 studies (66%) including any studies that used variability measures listed in the tool. For studies using static images, some study design standards were not applicable and did not contribute to the overall rating. CONCLUSIONS: Interobserver variability studies have diverse study designs and methods, the impact of which requires further evaluation. Sample size for patients and observers was often small without justification. Most studies report ICC and κ values, which did not always coincide with the study conclusion. High ratings were assigned to many studies using the COSMIN risk of bias tool, with certain standards scored ‘not applicable’ when static images were used. ADVANCES IN KNOWLEDGE: The sample size for both patients and observers was often small without justification. For most studies, observers interpreted static images and did not evaluate the process of acquiring the imaging test, meaning it was not possible to assess many COSMIN risk of bias standards for studies with this design. Most studies reported intraclass correlation coefficient and κ statistics; study conclusions often did not correspond with results. The British Institute of Radiology. 2023-08 2023-06-29 /pmc/articles/PMC10392644/ /pubmed/37399082 http://dx.doi.org/10.1259/bjr.20220972 Text en © 2023 The Authors. Published by the British Institute of Radiology https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 Unported License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Systematic Review Quinn, Laura Tryposkiadis, Konstantinos Deeks, Jon De Vet, Henrica C.W. Mallett, Sue Mokkink, Lidwine B. Takwoingi, Yemisi Taylor-Phillips, Sian Sitch, Alice Interobserver variability studies in diagnostic imaging: a methodological systematic review |
title | Interobserver variability studies in diagnostic imaging: a methodological systematic review |
title_full | Interobserver variability studies in diagnostic imaging: a methodological systematic review |
title_fullStr | Interobserver variability studies in diagnostic imaging: a methodological systematic review |
title_full_unstemmed | Interobserver variability studies in diagnostic imaging: a methodological systematic review |
title_short | Interobserver variability studies in diagnostic imaging: a methodological systematic review |
title_sort | interobserver variability studies in diagnostic imaging: a methodological systematic review |
topic | Systematic Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10392644/ https://www.ncbi.nlm.nih.gov/pubmed/37399082 http://dx.doi.org/10.1259/bjr.20220972 |
work_keys_str_mv | AT quinnlaura interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview AT tryposkiadiskonstantinos interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview AT deeksjon interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview AT devethenricacw interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview AT mallettsue interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview AT mokkinklidwineb interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview AT takwoingiyemisi interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview AT taylorphillipssian interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview AT sitchalice interobservervariabilitystudiesindiagnosticimagingamethodologicalsystematicreview |