Cargando…

Repeatability of radiographic assessments for feline hip dysplasia suggest consensus scores in radiology are more uncertain than commonly assumed

Variation in the diagnostic interpretation of radiographs is a well-recognised problem in human and veterinary medicine. One common solution is to create a ‘consensus’ score based on a majority or unanimous decision from multiple observers. While consensus approaches are generally assumed to improve...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ball, Elisabeth, Uhlhorn, Margareta, Eksell, Per, Olsson, Ulrika, Ohlsson, Åsa, Low, Matthew
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9385612/ https://www.ncbi.nlm.nih.gov/pubmed/35978034 http://dx.doi.org/10.1038/s41598-022-18364-9

_version_	1784769626607976448
author	Ball, Elisabeth Uhlhorn, Margareta Eksell, Per Olsson, Ulrika Ohlsson, Åsa Low, Matthew
author_facet	Ball, Elisabeth Uhlhorn, Margareta Eksell, Per Olsson, Ulrika Ohlsson, Åsa Low, Matthew
author_sort	Ball, Elisabeth
collection	PubMed
description	Variation in the diagnostic interpretation of radiographs is a well-recognised problem in human and veterinary medicine. One common solution is to create a ‘consensus’ score based on a majority or unanimous decision from multiple observers. While consensus approaches are generally assumed to improve diagnostic repeatability, the extent to which consensus scores are themselves repeatable has rarely been examined. Here we use repeated assessments by three radiologists of 196 hip radiographs from 98 cats within a health-screening programme to examine intra-observer, inter-observer, majority-consensus and unanimous-consensus repeatability scores for feline hip dysplasia. In line with other studies, intra-observer and inter-observer repeatability was moderate (63–71%), and related to the reference assessment and time taken to reach a decision. Consensus scores did show reduced variation between assessments compared to individuals, but consensus repeatability was far from perfect. Only 75% of majority consensus scores were in agreement between assessments, and based on Bayesian multinomial modelling we estimate that unanimous consensus scores can have repeatabilities as low as 83%. These results clearly show that consensus scores in radiology can have large uncertainties, and that future studies in both human and veterinary medicine need to include consensus-uncertainty estimates if we are to properly interpret radiological diagnoses and the extent to which consensus scores improve diagnostic accuracy.
format	Online Article Text
id	pubmed-9385612
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-93856122022-08-19 Repeatability of radiographic assessments for feline hip dysplasia suggest consensus scores in radiology are more uncertain than commonly assumed Ball, Elisabeth Uhlhorn, Margareta Eksell, Per Olsson, Ulrika Ohlsson, Åsa Low, Matthew Sci Rep Article Variation in the diagnostic interpretation of radiographs is a well-recognised problem in human and veterinary medicine. One common solution is to create a ‘consensus’ score based on a majority or unanimous decision from multiple observers. While consensus approaches are generally assumed to improve diagnostic repeatability, the extent to which consensus scores are themselves repeatable has rarely been examined. Here we use repeated assessments by three radiologists of 196 hip radiographs from 98 cats within a health-screening programme to examine intra-observer, inter-observer, majority-consensus and unanimous-consensus repeatability scores for feline hip dysplasia. In line with other studies, intra-observer and inter-observer repeatability was moderate (63–71%), and related to the reference assessment and time taken to reach a decision. Consensus scores did show reduced variation between assessments compared to individuals, but consensus repeatability was far from perfect. Only 75% of majority consensus scores were in agreement between assessments, and based on Bayesian multinomial modelling we estimate that unanimous consensus scores can have repeatabilities as low as 83%. These results clearly show that consensus scores in radiology can have large uncertainties, and that future studies in both human and veterinary medicine need to include consensus-uncertainty estimates if we are to properly interpret radiological diagnoses and the extent to which consensus scores improve diagnostic accuracy. Nature Publishing Group UK 2022-08-17 /pmc/articles/PMC9385612/ /pubmed/35978034 http://dx.doi.org/10.1038/s41598-022-18364-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Ball, Elisabeth Uhlhorn, Margareta Eksell, Per Olsson, Ulrika Ohlsson, Åsa Low, Matthew Repeatability of radiographic assessments for feline hip dysplasia suggest consensus scores in radiology are more uncertain than commonly assumed
title	Repeatability of radiographic assessments for feline hip dysplasia suggest consensus scores in radiology are more uncertain than commonly assumed
title_full	Repeatability of radiographic assessments for feline hip dysplasia suggest consensus scores in radiology are more uncertain than commonly assumed
title_fullStr	Repeatability of radiographic assessments for feline hip dysplasia suggest consensus scores in radiology are more uncertain than commonly assumed
title_full_unstemmed	Repeatability of radiographic assessments for feline hip dysplasia suggest consensus scores in radiology are more uncertain than commonly assumed
title_short	Repeatability of radiographic assessments for feline hip dysplasia suggest consensus scores in radiology are more uncertain than commonly assumed
title_sort	repeatability of radiographic assessments for feline hip dysplasia suggest consensus scores in radiology are more uncertain than commonly assumed
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9385612/ https://www.ncbi.nlm.nih.gov/pubmed/35978034 http://dx.doi.org/10.1038/s41598-022-18364-9
work_keys_str_mv	AT ballelisabeth repeatabilityofradiographicassessmentsforfelinehipdysplasiasuggestconsensusscoresinradiologyaremoreuncertainthancommonlyassumed AT uhlhornmargareta repeatabilityofradiographicassessmentsforfelinehipdysplasiasuggestconsensusscoresinradiologyaremoreuncertainthancommonlyassumed AT eksellper repeatabilityofradiographicassessmentsforfelinehipdysplasiasuggestconsensusscoresinradiologyaremoreuncertainthancommonlyassumed AT olssonulrika repeatabilityofradiographicassessmentsforfelinehipdysplasiasuggestconsensusscoresinradiologyaremoreuncertainthancommonlyassumed AT ohlssonasa repeatabilityofradiographicassessmentsforfelinehipdysplasiasuggestconsensusscoresinradiologyaremoreuncertainthancommonlyassumed AT lowmatthew repeatabilityofradiographicassessmentsforfelinehipdysplasiasuggestconsensusscoresinradiologyaremoreuncertainthancommonlyassumed

Repeatability of radiographic assessments for feline hip dysplasia suggest consensus scores in radiology are more uncertain than commonly assumed

Ejemplares similares