Cargando…

How high is the inter-observer reproducibility in the LIRADS reporting system?

PURPOSE: To investigate the reproducibility of LIRADS v2014 and contribute to its widespread use in clinical practice. MATERIAL AND METHODS: This retrospective, single-centre study was conducted between January 2010 and October 2015. A total of 132 patients who had dynamic magnetic resonance imaging...

Descripción completa

Detalles Bibliográficos
Autores principales: Sevim, Sezgin, Dicle, Oğuz, Gezer, Naciye S., Barış, Mustafa M., Altay, Canan, Akın, Işıl Başara
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Termedia Publishing House 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6964321/
https://www.ncbi.nlm.nih.gov/pubmed/31969967
http://dx.doi.org/10.5114/pjr.2019.90090
Descripción
Sumario:PURPOSE: To investigate the reproducibility of LIRADS v2014 and contribute to its widespread use in clinical practice. MATERIAL AND METHODS: This retrospective, single-centre study was conducted between January 2010 and October 2015. A total of 132 patients who had dynamic magnetic resonance imaging (MRI)/computed tomography (CT) images in the Picture Archiving and Communication Systems (PACS) with liver nodule were included in the study, 37 of whom had histopathology results. Five radiologists who participated in the study, interpreted liver nodules independently on different PACS stations according to the LIRADS reporting system and its main parameters. RESULTS: We determined that level of inter-observer agreement in the LR-1, LR-5, and LR-5V categories was higher than in the LR-2, LR-3, and LR-4 categories (κ = 0.522, 0.442, and 0.600 in the LR-1, LR-5, and LR-5V categories, respectively; κ = 0.082, 0.298, and 0.143 in the LR-2, LR-3, and LR-4 categories, respectively). The parameter that we observed to have the highest level of inter-observer agreement was venous thrombus (κ = 0.600). CONCLUSIONS: Our study showed that LIRADS achieves an acceptable inter-observer reproducibility in terms of clinical practice although it is insufficient at intermediate risk levels. We think that the prevalence of its use will be further increased with training related to the subject and the assignment of numerical values that express the probability of malignancy for each category and including the ancillary features in the algorithm according to clearer rules.