Cargando…

Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System

Consistent annotation of data is a prerequisite for the successful training and testing of artificial intelligence-based decision support systems in radiology. This can be obtained by standardizing terminology when annotating diagnostic images. The purpose of this study was to evaluate the annotatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Dana, Pehrson, Lea Marie, Tøttrup, Lea, Fraccaro, Marco, Bonnevie, Rasmus, Thrane, Jakob, Sørensen, Peter Jagd, Rykkje, Alexander, Andersen, Tobias Thostrup, Steglich-Arnholm, Henrik, Stærk, Dorte Marianne Rohde, Borgwardt, Lotte, Hansen, Kristoffer Lindskov, Darkner, Sune, Carlsen, Jonathan Frederik, Nielsen, Michael Bachmann
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9776917/
https://www.ncbi.nlm.nih.gov/pubmed/36553118
http://dx.doi.org/10.3390/diagnostics12123112
_version_ 1784855976475623424
author Li, Dana
Pehrson, Lea Marie
Tøttrup, Lea
Fraccaro, Marco
Bonnevie, Rasmus
Thrane, Jakob
Sørensen, Peter Jagd
Rykkje, Alexander
Andersen, Tobias Thostrup
Steglich-Arnholm, Henrik
Stærk, Dorte Marianne Rohde
Borgwardt, Lotte
Hansen, Kristoffer Lindskov
Darkner, Sune
Carlsen, Jonathan Frederik
Nielsen, Michael Bachmann
author_facet Li, Dana
Pehrson, Lea Marie
Tøttrup, Lea
Fraccaro, Marco
Bonnevie, Rasmus
Thrane, Jakob
Sørensen, Peter Jagd
Rykkje, Alexander
Andersen, Tobias Thostrup
Steglich-Arnholm, Henrik
Stærk, Dorte Marianne Rohde
Borgwardt, Lotte
Hansen, Kristoffer Lindskov
Darkner, Sune
Carlsen, Jonathan Frederik
Nielsen, Michael Bachmann
author_sort Li, Dana
collection PubMed
description Consistent annotation of data is a prerequisite for the successful training and testing of artificial intelligence-based decision support systems in radiology. This can be obtained by standardizing terminology when annotating diagnostic images. The purpose of this study was to evaluate the annotation consistency among radiologists when using a novel diagnostic labeling scheme for chest X-rays. Six radiologists with experience ranging from one to sixteen years, annotated a set of 100 fully anonymized chest X-rays. The blinded radiologists annotated on two separate occasions. Statistical analyses were done using Randolph’s kappa and PABAK, and the proportions of specific agreements were calculated. Fair-to-excellent agreement was found for all labels among the annotators (Randolph’s Kappa, 0.40–0.99). The PABAK ranged from 0.12 to 1 for the two-reader inter-rater agreement and 0.26 to 1 for the intra-rater agreement. Descriptive and broad labels achieved the highest proportion of positive agreement in both the inter- and intra-reader analyses. Annotating findings with specific, interpretive labels were found to be difficult for less experienced radiologists. Annotating images with descriptive labels may increase agreement between radiologists with different experience levels compared to annotation with interpretive labels.
format Online
Article
Text
id pubmed-9776917
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-97769172022-12-23 Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System Li, Dana Pehrson, Lea Marie Tøttrup, Lea Fraccaro, Marco Bonnevie, Rasmus Thrane, Jakob Sørensen, Peter Jagd Rykkje, Alexander Andersen, Tobias Thostrup Steglich-Arnholm, Henrik Stærk, Dorte Marianne Rohde Borgwardt, Lotte Hansen, Kristoffer Lindskov Darkner, Sune Carlsen, Jonathan Frederik Nielsen, Michael Bachmann Diagnostics (Basel) Article Consistent annotation of data is a prerequisite for the successful training and testing of artificial intelligence-based decision support systems in radiology. This can be obtained by standardizing terminology when annotating diagnostic images. The purpose of this study was to evaluate the annotation consistency among radiologists when using a novel diagnostic labeling scheme for chest X-rays. Six radiologists with experience ranging from one to sixteen years, annotated a set of 100 fully anonymized chest X-rays. The blinded radiologists annotated on two separate occasions. Statistical analyses were done using Randolph’s kappa and PABAK, and the proportions of specific agreements were calculated. Fair-to-excellent agreement was found for all labels among the annotators (Randolph’s Kappa, 0.40–0.99). The PABAK ranged from 0.12 to 1 for the two-reader inter-rater agreement and 0.26 to 1 for the intra-rater agreement. Descriptive and broad labels achieved the highest proportion of positive agreement in both the inter- and intra-reader analyses. Annotating findings with specific, interpretive labels were found to be difficult for less experienced radiologists. Annotating images with descriptive labels may increase agreement between radiologists with different experience levels compared to annotation with interpretive labels. MDPI 2022-12-09 /pmc/articles/PMC9776917/ /pubmed/36553118 http://dx.doi.org/10.3390/diagnostics12123112 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Dana
Pehrson, Lea Marie
Tøttrup, Lea
Fraccaro, Marco
Bonnevie, Rasmus
Thrane, Jakob
Sørensen, Peter Jagd
Rykkje, Alexander
Andersen, Tobias Thostrup
Steglich-Arnholm, Henrik
Stærk, Dorte Marianne Rohde
Borgwardt, Lotte
Hansen, Kristoffer Lindskov
Darkner, Sune
Carlsen, Jonathan Frederik
Nielsen, Michael Bachmann
Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System
title Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System
title_full Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System
title_fullStr Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System
title_full_unstemmed Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System
title_short Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System
title_sort inter- and intra-observer agreement when using a diagnostic labeling scheme for annotating findings on chest x-rays—an early step in the development of a deep learning-based decision support system
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9776917/
https://www.ncbi.nlm.nih.gov/pubmed/36553118
http://dx.doi.org/10.3390/diagnostics12123112
work_keys_str_mv AT lidana interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT pehrsonleamarie interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT tøttruplea interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT fraccaromarco interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT bonnevierasmus interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT thranejakob interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT sørensenpeterjagd interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT rykkjealexander interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT andersentobiasthostrup interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT steglicharnholmhenrik interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT stærkdortemariannerohde interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT borgwardtlotte interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT hansenkristofferlindskov interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT darknersune interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT carlsenjonathanfrederik interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem
AT nielsenmichaelbachmann interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem