Cargando…
Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System
Consistent annotation of data is a prerequisite for the successful training and testing of artificial intelligence-based decision support systems in radiology. This can be obtained by standardizing terminology when annotating diagnostic images. The purpose of this study was to evaluate the annotatio...
Autores principales: | , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9776917/ https://www.ncbi.nlm.nih.gov/pubmed/36553118 http://dx.doi.org/10.3390/diagnostics12123112 |
_version_ | 1784855976475623424 |
---|---|
author | Li, Dana Pehrson, Lea Marie Tøttrup, Lea Fraccaro, Marco Bonnevie, Rasmus Thrane, Jakob Sørensen, Peter Jagd Rykkje, Alexander Andersen, Tobias Thostrup Steglich-Arnholm, Henrik Stærk, Dorte Marianne Rohde Borgwardt, Lotte Hansen, Kristoffer Lindskov Darkner, Sune Carlsen, Jonathan Frederik Nielsen, Michael Bachmann |
author_facet | Li, Dana Pehrson, Lea Marie Tøttrup, Lea Fraccaro, Marco Bonnevie, Rasmus Thrane, Jakob Sørensen, Peter Jagd Rykkje, Alexander Andersen, Tobias Thostrup Steglich-Arnholm, Henrik Stærk, Dorte Marianne Rohde Borgwardt, Lotte Hansen, Kristoffer Lindskov Darkner, Sune Carlsen, Jonathan Frederik Nielsen, Michael Bachmann |
author_sort | Li, Dana |
collection | PubMed |
description | Consistent annotation of data is a prerequisite for the successful training and testing of artificial intelligence-based decision support systems in radiology. This can be obtained by standardizing terminology when annotating diagnostic images. The purpose of this study was to evaluate the annotation consistency among radiologists when using a novel diagnostic labeling scheme for chest X-rays. Six radiologists with experience ranging from one to sixteen years, annotated a set of 100 fully anonymized chest X-rays. The blinded radiologists annotated on two separate occasions. Statistical analyses were done using Randolph’s kappa and PABAK, and the proportions of specific agreements were calculated. Fair-to-excellent agreement was found for all labels among the annotators (Randolph’s Kappa, 0.40–0.99). The PABAK ranged from 0.12 to 1 for the two-reader inter-rater agreement and 0.26 to 1 for the intra-rater agreement. Descriptive and broad labels achieved the highest proportion of positive agreement in both the inter- and intra-reader analyses. Annotating findings with specific, interpretive labels were found to be difficult for less experienced radiologists. Annotating images with descriptive labels may increase agreement between radiologists with different experience levels compared to annotation with interpretive labels. |
format | Online Article Text |
id | pubmed-9776917 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-97769172022-12-23 Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System Li, Dana Pehrson, Lea Marie Tøttrup, Lea Fraccaro, Marco Bonnevie, Rasmus Thrane, Jakob Sørensen, Peter Jagd Rykkje, Alexander Andersen, Tobias Thostrup Steglich-Arnholm, Henrik Stærk, Dorte Marianne Rohde Borgwardt, Lotte Hansen, Kristoffer Lindskov Darkner, Sune Carlsen, Jonathan Frederik Nielsen, Michael Bachmann Diagnostics (Basel) Article Consistent annotation of data is a prerequisite for the successful training and testing of artificial intelligence-based decision support systems in radiology. This can be obtained by standardizing terminology when annotating diagnostic images. The purpose of this study was to evaluate the annotation consistency among radiologists when using a novel diagnostic labeling scheme for chest X-rays. Six radiologists with experience ranging from one to sixteen years, annotated a set of 100 fully anonymized chest X-rays. The blinded radiologists annotated on two separate occasions. Statistical analyses were done using Randolph’s kappa and PABAK, and the proportions of specific agreements were calculated. Fair-to-excellent agreement was found for all labels among the annotators (Randolph’s Kappa, 0.40–0.99). The PABAK ranged from 0.12 to 1 for the two-reader inter-rater agreement and 0.26 to 1 for the intra-rater agreement. Descriptive and broad labels achieved the highest proportion of positive agreement in both the inter- and intra-reader analyses. Annotating findings with specific, interpretive labels were found to be difficult for less experienced radiologists. Annotating images with descriptive labels may increase agreement between radiologists with different experience levels compared to annotation with interpretive labels. MDPI 2022-12-09 /pmc/articles/PMC9776917/ /pubmed/36553118 http://dx.doi.org/10.3390/diagnostics12123112 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Li, Dana Pehrson, Lea Marie Tøttrup, Lea Fraccaro, Marco Bonnevie, Rasmus Thrane, Jakob Sørensen, Peter Jagd Rykkje, Alexander Andersen, Tobias Thostrup Steglich-Arnholm, Henrik Stærk, Dorte Marianne Rohde Borgwardt, Lotte Hansen, Kristoffer Lindskov Darkner, Sune Carlsen, Jonathan Frederik Nielsen, Michael Bachmann Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System |
title | Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System |
title_full | Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System |
title_fullStr | Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System |
title_full_unstemmed | Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System |
title_short | Inter- and Intra-Observer Agreement When Using a Diagnostic Labeling Scheme for Annotating Findings on Chest X-rays—An Early Step in the Development of a Deep Learning-Based Decision Support System |
title_sort | inter- and intra-observer agreement when using a diagnostic labeling scheme for annotating findings on chest x-rays—an early step in the development of a deep learning-based decision support system |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9776917/ https://www.ncbi.nlm.nih.gov/pubmed/36553118 http://dx.doi.org/10.3390/diagnostics12123112 |
work_keys_str_mv | AT lidana interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT pehrsonleamarie interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT tøttruplea interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT fraccaromarco interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT bonnevierasmus interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT thranejakob interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT sørensenpeterjagd interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT rykkjealexander interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT andersentobiasthostrup interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT steglicharnholmhenrik interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT stærkdortemariannerohde interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT borgwardtlotte interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT hansenkristofferlindskov interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT darknersune interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT carlsenjonathanfrederik interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem AT nielsenmichaelbachmann interandintraobserveragreementwhenusingadiagnosticlabelingschemeforannotatingfindingsonchestxraysanearlystepinthedevelopmentofadeeplearningbaseddecisionsupportsystem |