Cargando…

Convolutional neural network -based phantom image scoring for mammography quality control

BACKGROUND: Visual evaluation of phantom images is an important, but time-consuming part of mammography quality control (QC). Consistent scoring of phantom images over the device’s lifetime is highly desirable. Recently, convolutional neural networks (CNNs) have been applied to a wide range of image...

Descripción completa

Detalles Bibliográficos
Autores principales: Sundell, Veli-Matti, Mäkelä, Teemu, Vitikainen, Anne-Mari, Kaasalainen, Touko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9727908/
https://www.ncbi.nlm.nih.gov/pubmed/36476319
http://dx.doi.org/10.1186/s12880-022-00944-w
_version_ 1784845129243164672
author Sundell, Veli-Matti
Mäkelä, Teemu
Vitikainen, Anne-Mari
Kaasalainen, Touko
author_facet Sundell, Veli-Matti
Mäkelä, Teemu
Vitikainen, Anne-Mari
Kaasalainen, Touko
author_sort Sundell, Veli-Matti
collection PubMed
description BACKGROUND: Visual evaluation of phantom images is an important, but time-consuming part of mammography quality control (QC). Consistent scoring of phantom images over the device’s lifetime is highly desirable. Recently, convolutional neural networks (CNNs) have been applied to a wide range of image classification problems, performing with a high accuracy. The purpose of this study was to automate mammography QC phantom scoring task by training CNN models to mimic a human reviewer. METHODS: Eight CNN variations consisting of three to ten convolutional layers were trained for detecting targets (fibres, microcalcifications and masses) in American College of Radiology (ACR) accreditation phantom images and the results were compared with human scoring. Regular and artificially degraded/improved QC phantom images from eight mammography devices were visually evaluated by one reviewer. These images were used in training the CNN models. A separate test set consisted of daily QC images from the eight devices and separately acquired images with varying dose levels. These were scored by four reviewers and considered the ground truth for CNN performance testing. RESULTS: Although hyper-parameter search space was limited, an optimal network depth after which additional layers resulted in decreased accuracy was identified. The highest scoring accuracy (95%) was achieved with the CNN consisting of six convolutional layers. The highest deviation between the CNN and the reviewers was found at lowest dose levels. No significant difference emerged between the visual reviews and CNN results except in case of smallest masses. CONCLUSION: A CNN-based automatic mammography QC phantom scoring system can score phantom images in a good agreement with human reviewers, and can therefore be of benefit in mammography QC.
format Online
Article
Text
id pubmed-9727908
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-97279082022-12-08 Convolutional neural network -based phantom image scoring for mammography quality control Sundell, Veli-Matti Mäkelä, Teemu Vitikainen, Anne-Mari Kaasalainen, Touko BMC Med Imaging Research BACKGROUND: Visual evaluation of phantom images is an important, but time-consuming part of mammography quality control (QC). Consistent scoring of phantom images over the device’s lifetime is highly desirable. Recently, convolutional neural networks (CNNs) have been applied to a wide range of image classification problems, performing with a high accuracy. The purpose of this study was to automate mammography QC phantom scoring task by training CNN models to mimic a human reviewer. METHODS: Eight CNN variations consisting of three to ten convolutional layers were trained for detecting targets (fibres, microcalcifications and masses) in American College of Radiology (ACR) accreditation phantom images and the results were compared with human scoring. Regular and artificially degraded/improved QC phantom images from eight mammography devices were visually evaluated by one reviewer. These images were used in training the CNN models. A separate test set consisted of daily QC images from the eight devices and separately acquired images with varying dose levels. These were scored by four reviewers and considered the ground truth for CNN performance testing. RESULTS: Although hyper-parameter search space was limited, an optimal network depth after which additional layers resulted in decreased accuracy was identified. The highest scoring accuracy (95%) was achieved with the CNN consisting of six convolutional layers. The highest deviation between the CNN and the reviewers was found at lowest dose levels. No significant difference emerged between the visual reviews and CNN results except in case of smallest masses. CONCLUSION: A CNN-based automatic mammography QC phantom scoring system can score phantom images in a good agreement with human reviewers, and can therefore be of benefit in mammography QC. BioMed Central 2022-12-07 /pmc/articles/PMC9727908/ /pubmed/36476319 http://dx.doi.org/10.1186/s12880-022-00944-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Sundell, Veli-Matti
Mäkelä, Teemu
Vitikainen, Anne-Mari
Kaasalainen, Touko
Convolutional neural network -based phantom image scoring for mammography quality control
title Convolutional neural network -based phantom image scoring for mammography quality control
title_full Convolutional neural network -based phantom image scoring for mammography quality control
title_fullStr Convolutional neural network -based phantom image scoring for mammography quality control
title_full_unstemmed Convolutional neural network -based phantom image scoring for mammography quality control
title_short Convolutional neural network -based phantom image scoring for mammography quality control
title_sort convolutional neural network -based phantom image scoring for mammography quality control
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9727908/
https://www.ncbi.nlm.nih.gov/pubmed/36476319
http://dx.doi.org/10.1186/s12880-022-00944-w
work_keys_str_mv AT sundellvelimatti convolutionalneuralnetworkbasedphantomimagescoringformammographyqualitycontrol
AT makelateemu convolutionalneuralnetworkbasedphantomimagescoringformammographyqualitycontrol
AT vitikainenannemari convolutionalneuralnetworkbasedphantomimagescoringformammographyqualitycontrol
AT kaasalainentouko convolutionalneuralnetworkbasedphantomimagescoringformammographyqualitycontrol