Cargando…
Convolutional neural network -based phantom image scoring for mammography quality control
BACKGROUND: Visual evaluation of phantom images is an important, but time-consuming part of mammography quality control (QC). Consistent scoring of phantom images over the device’s lifetime is highly desirable. Recently, convolutional neural networks (CNNs) have been applied to a wide range of image...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9727908/ https://www.ncbi.nlm.nih.gov/pubmed/36476319 http://dx.doi.org/10.1186/s12880-022-00944-w |
_version_ | 1784845129243164672 |
---|---|
author | Sundell, Veli-Matti Mäkelä, Teemu Vitikainen, Anne-Mari Kaasalainen, Touko |
author_facet | Sundell, Veli-Matti Mäkelä, Teemu Vitikainen, Anne-Mari Kaasalainen, Touko |
author_sort | Sundell, Veli-Matti |
collection | PubMed |
description | BACKGROUND: Visual evaluation of phantom images is an important, but time-consuming part of mammography quality control (QC). Consistent scoring of phantom images over the device’s lifetime is highly desirable. Recently, convolutional neural networks (CNNs) have been applied to a wide range of image classification problems, performing with a high accuracy. The purpose of this study was to automate mammography QC phantom scoring task by training CNN models to mimic a human reviewer. METHODS: Eight CNN variations consisting of three to ten convolutional layers were trained for detecting targets (fibres, microcalcifications and masses) in American College of Radiology (ACR) accreditation phantom images and the results were compared with human scoring. Regular and artificially degraded/improved QC phantom images from eight mammography devices were visually evaluated by one reviewer. These images were used in training the CNN models. A separate test set consisted of daily QC images from the eight devices and separately acquired images with varying dose levels. These were scored by four reviewers and considered the ground truth for CNN performance testing. RESULTS: Although hyper-parameter search space was limited, an optimal network depth after which additional layers resulted in decreased accuracy was identified. The highest scoring accuracy (95%) was achieved with the CNN consisting of six convolutional layers. The highest deviation between the CNN and the reviewers was found at lowest dose levels. No significant difference emerged between the visual reviews and CNN results except in case of smallest masses. CONCLUSION: A CNN-based automatic mammography QC phantom scoring system can score phantom images in a good agreement with human reviewers, and can therefore be of benefit in mammography QC. |
format | Online Article Text |
id | pubmed-9727908 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-97279082022-12-08 Convolutional neural network -based phantom image scoring for mammography quality control Sundell, Veli-Matti Mäkelä, Teemu Vitikainen, Anne-Mari Kaasalainen, Touko BMC Med Imaging Research BACKGROUND: Visual evaluation of phantom images is an important, but time-consuming part of mammography quality control (QC). Consistent scoring of phantom images over the device’s lifetime is highly desirable. Recently, convolutional neural networks (CNNs) have been applied to a wide range of image classification problems, performing with a high accuracy. The purpose of this study was to automate mammography QC phantom scoring task by training CNN models to mimic a human reviewer. METHODS: Eight CNN variations consisting of three to ten convolutional layers were trained for detecting targets (fibres, microcalcifications and masses) in American College of Radiology (ACR) accreditation phantom images and the results were compared with human scoring. Regular and artificially degraded/improved QC phantom images from eight mammography devices were visually evaluated by one reviewer. These images were used in training the CNN models. A separate test set consisted of daily QC images from the eight devices and separately acquired images with varying dose levels. These were scored by four reviewers and considered the ground truth for CNN performance testing. RESULTS: Although hyper-parameter search space was limited, an optimal network depth after which additional layers resulted in decreased accuracy was identified. The highest scoring accuracy (95%) was achieved with the CNN consisting of six convolutional layers. The highest deviation between the CNN and the reviewers was found at lowest dose levels. No significant difference emerged between the visual reviews and CNN results except in case of smallest masses. CONCLUSION: A CNN-based automatic mammography QC phantom scoring system can score phantom images in a good agreement with human reviewers, and can therefore be of benefit in mammography QC. BioMed Central 2022-12-07 /pmc/articles/PMC9727908/ /pubmed/36476319 http://dx.doi.org/10.1186/s12880-022-00944-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Sundell, Veli-Matti Mäkelä, Teemu Vitikainen, Anne-Mari Kaasalainen, Touko Convolutional neural network -based phantom image scoring for mammography quality control |
title | Convolutional neural network -based phantom image scoring for mammography quality control |
title_full | Convolutional neural network -based phantom image scoring for mammography quality control |
title_fullStr | Convolutional neural network -based phantom image scoring for mammography quality control |
title_full_unstemmed | Convolutional neural network -based phantom image scoring for mammography quality control |
title_short | Convolutional neural network -based phantom image scoring for mammography quality control |
title_sort | convolutional neural network -based phantom image scoring for mammography quality control |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9727908/ https://www.ncbi.nlm.nih.gov/pubmed/36476319 http://dx.doi.org/10.1186/s12880-022-00944-w |
work_keys_str_mv | AT sundellvelimatti convolutionalneuralnetworkbasedphantomimagescoringformammographyqualitycontrol AT makelateemu convolutionalneuralnetworkbasedphantomimagescoringformammographyqualitycontrol AT vitikainenannemari convolutionalneuralnetworkbasedphantomimagescoringformammographyqualitycontrol AT kaasalainentouko convolutionalneuralnetworkbasedphantomimagescoringformammographyqualitycontrol |