Cargando…
2D medical image synthesis using transformer-based denoising diffusion probabilistic model
Objective. Artificial intelligence (AI) methods have gained popularity in medical imaging research. The size and scope of the training image datasets needed for successful AI model deployment does not always have the desired scale. In this paper, we introduce a medical image synthesis framework aime...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
IOP Publishing
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10160739/ https://www.ncbi.nlm.nih.gov/pubmed/37015231 http://dx.doi.org/10.1088/1361-6560/acca5c |
_version_ | 1785037347613573120 |
---|---|
author | Pan, Shaoyan Wang, Tonghe Qiu, Richard L J Axente, Marian Chang, Chih-Wei Peng, Junbo Patel, Ashish B Shelton, Joseph Patel, Sagar A Roper, Justin Yang, Xiaofeng |
author_facet | Pan, Shaoyan Wang, Tonghe Qiu, Richard L J Axente, Marian Chang, Chih-Wei Peng, Junbo Patel, Ashish B Shelton, Joseph Patel, Sagar A Roper, Justin Yang, Xiaofeng |
author_sort | Pan, Shaoyan |
collection | PubMed |
description | Objective. Artificial intelligence (AI) methods have gained popularity in medical imaging research. The size and scope of the training image datasets needed for successful AI model deployment does not always have the desired scale. In this paper, we introduce a medical image synthesis framework aimed at addressing the challenge of limited training datasets for AI models. Approach. The proposed 2D image synthesis framework is based on a diffusion model using a Swin-transformer-based network. This model consists of a forward Gaussian noise process and a reverse process using the transformer-based diffusion model for denoising. Training data includes four image datasets: chest x-rays, heart MRI, pelvic CT, and abdomen CT. We evaluated the authenticity, quality, and diversity of the synthetic images using visual Turing assessments conducted by three medical physicists, and four quantitative evaluations: the Inception score (IS), Fréchet Inception Distance score (FID), feature similarity and diversity score (DS, indicating diversity similarity) between the synthetic and true images. To leverage the framework value for training AI models, we conducted COVID-19 classification tasks using real images, synthetic images, and mixtures of both images. Main results. Visual Turing assessments showed an average accuracy of 0.64 (accuracy converging to [Formula: see text] indicates a better realistic visual appearance of the synthetic images), sensitivity of 0.79, and specificity of 0.50. Average quantitative accuracy obtained from all datasets were IS = 2.28, FID = 37.27, FDS = 0.20, and DS = 0.86. For the COVID-19 classification task, the baseline network obtained an accuracy of 0.88 using a pure real dataset, 0.89 using a pure synthetic dataset, and 0.93 using a dataset mixed of real and synthetic data. Significance. A image synthesis framework was demonstrated for medical image synthesis, which can generate high-quality medical images of different imaging modalities with the purpose of supplementing existing training sets for AI model deployment. This method has potential applications in many data-driven medical imaging research. |
format | Online Article Text |
id | pubmed-10160739 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | IOP Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-101607392023-05-06 2D medical image synthesis using transformer-based denoising diffusion probabilistic model Pan, Shaoyan Wang, Tonghe Qiu, Richard L J Axente, Marian Chang, Chih-Wei Peng, Junbo Patel, Ashish B Shelton, Joseph Patel, Sagar A Roper, Justin Yang, Xiaofeng Phys Med Biol Paper Objective. Artificial intelligence (AI) methods have gained popularity in medical imaging research. The size and scope of the training image datasets needed for successful AI model deployment does not always have the desired scale. In this paper, we introduce a medical image synthesis framework aimed at addressing the challenge of limited training datasets for AI models. Approach. The proposed 2D image synthesis framework is based on a diffusion model using a Swin-transformer-based network. This model consists of a forward Gaussian noise process and a reverse process using the transformer-based diffusion model for denoising. Training data includes four image datasets: chest x-rays, heart MRI, pelvic CT, and abdomen CT. We evaluated the authenticity, quality, and diversity of the synthetic images using visual Turing assessments conducted by three medical physicists, and four quantitative evaluations: the Inception score (IS), Fréchet Inception Distance score (FID), feature similarity and diversity score (DS, indicating diversity similarity) between the synthetic and true images. To leverage the framework value for training AI models, we conducted COVID-19 classification tasks using real images, synthetic images, and mixtures of both images. Main results. Visual Turing assessments showed an average accuracy of 0.64 (accuracy converging to [Formula: see text] indicates a better realistic visual appearance of the synthetic images), sensitivity of 0.79, and specificity of 0.50. Average quantitative accuracy obtained from all datasets were IS = 2.28, FID = 37.27, FDS = 0.20, and DS = 0.86. For the COVID-19 classification task, the baseline network obtained an accuracy of 0.88 using a pure real dataset, 0.89 using a pure synthetic dataset, and 0.93 using a dataset mixed of real and synthetic data. Significance. A image synthesis framework was demonstrated for medical image synthesis, which can generate high-quality medical images of different imaging modalities with the purpose of supplementing existing training sets for AI model deployment. This method has potential applications in many data-driven medical imaging research. IOP Publishing 2023-05-21 2023-05-05 /pmc/articles/PMC10160739/ /pubmed/37015231 http://dx.doi.org/10.1088/1361-6560/acca5c Text en © 2023 The Author(s). Published on behalf of Institute of Physics and Engineering in Medicine by IOP Publishing Ltd https://creativecommons.org/licenses/by/4.0/Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence (https://creativecommons.org/licenses/by/4.0/) . Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI. |
spellingShingle | Paper Pan, Shaoyan Wang, Tonghe Qiu, Richard L J Axente, Marian Chang, Chih-Wei Peng, Junbo Patel, Ashish B Shelton, Joseph Patel, Sagar A Roper, Justin Yang, Xiaofeng 2D medical image synthesis using transformer-based denoising diffusion probabilistic model |
title | 2D medical image synthesis using transformer-based denoising diffusion probabilistic model |
title_full | 2D medical image synthesis using transformer-based denoising diffusion probabilistic model |
title_fullStr | 2D medical image synthesis using transformer-based denoising diffusion probabilistic model |
title_full_unstemmed | 2D medical image synthesis using transformer-based denoising diffusion probabilistic model |
title_short | 2D medical image synthesis using transformer-based denoising diffusion probabilistic model |
title_sort | 2d medical image synthesis using transformer-based denoising diffusion probabilistic model |
topic | Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10160739/ https://www.ncbi.nlm.nih.gov/pubmed/37015231 http://dx.doi.org/10.1088/1361-6560/acca5c |
work_keys_str_mv | AT panshaoyan 2dmedicalimagesynthesisusingtransformerbaseddenoisingdiffusionprobabilisticmodel AT wangtonghe 2dmedicalimagesynthesisusingtransformerbaseddenoisingdiffusionprobabilisticmodel AT qiurichardlj 2dmedicalimagesynthesisusingtransformerbaseddenoisingdiffusionprobabilisticmodel AT axentemarian 2dmedicalimagesynthesisusingtransformerbaseddenoisingdiffusionprobabilisticmodel AT changchihwei 2dmedicalimagesynthesisusingtransformerbaseddenoisingdiffusionprobabilisticmodel AT pengjunbo 2dmedicalimagesynthesisusingtransformerbaseddenoisingdiffusionprobabilisticmodel AT patelashishb 2dmedicalimagesynthesisusingtransformerbaseddenoisingdiffusionprobabilisticmodel AT sheltonjoseph 2dmedicalimagesynthesisusingtransformerbaseddenoisingdiffusionprobabilisticmodel AT patelsagara 2dmedicalimagesynthesisusingtransformerbaseddenoisingdiffusionprobabilisticmodel AT roperjustin 2dmedicalimagesynthesisusingtransformerbaseddenoisingdiffusionprobabilisticmodel AT yangxiaofeng 2dmedicalimagesynthesisusingtransformerbaseddenoisingdiffusionprobabilisticmodel |