
Finetuning of GLIDE stable diffusion model for AI-based text-conditional image synthesis of dermoscopic images

BACKGROUND: The development of artificial intelligence (AI)-based algorithms and advances in medical domains rely on large datasets. A recent advancement in text-to-image generative AI is GLIDE (Guided Language to Image Diffusion for Generation and Editing). There are a number of representations ava...


Bibliographic Details
Main authors: Shavlokhova, Veronika, Vollmer, Andreas, Zouboulis, Christos C., Vollmer, Michael, Wollborn, Jakob, Lang, Gernot, Kübler, Alexander, Hartmann, Stefan, Stoll, Christian, Roider, Elisabeth, Saravi, Babak
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2023
Subjects: Medicine
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10623307/
https://www.ncbi.nlm.nih.gov/pubmed/37928464
http://dx.doi.org/10.3389/fmed.2023.1231436
author Shavlokhova, Veronika
Vollmer, Andreas
Zouboulis, Christos C.
Vollmer, Michael
Wollborn, Jakob
Lang, Gernot
Kübler, Alexander
Hartmann, Stefan
Stoll, Christian
Roider, Elisabeth
Saravi, Babak
author_sort Shavlokhova, Veronika
collection PubMed
description BACKGROUND: The development of artificial intelligence (AI)-based algorithms and advances in medical domains rely on large datasets. A recent advancement in text-to-image generative AI is GLIDE (Guided Language to Image Diffusion for Generation and Editing). There are a number of representations available in the GLIDE model, but it has not been refined for medical applications. METHODS: For text-conditional image synthesis with classifier-free guidance, we have fine-tuned GLIDE using 10,015 dermoscopic images of seven diagnostic entities, including melanoma and melanocytic nevi. Photorealistic synthetic samples of each diagnostic entity were created by the algorithm. Following this, an experienced dermatologist reviewed 140 images (20 of each entity), with 10 samples originating from artificial intelligence and 10 from original images from the dataset. The dermatologist classified the provided images according to the seven diagnostic entities. Additionally, the dermatologist was asked to indicate whether or not a particular image was created by AI. Further, we trained a deep learning model to compare the diagnostic results of dermatologist versus machine for entity classification. RESULTS: The results indicate that the generated images possess varying degrees of quality and realism, with melanocytic nevi and melanoma having higher similarity to real images than other classes. The integration of synthetic images improved the classification performance of the model, resulting in higher accuracy and precision. The AI assessment showed superior classification performance compared to dermatologist. CONCLUSION: Overall, the results highlight the potential of synthetic images for training and improving AI models in dermatology to overcome data scarcity.
format Online
Article
Text
id pubmed-10623307
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-10623307 2023-11-04 Front Med (Lausanne) Medicine Frontiers Media S.A. 2023-10-20 /pmc/articles/PMC10623307/ /pubmed/37928464 http://dx.doi.org/10.3389/fmed.2023.1231436 Text en Copyright © 2023 Shavlokhova, Vollmer, Zouboulis, Vollmer, Wollborn, Lang, Kübler, Hartmann, Stoll, Roider and Saravi. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title Finetuning of GLIDE stable diffusion model for AI-based text-conditional image synthesis of dermoscopic images
topic Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10623307/
https://www.ncbi.nlm.nih.gov/pubmed/37928464
http://dx.doi.org/10.3389/fmed.2023.1231436