Cargando…
Finetuning of GLIDE stable diffusion model for AI-based text-conditional image synthesis of dermoscopic images
BACKGROUND: The development of artificial intelligence (AI)-based algorithms and advances in medical domains rely on large datasets. A recent advancement in text-to-image generative AI is GLIDE (Guided Language to Image Diffusion for Generation and Editing). There are a number of representations ava...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10623307/ https://www.ncbi.nlm.nih.gov/pubmed/37928464 http://dx.doi.org/10.3389/fmed.2023.1231436 |
_version_ | 1785130714000261120 |
---|---|
author | Shavlokhova, Veronika Vollmer, Andreas Zouboulis, Christos C. Vollmer, Michael Wollborn, Jakob Lang, Gernot Kübler, Alexander Hartmann, Stefan Stoll, Christian Roider, Elisabeth Saravi, Babak |
author_facet | Shavlokhova, Veronika Vollmer, Andreas Zouboulis, Christos C. Vollmer, Michael Wollborn, Jakob Lang, Gernot Kübler, Alexander Hartmann, Stefan Stoll, Christian Roider, Elisabeth Saravi, Babak |
author_sort | Shavlokhova, Veronika |
collection | PubMed |
description | BACKGROUND: The development of artificial intelligence (AI)-based algorithms and advances in medical domains rely on large datasets. A recent advancement in text-to-image generative AI is GLIDE (Guided Language to Image Diffusion for Generation and Editing). There are a number of representations available in the GLIDE model, but it has not been refined for medical applications. METHODS: For text-conditional image synthesis with classifier-free guidance, we have fine-tuned GLIDE using 10,015 dermoscopic images of seven diagnostic entities, including melanoma and melanocytic nevi. Photorealistic synthetic samples of each diagnostic entity were created by the algorithm. Following this, an experienced dermatologist reviewed 140 images (20 of each entity), with 10 samples originating from artificial intelligence and 10 from original images from the dataset. The dermatologist classified the provided images according to the seven diagnostic entities. Additionally, the dermatologist was asked to indicate whether or not a particular image was created by AI. Further, we trained a deep learning model to compare the diagnostic results of dermatologist versus machine for entity classification. RESULTS: The results indicate that the generated images possess varying degrees of quality and realism, with melanocytic nevi and melanoma having higher similarity to real images than other classes. The integration of synthetic images improved the classification performance of the model, resulting in higher accuracy and precision. The AI assessment showed superior classification performance compared to dermatologist. CONCLUSION: Overall, the results highlight the potential of synthetic images for training and improving AI models in dermatology to overcome data scarcity. |
format | Online Article Text |
id | pubmed-10623307 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-106233072023-11-04 Finetuning of GLIDE stable diffusion model for AI-based text-conditional image synthesis of dermoscopic images Shavlokhova, Veronika Vollmer, Andreas Zouboulis, Christos C. Vollmer, Michael Wollborn, Jakob Lang, Gernot Kübler, Alexander Hartmann, Stefan Stoll, Christian Roider, Elisabeth Saravi, Babak Front Med (Lausanne) Medicine BACKGROUND: The development of artificial intelligence (AI)-based algorithms and advances in medical domains rely on large datasets. A recent advancement in text-to-image generative AI is GLIDE (Guided Language to Image Diffusion for Generation and Editing). There are a number of representations available in the GLIDE model, but it has not been refined for medical applications. METHODS: For text-conditional image synthesis with classifier-free guidance, we have fine-tuned GLIDE using 10,015 dermoscopic images of seven diagnostic entities, including melanoma and melanocytic nevi. Photorealistic synthetic samples of each diagnostic entity were created by the algorithm. Following this, an experienced dermatologist reviewed 140 images (20 of each entity), with 10 samples originating from artificial intelligence and 10 from original images from the dataset. The dermatologist classified the provided images according to the seven diagnostic entities. Additionally, the dermatologist was asked to indicate whether or not a particular image was created by AI. Further, we trained a deep learning model to compare the diagnostic results of dermatologist versus machine for entity classification. RESULTS: The results indicate that the generated images possess varying degrees of quality and realism, with melanocytic nevi and melanoma having higher similarity to real images than other classes. The integration of synthetic images improved the classification performance of the model, resulting in higher accuracy and precision. The AI assessment showed superior classification performance compared to dermatologist. CONCLUSION: Overall, the results highlight the potential of synthetic images for training and improving AI models in dermatology to overcome data scarcity. Frontiers Media S.A. 2023-10-20 /pmc/articles/PMC10623307/ /pubmed/37928464 http://dx.doi.org/10.3389/fmed.2023.1231436 Text en Copyright © 2023 Shavlokhova, Vollmer, Zouboulis, Vollmer, Wollborn, Lang, Kübler, Hartmann, Stoll, Roider and Saravi. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Medicine Shavlokhova, Veronika Vollmer, Andreas Zouboulis, Christos C. Vollmer, Michael Wollborn, Jakob Lang, Gernot Kübler, Alexander Hartmann, Stefan Stoll, Christian Roider, Elisabeth Saravi, Babak Finetuning of GLIDE stable diffusion model for AI-based text-conditional image synthesis of dermoscopic images |
title | Finetuning of GLIDE stable diffusion model for AI-based text-conditional image synthesis of dermoscopic images |
title_full | Finetuning of GLIDE stable diffusion model for AI-based text-conditional image synthesis of dermoscopic images |
title_fullStr | Finetuning of GLIDE stable diffusion model for AI-based text-conditional image synthesis of dermoscopic images |
title_full_unstemmed | Finetuning of GLIDE stable diffusion model for AI-based text-conditional image synthesis of dermoscopic images |
title_short | Finetuning of GLIDE stable diffusion model for AI-based text-conditional image synthesis of dermoscopic images |
title_sort | finetuning of glide stable diffusion model for ai-based text-conditional image synthesis of dermoscopic images |
topic | Medicine |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10623307/ https://www.ncbi.nlm.nih.gov/pubmed/37928464 http://dx.doi.org/10.3389/fmed.2023.1231436 |
work_keys_str_mv | AT shavlokhovaveronika finetuningofglidestablediffusionmodelforaibasedtextconditionalimagesynthesisofdermoscopicimages AT vollmerandreas finetuningofglidestablediffusionmodelforaibasedtextconditionalimagesynthesisofdermoscopicimages AT zouboulischristosc finetuningofglidestablediffusionmodelforaibasedtextconditionalimagesynthesisofdermoscopicimages AT vollmermichael finetuningofglidestablediffusionmodelforaibasedtextconditionalimagesynthesisofdermoscopicimages AT wollbornjakob finetuningofglidestablediffusionmodelforaibasedtextconditionalimagesynthesisofdermoscopicimages AT langgernot finetuningofglidestablediffusionmodelforaibasedtextconditionalimagesynthesisofdermoscopicimages AT kubleralexander finetuningofglidestablediffusionmodelforaibasedtextconditionalimagesynthesisofdermoscopicimages AT hartmannstefan finetuningofglidestablediffusionmodelforaibasedtextconditionalimagesynthesisofdermoscopicimages AT stollchristian finetuningofglidestablediffusionmodelforaibasedtextconditionalimagesynthesisofdermoscopicimages AT roiderelisabeth finetuningofglidestablediffusionmodelforaibasedtextconditionalimagesynthesisofdermoscopicimages AT saravibabak finetuningofglidestablediffusionmodelforaibasedtextconditionalimagesynthesisofdermoscopicimages |