Cargando…

How Much Is Enough? A Study on Diffusion Times in Score-Based Generative Models

Score-based diffusion models are a class of generative models whose dynamics is described by stochastic differential equations that map noise into data. While recent works have started to lay down a theoretical foundation for these models, a detailed understanding of the role of the diffusion time T...

Descripción completa

Detalles Bibliográficos
Autores principales: Franzese, Giulio, Rossi, Simone, Yang, Lixuan, Finamore, Alessandro, Rossi, Dario, Filippone, Maurizio, Michiardi, Pietro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10138161/
https://www.ncbi.nlm.nih.gov/pubmed/37190421
http://dx.doi.org/10.3390/e25040633
_version_ 1785032641188200448
author Franzese, Giulio
Rossi, Simone
Yang, Lixuan
Finamore, Alessandro
Rossi, Dario
Filippone, Maurizio
Michiardi, Pietro
author_facet Franzese, Giulio
Rossi, Simone
Yang, Lixuan
Finamore, Alessandro
Rossi, Dario
Filippone, Maurizio
Michiardi, Pietro
author_sort Franzese, Giulio
collection PubMed
description Score-based diffusion models are a class of generative models whose dynamics is described by stochastic differential equations that map noise into data. While recent works have started to lay down a theoretical foundation for these models, a detailed understanding of the role of the diffusion time T is still lacking. Current best practice advocates for a large T to ensure that the forward dynamics brings the diffusion sufficiently close to a known and simple noise distribution; however, a smaller value of T should be preferred for a better approximation of the score-matching objective and higher computational efficiency. Starting from a variational interpretation of diffusion models, in this work we quantify this trade-off and suggest a new method to improve quality and efficiency of both training and sampling, by adopting smaller diffusion times. Indeed, we show how an auxiliary model can be used to bridge the gap between the ideal and the simulated forward dynamics, followed by a standard reverse diffusion process. Empirical results support our analysis; for image data, our method is competitive with regard to the state of the art, according to standard sample quality metrics and log-likelihood.
format Online
Article
Text
id pubmed-10138161
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-101381612023-04-28 How Much Is Enough? A Study on Diffusion Times in Score-Based Generative Models Franzese, Giulio Rossi, Simone Yang, Lixuan Finamore, Alessandro Rossi, Dario Filippone, Maurizio Michiardi, Pietro Entropy (Basel) Article Score-based diffusion models are a class of generative models whose dynamics is described by stochastic differential equations that map noise into data. While recent works have started to lay down a theoretical foundation for these models, a detailed understanding of the role of the diffusion time T is still lacking. Current best practice advocates for a large T to ensure that the forward dynamics brings the diffusion sufficiently close to a known and simple noise distribution; however, a smaller value of T should be preferred for a better approximation of the score-matching objective and higher computational efficiency. Starting from a variational interpretation of diffusion models, in this work we quantify this trade-off and suggest a new method to improve quality and efficiency of both training and sampling, by adopting smaller diffusion times. Indeed, we show how an auxiliary model can be used to bridge the gap between the ideal and the simulated forward dynamics, followed by a standard reverse diffusion process. Empirical results support our analysis; for image data, our method is competitive with regard to the state of the art, according to standard sample quality metrics and log-likelihood. MDPI 2023-04-07 /pmc/articles/PMC10138161/ /pubmed/37190421 http://dx.doi.org/10.3390/e25040633 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Franzese, Giulio
Rossi, Simone
Yang, Lixuan
Finamore, Alessandro
Rossi, Dario
Filippone, Maurizio
Michiardi, Pietro
How Much Is Enough? A Study on Diffusion Times in Score-Based Generative Models
title How Much Is Enough? A Study on Diffusion Times in Score-Based Generative Models
title_full How Much Is Enough? A Study on Diffusion Times in Score-Based Generative Models
title_fullStr How Much Is Enough? A Study on Diffusion Times in Score-Based Generative Models
title_full_unstemmed How Much Is Enough? A Study on Diffusion Times in Score-Based Generative Models
title_short How Much Is Enough? A Study on Diffusion Times in Score-Based Generative Models
title_sort how much is enough? a study on diffusion times in score-based generative models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10138161/
https://www.ncbi.nlm.nih.gov/pubmed/37190421
http://dx.doi.org/10.3390/e25040633
work_keys_str_mv AT franzesegiulio howmuchisenoughastudyondiffusiontimesinscorebasedgenerativemodels
AT rossisimone howmuchisenoughastudyondiffusiontimesinscorebasedgenerativemodels
AT yanglixuan howmuchisenoughastudyondiffusiontimesinscorebasedgenerativemodels
AT finamorealessandro howmuchisenoughastudyondiffusiontimesinscorebasedgenerativemodels
AT rossidario howmuchisenoughastudyondiffusiontimesinscorebasedgenerativemodels
AT filipponemaurizio howmuchisenoughastudyondiffusiontimesinscorebasedgenerativemodels
AT michiardipietro howmuchisenoughastudyondiffusiontimesinscorebasedgenerativemodels