CRIS Current Research Information System

Denoising Diffusion models are gaining increasing popularity in the field of generative modeling for several reasons, including the simple and stable training, the excellent generative quality, and the solid probabilistic foundation. In this article, we address the problem of embedding an image into the latent space of Denoising Diffusion Models, that is finding a suitable “noisy” image whose denoising results in the original image. We particularly focus on Denoising Diffusion Implicit Models due to the deterministic nature of their reverse diffusion process. As a side result of our investigation, we gain a deeper insight into the structure of the latent space of diffusion models, opening interesting perspectives on its exploration, the definition of semantic trajectories, and the manipulation/conditioning of encodings for editing purposes. A particularly interesting property highlighted by our research, which is also characteristic of this class of generative models, is the independence of the latent representation from the networks implementing the reverse diffusion process. In other words, a common seed passed to different networks (each trained on the same dataset), eventually results in identical images.

Asperti, A., Evangelista, D., Marro, S., Merizzi, F. (2023). Image embedding for denoising generative models. ARTIFICIAL INTELLIGENCE REVIEW, 56, 14511-14533 [10.1007/s10462-023-10504-5].

Image embedding for denoising generative models

Asperti, Andrea;Evangelista, Davide;Marro, Samuele;Merizzi, Fabio

2023

Abstract

Denoising Diffusion models are gaining increasing popularity in the field of generative modeling for several reasons, including the simple and stable training, the excellent generative quality, and the solid probabilistic foundation. In this article, we address the problem of embedding an image into the latent space of Denoising Diffusion Models, that is finding a suitable “noisy” image whose denoising results in the original image. We particularly focus on Denoising Diffusion Implicit Models due to the deterministic nature of their reverse diffusion process. As a side result of our investigation, we gain a deeper insight into the structure of the latent space of diffusion models, opening interesting perspectives on its exploration, the definition of semantic trajectories, and the manipulation/conditioning of encodings for editing purposes. A particularly interesting property highlighted by our research, which is also characteristic of this class of generative models, is the independence of the latent representation from the networks implementing the reverse diffusion process. In other words, a common seed passed to different networks (each trained on the same dataset), eventually results in identical images.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2023
			
	Rivista
	
				ARTIFICIAL INTELLIGENCE REVIEW
			
	Codice DOI
	
				https://dx.doi.org/10.1007/s10462-023-10504-5
			
	Citazione
	
				Asperti, A., Evangelista, D., Marro, S., Merizzi, F. (2023). Image embedding for denoising generative models. ARTIFICIAL INTELLIGENCE REVIEW, 56, 14511-14533 [10.1007/s10462-023-10504-5].
			
	Tutti gli autori
	
						Asperti, Andrea; Evangelista, Davide; Marro, Samuele; Merizzi, Fabio
					
	Appare nelle tipologie:
	
				1.01 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
s10462-023-10504-5.pdf accesso aperto Tipo: Versione (PDF) editoriale Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY) Dimensione 3.39 MB Formato Adobe PDF Visualizza/Apri	3.39 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/928216

Citazioni

ND

3

2

social impact