Submitted by TheCockatoo t3_10m1sdm in MachineLearning
HateRedditCantQuitit t1_j60qzvg wrote
Reply to comment by dojoteef in [D] Why are GANs worse than (Latent) Diffusion Models for text2img generation? by TheCockatoo
I always see diffusion/score models contrasted against VAEs, but is there really a good distinction? Especially given latent diffusion and IAFs and all the other blurry lines. I feel like any time you're doing forward training & backwards inference trained with an ELBO objective, it should count as a VAE.
Zealousideal_Low1287 t1_j6191sq wrote
I guess for it to really count as a variational autoencoder you need to be reconstructing the input
HateRedditCantQuitit t1_j621uj8 wrote
Isn't reconstructing the input exactly what the denoising objective does?
Viewing a single comment thread. View all comments