ccrdallas t1_iuimqp9 wrote on October 31, 2022 at 4:46 PM

Reply to [D] Diffusion vs MCMC as sampling algorithms by ZeronixSama

Diffusion models essentially use a stochastic differential equation to produce samples. An MCMC method that produces samples using such equations, often known as Langevin diffusions, are called the Unadjusted/Metropolis-Adjusted Langevin algorithm(s).

Typically we don’t use the metropolis adjustment in practice and so these methods aren’t truly Metropolis-Hastings algorithms, they are just discretized SDEs. The purpose of the adjustment is to reduce bias and ensure convergence to a particular distribution but these are lesser concerns in diffusion models, particularly with small enough step size.

ccrdallas t1_ir5d2pn wrote on October 5, 2022 at 1:40 PM

Reply to [D] How to efficiently sample audio clips from a large audio file for classifying the large audio file? by time_waster103

If you have inexpensive features then you could try and build a Determinantal Point Process, a model for sampling diverse subsets (diversity defined via the features and an appropriate kernel). The downside is that this method, at worst, typically scales poorly (O(N^3 )) with the number of samples although there is recent work in this field to speed up inference. I can’t say a priori if it is fast enough for your task.