You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
kind of, you can backprop the gradients through the vae but it uses a lot of vram and doesn't work that well in my experience. Ideally there should be a latent CLIP trained on the LDM embeddings instead of pixels.
Wondering if this code base means we can use CLIP guidance for generation instead of the classifier free guidance in the regular model?
The text was updated successfully, but these errors were encountered: