Text-to-Image

Rectified-flow diffusion in CLIP/VAE latent space.

1 200
0 10
1 200