Scaling Diffusion Transformers Efficiently via μP
Paper • 2505.15270 • Published • 35
We release pretrained models in the paper Scaling Diffusion Transformers Efficiently via μP, which includes DiT-muP and PixArt-muP.
Code: https://github.com/ML-GSAI/Scaling-Diffusion-Transformers-muP