FlowTok: Flowing Seamlessly Across Text and Image Tokens Paper • 2503.10772 • Published 9 days ago • 16
Learning Few-Step Diffusion Models by Trajectory Distribution Matching Paper • 2503.06674 • Published 13 days ago • 6
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published 10 days ago • 58
Running 2.33k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think Paper • 2503.00948 • Published 20 days ago • 3
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment Paper • 2502.16894 • Published 26 days ago • 28
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model Paper • 2411.19108 • Published Nov 28, 2024 • 19