Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published Mar 12 • 72
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective Paper • 2505.15045 • Published 12 days ago • 52
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning Paper • 2505.16933 • Published 10 days ago • 29
LaViDa: A Large Diffusion Language Model for Multimodal Understanding Paper • 2505.16839 • Published 10 days ago • 12