-
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective
Paper • 2505.15045 • Published • 53 -
MMaDA: Multimodal Large Diffusion Language Models
Paper • 2505.15809 • Published • 85 -
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing
Paper • 2505.21600 • Published • 67 -
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO
Paper • 2505.22453 • Published • 45

Swee.LOL
sweelol
AI & ML interests
None yet
Recent Activity
updated
a collection
5 days ago
Important
updated
a collection
5 days ago
Important
updated
a collection
5 days ago
Important
Organizations
None yet
Collections
1
spaces
1
models
0
None public yet
datasets
0
None public yet