Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 18 days ago • 107
Running 2.26k 2.26k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published 30 days ago • 51
Congliu/Chinese-DeepSeek-R1-Distill-data-110k Viewer • Updated 24 days ago • 110k • 8.31k • 533