Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 19 days ago • 107
Running 2.27k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLMs on large GPU clusters
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published about 1 month ago • 51
Congliu/Chinese-DeepSeek-R1-Distill-data-110k Dataset • Updated 24 days ago • 110k • 8.31k • 536