Feng
VandeeeFeng
·
AI & ML interests
None yet
Recent Activity
updated
a collection
3 days ago
apps
updated
a collection
5 days ago
apps
updated
a collection
14 days ago
models
Organizations
None yet
Collections
3
-
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 118 -
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Paper • 2501.17161 • Published • 121 -
2.57k
The Ultra-Scale Playbook
🌌The ultimate guide to training LLM on large GPU Clusters
-
202
LLM训练终极指南 | The Ultra-Scale Playbook
🔥了解LLM训练的方方面面
datasets
0
None public yet