Magpie-Align/Magpie-Qwen2.5-Pro-300K-Filtered Viewer • Updated Oct 20, 2024 • 300k • 207 • 12
nvidia/Llama-Nemotron-Post-Training-Dataset Viewer • Updated 2 days ago • 3.91M • 11.4k • 475
Running 9 9 Financial LLM Performance Leaderboard 📈 Expect the Unexpected: FailSafe Long Context QA for Finance
Running 2.57k 2.57k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Paper • 2501.13629 • Published Jan 23 • 48