Implicit Chain of Thought Reasoning via Knowledge Distillation Paper • 2311.01460 • Published Nov 2, 2023 • 2
Differentiable Tree Operations Promote Compositional Generalization Paper • 2306.00751 • Published Jun 1, 2023
Running 927 927 FineWeb: decanting the web for the finest text data at scale 🍷 Generate high-quality web text data for LLM training
Running on CPU Upgrade 13k 13k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots