Scaling Literature - a ydnysh Collection

ydnysh 's Collections

Scaling Literature

The Deepseek AI Collection

Benchmarks and Evals

Scaling Literature

updated Apr 4

Collection of Scaling Law Papers

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 63
Training Compute-Optimal Large Language Models

Paper • 2203.15556 • Published Mar 29, 2022 • 10
Scaling Laws for Precision

Paper • 2411.04330 • Published Nov 7, 2024 • 8
Transcending Scaling Laws with 0.1% Extra Compute

Paper • 2210.11399 • Published Oct 20, 2022
Scaling Vision Transformers

Paper • 2106.04560 • Published Jun 8, 2021
Emergent Abilities of Large Language Models

Paper • 2206.07682 • Published Jun 15, 2022 • 3
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 152
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws

Paper • 2401.00448 • Published Dec 31, 2023 • 31