ydnysh's Collections

Scaling Literature

updated Apr 4

Collection of Scaling Law Papers

  • Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

    Paper • 2408.03314 • Published Aug 6, 2024 • 63

  • Training Compute-Optimal Large Language Models

    Paper • 2203.15556 • Published Mar 29, 2022 • 10

  • Scaling Laws for Precision

    Paper • 2411.04330 • Published Nov 7, 2024 • 8

  • Transcending Scaling Laws with 0.1% Extra Compute

    Paper • 2210.11399 • Published Oct 20, 2022

  • Scaling Vision Transformers

    Paper • 2106.04560 • Published Jun 8, 2021

  • Emergent Abilities of Large Language Models

    Paper • 2206.07682 • Published Jun 15, 2022 • 3

  • Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

    Paper • 2502.06703 • Published Feb 10 • 152

  • Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws

    Paper • 2401.00448 • Published Dec 31, 2023 • 31