koalazf99 commited on
Commit
b1d78ad
·
verified ·
1 Parent(s): 1dff1a7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -7,4 +7,5 @@ sdk: static
7
  pinned: false
8
  ---
9
 
10
- Edit this `README.md` markdown file to author your organization card.
 
 
7
  pinned: false
8
  ---
9
 
10
+ 🐙 OctoThinker, led by [GAIR](https://huggingface.co/GAIR), is an initiative to explore earlier training interventions that make base models more amenable to reinforcement learning (RL) scaling.
11
+ 🎯 Our Goal: To reshape the pre-training trajectory so models scale better under RL.