None defined yet.
š OctoThinker is led by GAIR
šÆ Our Goal: To reshape the pre-training trajectory so models scale better under RL.