Open-RS - a knoveleng Collection

Model weights and datasets from the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t"