Lewis Tunstall PRO
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Recent Activity
updated
a Space
about 8 hours ago
open-r1/open-r1-eval-leaderboard
updated
a Space
about 9 hours ago
open-r1/open-r1-eval-leaderboard
updated
a Space
about 15 hours ago
open-r1/open-r1-eval-leaderboard
Organizations
lewtun's activity
Why is there a discrepancy between the 'Solutions' subset and the 'Solutions_py' subset?
1
#2 opened 3 days ago
by
waple

Trouble loading the dataset
2
#2 opened 3 days ago
by
lewtun

Update README.md
1
#1 opened 4 days ago
by
lhoestq

Size of the weights > 140 GB for a 32 GB model?
1
#2 opened 4 days ago
by
stelterlab

Remove fp32 weights
#4 opened 4 days ago
by
lewtun

Remove fp32 weights
#3 opened 4 days ago
by
lewtun

[Paper review] Small Models Struggle to Learn from Strong Reasoners
#19 opened 21 days ago
by
lewtun

⚠️ Chat template foot gun with DeepSeek distilled models and RL format reward function
6
#17 opened 30 days ago
by
lewtun

the finetune config of open-r1?
2
#6 opened 29 days ago
by
MilyFang
Update README.md
3
#1 opened about 1 month ago
by
davidberenstein1957

[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO
21
#15 opened about 1 month ago
by
lewtun

System Prompt
3
#3 opened about 2 months ago
by
Wanfq

Is there a way to print this article?
2
#9 opened 3 months ago
by
iamgianluca
Questions about Verifier Development, Search as Data Generation Tool, and Model Family Alignment
4
#8 opened 3 months ago
by
bird-of-paradise
Link to "canonical form" does not work
1
#4 opened 3 months ago
by
blackhc
code pointers?
2
#7 opened 3 months ago
by
rganti
Link to the community tab in the blogpost
1
#3 opened 3 months ago
by
clem

Use HF papers
1
#2 opened 3 months ago
by
qgallouedec

Update app/src/index.html
1
#1 opened 3 months ago
by
qgallouedec
