Lewis Tunstall's picture

Lewis Tunstall PRO

lewtun

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a Space about 1 hour ago

open-r1/open-r1-eval-leaderboard

updated a Space about 2 hours ago

open-r1/open-r1-eval-leaderboard

updated a Space about 2 hours ago

open-r1/open-r1-eval-leaderboard

View all activity

Organizations

lewtun's activity

New activity in open-r1/README 15 days ago

[Experiment] Training R1-Zero-like models with Open R1

#20 opened about 1 month ago by

New activity in open-r1/OlympicCoder-32B about 1 month ago

about <think> and </think>

#9 opened about 1 month ago by

Please add HF Inference Endpoint and library tags which allow easier deployment

#8 opened about 1 month ago by

Mode changed to Model

#7 opened about 1 month ago by

Update README.md

#6 opened about 1 month ago by

New activity in open-r1/OlympicCoder-7B about 1 month ago

Omitted <think> at the start and almost 10k tokens to debug 2 JS functions

#2 opened about 2 months ago by

operationdarkside

It seems to overthink

#3 opened about 1 month ago by

New activity in open-r1/codeforces-cots about 1 month ago

Upload dataset

#4 opened about 1 month ago by

missing </think> in all subset

#3 opened about 1 month ago by

New activity in open-r1/codeforces-cots about 2 months ago

Why is there a discrepancy between the 'Solutions' subset and the 'Solutions_py' subset?

#2 opened about 2 months ago by

Update README.md

#1 opened about 2 months ago by

New activity in open-r1/OlympicCoder-32B about 2 months ago

Size of the weights > 140 GB for a 32 GB model?

#2 opened about 2 months ago by

Remove fp32 weights

#4 opened about 2 months ago by

Remove fp32 weights

#3 opened about 2 months ago by

New activity in open-r1/README 2 months ago

⚠️ Chat template foot gun with DeepSeek distilled models and RL format reward function

#17 opened 2 months ago by

New activity in open-r1/OpenR1-Math-220k 2 months ago

the finetune config of open-r1?

#6 opened 2 months ago by

New activity in open-r1/OpenR1-Math-220k 3 months ago

Update README.md

#1 opened 3 months ago by

davidberenstein1957

New activity in open-r1/README 3 months ago

[Experiment] Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO

#15 opened 3 months ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B 3 months ago

System Prompt

#3 opened 3 months ago by

New activity in HuggingFaceH4/blogpost-scaling-test-time-compute 3 months ago

Is there a way to print this article?

#9 opened 4 months ago by