2140.8
TFLOPS
1205
69
65
Quentin Gallouédec
PRO
qgallouedec
250 followers
·
84 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
updated
a Space
2 days ago
open-r1/open-r1-eval-leaderboard
updated
a model
2 days ago
qgallouedec/R1-Zero-Qwen-7B-Math
Organizations
Articles
6
Article
34
Gotchas in Tokenizer Behavior Every Developer Should Know
Article
290
Open R1: Update #3
Papers
4
arxiv:2402.09844
arxiv:2402.03046
arxiv:2208.14928
arxiv:2106.13687
spaces
3
Runtime error
1
Run Hello World
👀
Runtime error
Run DuckDB Jobs
🦆
Process datasets with DuckDB SQL
Running
13
Train Memory
📈
Generate memory forecast for ML models
models
726
qgallouedec/R1-Zero-Qwen-7B-Math • Text Generation • Updated 2 days ago • 179
qgallouedec/Qwen-2.5-7B-Simple-RL • Text Generation • Updated 26 days ago • 6
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • Updated 27 days ago • 2
qgallouedec/DeepSeek-R1-Distill-Qwen-7B-GRPO • Updated Mar 26
qgallouedec/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated Mar 24
qgallouedec/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • Updated Mar 15
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-packing • Image-Text-to-Text • Updated Mar 14 • 2
qgallouedec/gemma-3-12b-it-codeforces-SFT • Image-Text-to-Text • Updated Mar 14 • 36 • 5
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-no-packing • Image-Text-to-Text • Updated Mar 14 • 3
qgallouedec/gemma-3-4b-it-codeforces-SFT • Image-Text-to-Text • Updated Mar 13 • 35 • 3
datasets
68
qgallouedec/DAPO-Math-17k-Processed-Scored • Viewer • Updated 4 days ago • 16.4k • 96 • 1
qgallouedec/trl-metrics • Viewer • Updated 8 days ago • 108k • 699 • 1
qgallouedec/prm800k • Viewer • Updated Dec 17, 2024 • 41.2k • 32 • 3
qgallouedec/ultrafeedback-prompt • Viewer • Updated Sep 9, 2024 • 60.9k • 43
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness • Viewer • Updated Sep 9, 2024 • 16.6k • 26
qgallouedec/lm-human-preferences-descriptiveness • Viewer • Updated Sep 9, 2024 • 6.26k • 21
qgallouedec/lm-human-preferences-sentiment • Viewer • Updated Sep 9, 2024 • 6.26k • 21
qgallouedec/tldr-preference • Viewer • Updated Sep 9, 2024 • 179k • 28
qgallouedec/tldr • Viewer • Updated Sep 9, 2024 • 130k • 30
qgallouedec/hh-rlhf-helpful-base • Viewer • Updated Sep 5, 2024 • 46.2k • 18