Hugging Face
Reda alami
RedaAlami
8 followers · 3 following
AI & ML interests
Reinforcement Learning
Recent Activity
Updated a dataset 12 days ago: RedaAlami/OpenR1-Math-split-v2
Published a dataset 12 days ago: RedaAlami/OpenR1-Math-split-v2
Published a model 14 days ago: RedaAlami/Falcon3-7B-Instruct-OpenR1-Math
Organizations
spaces
1
Sleeping
TestRecommenderSystem
👁
models
15
RedaAlami/Falcon3-7B-Instruct-OpenR1-Math • Text Generation • Updated 14 days ago • 56
RedaAlami/Qwen-2.5-7B-Simple-RL • Updated 29 days ago
RedaAlami/Falcon3-7B-Instruct-Distill-DS-v1 • Text Generation • Updated Feb 12 • 55
RedaAlami/Qwen2-0.5B-GRPO-test • Updated Feb 10
RedaAlami/zephyr-7b-dpo-qlora • Updated Oct 4, 2024 • 46
RedaAlami/zephyr-7b-dpo-full • Updated Aug 29, 2024
RedaAlami/merged-dataset0-dataset1 • Updated Aug 28, 2024
RedaAlami/zephyr-7b-gemma-dpo • Updated Jul 31, 2024 • 5
RedaAlami/ultrafeedback_binarized_custom2 • Updated Jul 17, 2024
RedaAlami/ultrafeedback_binarized_custom • Updated Jul 17, 2024
datasets
145
RedaAlami/OpenR1-Math-split-v2 • Viewer • Updated 12 days ago • 93.7k • 119
RedaAlami/OpenR1-Math-split-v1 • Viewer • Updated 19 days ago • 93.7k • 120
RedaAlami/OpenR1-Math-split-modified • Viewer • Updated 19 days ago • 93.7k • 78
RedaAlami/OpenR1-Math-split • Viewer • Updated 19 days ago • 93.7k • 119
RedaAlami/OpenR1-Math-220k-default-50percent • Viewer • Updated 22 days ago • 46.9k • 88
RedaAlami/OpenR1-Math-220k-default • Viewer • Updated 23 days ago • 93.7k • 138
RedaAlami/merged-dpo-safety • Viewer • Updated Feb 3 • 3.95k • 47
RedaAlami/eng-batch-3-dpo-safety_test • Viewer • Updated Feb 3 • 36 • 43
RedaAlami/eng-batch-4-dpo-safety_test • Viewer • Updated Feb 3 • 53 • 54
RedaAlami/eng-batch-5-dpo-safety_test • Viewer • Updated Feb 3 • 63 • 58