Reda alami's picture

1

Reda alami

RedaAlami

·

AI & ML interests

Reinforcement Learning

Recent Activity

updated a dataset 13 days ago

RedaAlami/OpenR1-Math-split-v2

published a dataset 13 days ago

RedaAlami/OpenR1-Math-split-v2

published a model 15 days ago

RedaAlami/Falcon3-7B-Instruct-OpenR1-Math

View all activity

Organizations

spaces 1

TestRecommenderSystem

models 15

RedaAlami/Falcon3-7B-Instruct-OpenR1-Math

Text Generation • Updated 15 days ago • 56

RedaAlami/Qwen-2.5-7B-Simple-RL

Updated 29 days ago

RedaAlami/Falcon3-7B-Instruct-Distill-DS-v1

Text Generation • Updated Feb 12 • 55

RedaAlami/Qwen2-0.5B-GRPO-test

RedaAlami/zephyr-7b-dpo-qlora

Updated Oct 4, 2024 • 46

RedaAlami/zephyr-7b-dpo-full

Updated Aug 29, 2024

RedaAlami/merged-dataset0-dataset1

Updated Aug 28, 2024

RedaAlami/zephyr-7b-gemma-dpo

Updated Jul 31, 2024 • 5

RedaAlami/ultrafeedback_binarized_custom2

Updated Jul 17, 2024

RedaAlami/ultrafeedback_binarized_custom

Updated Jul 17, 2024

datasets 145

RedaAlami/OpenR1-Math-split-v2

Viewer • Updated 13 days ago • 93.7k • 119

RedaAlami/OpenR1-Math-split-v1

Viewer • Updated 20 days ago • 93.7k • 120

RedaAlami/OpenR1-Math-split-modified

Viewer • Updated 20 days ago • 93.7k • 78

RedaAlami/OpenR1-Math-split

Viewer • Updated 20 days ago • 93.7k • 119

RedaAlami/OpenR1-Math-220k-default-50percent

Viewer • Updated 23 days ago • 46.9k • 88

RedaAlami/OpenR1-Math-220k-default

Viewer • Updated 24 days ago • 93.7k • 138

RedaAlami/merged-dpo-safety

Viewer • Updated Feb 3 • 3.95k • 47

RedaAlami/eng-batch-3-dpo-safety_test

Viewer • Updated Feb 3 • 36 • 43

RedaAlami/eng-batch-4-dpo-safety_test

Viewer • Updated Feb 3 • 53 • 54

RedaAlami/eng-batch-5-dpo-safety_test

Viewer • Updated Feb 3 • 63 • 58