hanzlajavaid

hanzla

AI & ML interests

Direct Preference Optimization, Supervised Finetuning, Stable Diffusion

Recent Activity

Organizations

ZeroGPU Explorers, MLX Community, ModularityAI, Social Post Explorers, Turquoise Turtle

hanzla's activity

reacted to their post with 👍 about 16 hours ago
Hello community,

I want to share my work on building a reasoning Mamba model.

I trained this model with GRPO on top of Falcon3 Mamba Instruct. It generates responses very quickly while building sound logic to answer challenging questions.
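For readers unfamiliar with GRPO (Group Relative Policy Optimization), the core idea is to sample several completions for the same prompt and score each one relative to the rest of its group instead of using a learned value model. The sketch below illustrates only that normalization step. It is not the training code behind this model: the function name, the reward values, and the use of the population standard deviation are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against the other completions of the same prompt.

    Hypothetical helper: subtracts the group mean and divides by the group
    (population) standard deviation, with eps guarding against division by zero.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up rewards for four completions of one prompt (1.0 = correct answer).
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
print(advantages)
```

Completions that beat their group average get a positive advantage and are reinforced, while those below it are pushed down. When every completion in a group receives the same reward, all advantages are zero and that group contributes no learning signal.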

Give it a try:

Model repo: hanzla/Falcon3-Mamba-R1-v0

Space: hanzla/Falcon3MambaReasoner

Looking forward to community feedback.
posted an update about 22 hours ago
reacted to AtAndDev's post with 🔥 3 days ago
Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.
posted an update 4 days ago
Gemma 3 is a game changer for on-device multimodal applications.

See for yourself how good a 4-billion-parameter model can be.

hanzla/PlaygroundGemma3
published a model 4 days ago