Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
vinhnx90
's Collections
Orpheus TTS Fine Tune
Phi GRPO Fine Tuning
Qwen GRPO Fine Tuning
Gemma 3 GRPO Fine Tuning
Models
Datasets
Spaces
Research Papers
Gemma 3 GRPO Fine Tuning
updated
Mar 22
My collecions of Gemma 3 1B RL fine-tuning using GPRO technique.
Upvote
-
vinhnx90/gemma-3-1b-thinking-v2
Text Generation
•
Updated
Mar 22
•
6
•
1
vinhnx90/gemma-3-1b-thinking-v2-mlx-4Bit
Text Generation
•
Updated
Mar 22
•
14
•
1
vinhnx90/gemma3-1b-thinking
Updated
Mar 22
•
5
vinhnx90/gemma-3-1b-thinking-v2-base-mlx-8Bit
Text Generation
•
Updated
Mar 22
•
9
•
1
vinhnx90/gemma-3-1b-thinking-v2-Q8_0-GGUF
Updated
Mar 22
•
35
•
1
vinhnx90/gemma-3-1b-thinking-v2-Q4_K_M-GGUF
Updated
Mar 22
•
64
•
3
vinhnx90/gemma-3-1b-thinking-v2-Q6_K-GGUF
Updated
Mar 22
•
18
vinhnx90/gemma-3-1b-thinking-v2-Q5_K_M-GGUF
Updated
Mar 22
•
20
vinhnx90/gemma-3-1b-thinking-v2-mlx-6Bit
Text Generation
•
Updated
Mar 22
•
4
Upvote
-
Share collection
View history
Collection guide
Browse collections