Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
vinhnx90 's Collections
Orpheus TTS Fine Tune
Phi GRPO Fine Tuning
Qwen GRPO Fine Tuning
Gemma 3 GRPO Fine Tuning
Models
Datasets
Spaces
Research Papers

Gemma 3 GRPO Fine Tuning

updated Mar 22

My collecions of Gemma 3 1B RL fine-tuning using GPRO technique.

Upvote
-

  • vinhnx90/gemma-3-1b-thinking-v2

    Text Generation • Updated Mar 22 • 6 • 1

  • vinhnx90/gemma-3-1b-thinking-v2-mlx-4Bit

    Text Generation • Updated Mar 22 • 14 • 1

  • vinhnx90/gemma3-1b-thinking

    Updated Mar 22 • 5

  • vinhnx90/gemma-3-1b-thinking-v2-base-mlx-8Bit

    Text Generation • Updated Mar 22 • 9 • 1

  • vinhnx90/gemma-3-1b-thinking-v2-Q8_0-GGUF

    Updated Mar 22 • 35 • 1

  • vinhnx90/gemma-3-1b-thinking-v2-Q4_K_M-GGUF

    Updated Mar 22 • 64 • 3

  • vinhnx90/gemma-3-1b-thinking-v2-Q6_K-GGUF

    Updated Mar 22 • 18

  • vinhnx90/gemma-3-1b-thinking-v2-Q5_K_M-GGUF

    Updated Mar 22 • 20

  • vinhnx90/gemma-3-1b-thinking-v2-mlx-6Bit

    Text Generation • Updated Mar 22 • 4
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs