--- base_model: - unsloth/Qwen2.5-3B-Instruct-unsloth-bnb-4bit tags: - text-generation-inference - unsloth - qwen2 - trl license: apache-2.0 language: - en datasets: - openai/gsm8k pipeline_tag: text-generation library_name: peft --- # Uploaded model - **Developed by:** nomadicsynth - **License:** apache-2.0 - **Finetuned from model:** unsloth/Qwen2.5-3B-Instruct-unsloth-bnb-4bit - **Training Notebook:** [Qwen2.5_(3B)-GRPO.ipynb](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen2.5_(3B)-GRPO.ipynb) This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library. [](https://github.com/unslothai/unsloth)