unsloth
/

Phi-4-mini-instruct-unsloth-bnb-4bit

Text Generation

text-generation-inference

4-bit precision

Model card Files Files and versions

danielhanchen commited on Feb 28

Commit

9dea888

·

verified ·

1 Parent(s): e34d5c3

Update README.md

Files changed (1) hide show

README.md +6 -1

README.md CHANGED Viewed

@@ -40,7 +40,12 @@ library_name: transformers
 We have a free Google Colab notebook for turning Phi-4 into a reasoning model: https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Phi_4_(14B)-GRPO.ipynb
 ## ✨ Finetune for Free
 All notebooks are **beginner friendly**! Add your dataset, click "Run All", and you'll get a 2x faster finetuned model which can be exported to GGUF, vLLM or uploaded to Hugging Face.

 We have a free Google Colab notebook for turning Phi-4 into a reasoning model: https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Phi_4_(14B)-GRPO.ipynb
+### Unsloth bug fixes:
+1. Padding and EOS tokens are the same - fixed this.
+2. Chat template had extra EOS token - removed this. Otherwise you will be <|end|> during inference.
+3. EOS token should be <|end|> not <|endoftext|>. Otherwise it'll terminate at <|endoftext|>
+4. Changed unk_token to � from EOS.
+5.
 ## ✨ Finetune for Free
 All notebooks are **beginner friendly**! Add your dataset, click "Run All", and you'll get a 2x faster finetuned model which can be exported to GGUF, vLLM or uploaded to Hugging Face.