dahara1
/

unsloth-QwQ-32B-gguf-japanese-imatrix

Model card Files Files and versions Community

dahara1 commited on Mar 14

Commit

47e0829

·

verified ·

1 Parent(s): 878b995

Update README.md

Files changed (1) hide show

README.md +13 -17

README.md CHANGED Viewed

@@ -15,20 +15,7 @@ language:
 This gguf version aims to create a gguf with [improved quantization parameters](https://huggingface.co/unsloth/QwQ-32B) and [improved Japanese language capabilities](https://huggingface.co/dahara1/imatrix-jpn-test).
 Details are under verification.
-### currnet sample parameters.
-```
-temperature = 0.6
-top-k = 40 (20 to 40 suggested)
-min-p = 0.00 (optional, but 0.01 works well, llama.cpp default is 0.1)
-top-p = 0.95
-repetition-penalty = 1.0
-dry-multiplier 0.5
-Chat template: <|im_start|>user\nCreate a Flappy Bird game in Python.<|im_end|>\n<|im_start|>assistant\n<think>\n
-```
-Reference information
-[Tutorial: How to Run QwQ-32B effectively](https://docs.unsloth.ai/basics/tutorial-how-to-run-qwq-32b-effectively#tutorial-how-to-run-qwq-32b)
 おそらくllama-serverの仕様が変わったためか、私はllama-server経由で正常に動かす事がまだできていません（繰り返しが発生してしまう)
 Probably because the llama-server specifications have changed, I have not yet been able to get it to work properly via llama-server (repetition occurs).
@@ -166,8 +153,17 @@ llama_perf_context_print:       total time =  864163.89 ms /  1681 tokens
 ```

 This gguf version aims to create a gguf with [improved quantization parameters](https://huggingface.co/unsloth/QwQ-32B) and [improved Japanese language capabilities](https://huggingface.co/dahara1/imatrix-jpn-test).
 Details are under verification.
+## example
 おそらくllama-serverの仕様が変わったためか、私はllama-server経由で正常に動かす事がまだできていません（繰り返しが発生してしまう)
 Probably because the llama-server specifications have changed, I have not yet been able to get it to work properly via llama-server (repetition occurs).
 ```
+## currnet sample parameters.
+```
+temperature = 0.6
+top-k = 40 (20 to 40 suggested)
+min-p = 0.00 (optional, but 0.01 works well, llama.cpp default is 0.1)
+top-p = 0.95
+repetition-penalty = 1.0
+dry-multiplier 0.5
+Chat template: <|im_start|>user\nCreate a Flappy Bird game in Python.<|im_end|>\n<|im_start|>assistant\n<think>\n
+```
+Reference information
+[Tutorial: How to Run QwQ-32B effectively](https://docs.unsloth.ai/basics/tutorial-how-to-run-qwq-32b-effectively#tutorial-how-to-run-qwq-32b)