how to disable <think> with llama.cpp
3
#9 opened about 14 hours ago
by
bobchenyx
It seems like model have serious repetition issues (both gguf and on openrouter)
4
#8 opened about 17 hours ago
by
roadtoagi

[Qwen3-235B-A22B-UD-Q4_K_XL.gguf] UD Quant seems to be invalid.
2
#7 opened about 17 hours ago
by
XelotX
Test on 3090 + Tesla P40 (48gb vram total) + 64gb ram (Q2K)
1
#6 opened about 18 hours ago
by
roadtoagi

Ud quants please🥺
2
#5 opened about 20 hours ago
by
Ainonake
ValueError: Cannot use chat template functions because tokenizer.chat_template is not set and no template argument was passed!
1
#4 opened about 20 hours ago
by
shakhizat
Do the Q4 quants work? On the 30b moe it says not to use them.
2
#3 opened 1 day ago
by
Lockout

UD quants missing some files
3
6
#2 opened 1 day ago
by
MLDataScientist
Add languages tag
#1 opened 1 day ago
by
de-francophones
