unsloth
/

Qwen3-235B-A22B-GGUF

Text Generation

Model card Files Files and versions

Resources

View closed (0)

how to disable <think> with llama.cpp

#9 opened about 14 hours ago by

It seems like model have serious repetition issues (both gguf and on openrouter)

#8 opened about 17 hours ago by

[Qwen3-235B-A22B-UD-Q4_K_XL.gguf] UD Quant seems to be invalid.

#7 opened about 17 hours ago by

Test on 3090 + Tesla P40 (48gb vram total) + 64gb ram (Q2K)

#6 opened about 18 hours ago by

Ud quants please🥺

#5 opened about 20 hours ago by

ValueError: Cannot use chat template functions because tokenizer.chat_template is not set and no template argument was passed!

#4 opened about 20 hours ago by

Do the Q4 quants work? On the 30b moe it says not to use them.

#3 opened 1 day ago by

UD quants missing some files

#2 opened 1 day ago by

MLDataScientist

Add languages tag

#1 opened 1 day ago by

de-francophones