Qwen3 GGUF Models
Collection
LlamaEdge compatible quants for Qwen3 models.
โข
7 items
โข
Updated
second-state/Qwen3-30B-A3B-GGUF
LlamaEdge version: v0.17.0 and above
Prompt template
chatml
Context size: 128000
Run as LlamaEdge service
wasmedge --dir .:. --nn-preload default:GGML:AUTO:Qwen3-30B-A3B-Q5_K_M.gguf \
llama-api-server.wasm \
--model-name Qwen3-30B-A3B \
--prompt-template chatml \
--ctx-size 128000
Quantized with llama.cpp b5097
2-bit
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit
Unable to build the model tree, the base model loops to the model itself. Learn more.