gaianet
/

Qwen3-8B-GGUF

Model card Files Files and versions Community

Qwen3-8B-GGUF

Original Model

Run with Gaianet

Prompt template

prompt template: chatml

Context size

chat_ctx_size: 128000

Run with GaiaNet

Quick start: https://docs.gaianet.ai/node-guide/quick-start
Customize your node: https://docs.gaianet.ai/node-guide/customize

Quantized with llama.cpp b5097

Downloads last month: 200

GGUF

Model size

8.19B params

Architecture

qwen3

Hardware compatibility

Log In to view the estimation

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for gaianet/Qwen3-8B-GGUF

Base model

Qwen/Qwen3-8B-Base

Finetuned

Quantized

(52)

this model

Collection including gaianet/Qwen3-8B-GGUF

Qwen3

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. • 7 items • Updated about 4 hours ago • 2