Thanks a lot for this release
#19 opened about 9 hours ago
by
Volko76

Does anyone feel Qwen3 often fails to follow instructions accurately?
5
2
#18 opened about 13 hours ago
by
DOFOFFICIAL

Two of the base models are missing
1
#17 opened about 19 hours ago
by
ZhangRC

Qwen is loosing broad knowledge since Qwen2.
4
9
#16 opened about 20 hours ago
by
phil111
GPQA perf for DSV3-Base seems wrong
1
1
#15 opened about 23 hours ago
by
AChen-qaq
About the non-thinking mode
2
#14 opened 1 day ago
by
volcanos

235B会放出来Base模型吗?
4
#12 opened 1 day ago
by
Yantao2009
看模型介绍和模型结构里面没有关于vision encoder的部分,但是在qwen的在线模型服务界面可以用这个模型去看图片,想问下视觉部分是复用了哪个vision encoder呢?
4
#11 opened 1 day ago
by
Chloez

有用4张H20实践过的大佬吗
1
#10 opened 1 day ago
by
Edison0902
8张80G显存的8卡A100能部署不?
2
#9 opened 1 day ago
by
Yuxin362
User rating and reviews of Qwen3 App and Qwen3 Model
#8 opened 1 day ago
by
DeepNLP
是不是奖励函数没有ngram重复度惩罚
#7 opened 1 day ago
by
wzx111
🚀[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practices👋
5
1
#6 opened 1 day ago
by
study-hjt

【Evaluation】Best practice for evaluating Qwen3 !!
4
#5 opened 1 day ago
by
wangxingjun778

Please upload the base model for this one
4
#4 opened 1 day ago
by
mesh-ops

GPTQ/AWQ
12
1
#3 opened 1 day ago
by
ndurkee
Add languages tag
#2 opened 1 day ago
by
de-francophones

fix: use tp 8 for SGLang
#1 opened 1 day ago
by
zhyncs
