Quantizer: Running into an error with quantization "TypeError: 'dict' object is not callable"
3
#24 opened 13 days ago
by
AaronVogler

Support for FP8 + Fused MoE layers in vLLM?
2
#23 opened 20 days ago
by
szlevi
is it w8a16 or w8a8?
1
#19 opened 22 days ago
by
ehartford

[request for feedback] faster downloads with xet
#18 opened 24 days ago
by
clem
