Thanks for your efforts! Want to inquire the quantizing scripts and inference code.
#2 opened 6 days ago
by
listen2you
Can this FP8 model be deployed on 4090? How is the speed?
1
#1 opened 6 days ago
by
yoolv