Difference from Qwen

#2
by hbbOooO - opened

I hit the following error when deploying Qwen/Qwen2.5-VL-72B-Instruct-AWQ with vLLM:

ValueError: The input size is not aligned with the quantized weight shape. This can be caused by too large tensor parallel size.
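As far as I understand it, the error means that once a layer's input dimension is split across tensor-parallel ranks, each shard no longer holds a whole number of quantization groups. Here is a minimal sketch of that check. The helper `is_aligned` is hypothetical, not vLLM's API, and the sizes (input size 29568, padded size 29696, group size 128) are assumptions for illustration, not values I read from either model's config.

```python
def is_aligned(input_size: int, tp_size: int, group_size: int) -> bool:
    """Return True if each tensor-parallel shard of the input dimension
    contains a whole number of quantization groups."""
    if input_size % tp_size != 0:
        # The dimension cannot even be split evenly across ranks.
        return False
    per_rank = input_size // tp_size
    return per_rank % group_size == 0


# Assumed, illustrative values (not taken from the model configs).
ORIGINAL_INPUT_SIZE = 29568
PADDED_INPUT_SIZE = 29696
GROUP_SIZE = 128

for tp in (1, 2, 4, 8):
    print(
        f"tp={tp}: original aligned={is_aligned(ORIGINAL_INPUT_SIZE, tp, GROUP_SIZE)}, "
        f"padded aligned={is_aligned(PADDED_INPUT_SIZE, tp, GROUP_SIZE)}"
    )
```

Under these assumed numbers, only a tensor parallel size of 1 passes for the unpadded size, while the padded size passes for every tensor parallel size in the loop. That would be consistent with the error message pointing at the tensor parallel size.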

This issue shows that the model you provided solves the problem.

Thank you for providing the model; it let me deploy successfully. What are the differences between your model and Qwen/Qwen2.5-VL-72B-Instruct-AWQ?

Me too.
