Bnb breaks the function calls
#5
by
Rexschwert
- opened
The model became unable to correctly call functions after quantization. I also tried default bnb-4bit and its still generate something like this:
content": "\n addCriterion\n\n\n\n addCriterion\n\n\n\n",
Only model with original weights (bf16) can handle tools.
I tested it on docker vllm