A quick test using M1 Max (64G) and Word
#16 opened 1 day ago by gptlocalhost

Awesome model! Can we get a version with a larger context window?
#15 opened 1 day ago by seall0

Fix template when add_generation_prompt=true
#14 opened 4 days ago by matteogeniaccio

It supports the Serbo-Croatian language very well!
#13 opened 5 days ago by JLouisBiz

GPTQ or AWQ Quants
#12 opened 5 days ago by guialfaro

Great job, thanks for this model.
#11 opened 6 days ago by Dampfinchen

Recommended sampling parameters?
#10 opened 8 days ago by AaronFeng753

Can we have some more popular benchmarks?
#8 opened 9 days ago by rombodawg

The model is the best for coding.
#7 opened 12 days ago by AekDevDev

When running with a single GPU, I get an error saying the VRAM is insufficient; when using multiple GPUs on a single machine, I get many other errors. My vLLM version is 0.8.4.
#6 opened 12 days ago by hanson888

BitsAndBytes quantization inference error
#5 opened 12 days ago by chengfy

Bug when using function calling with vllm==0.8.4
#4 opened 13 days ago by waple

SimpleQA Scores Are WAY off
#3 opened 14 days ago by phil111

Need FP8 version for interface
#2 opened 15 days ago by iwaitu

RuntimeError: CUDA error: device-side assert triggered
#1 opened 15 days ago by DsnTgr