VPTQ-community/deepseek-r1_v8_k_65536_mp4

yangwang92 committed 8 days ago

Commit 8356297 (verified) · 1 Parent(s): 6083418

Create README.md

Files changed (1)

README.md +24 -0

README.md ADDED

	@@ -0,0 +1,24 @@

+---
+license: mit
+base_model:
+- deepseek-ai/DeepSeek-R1
+base_model_relation: quantized
+tags:
+- VPTQ
+- Quantized
+- Quantization
+---
+**Disclaimer**:
+This model was reproduced following the paper *VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models* ([GitHub](https://github.com/microsoft/vptq), [arXiv](https://arxiv.org/abs/2409.17066)).
+The model itself is sourced from a community release.
+It is intended only for experimental purposes.
+Users are responsible for any consequences arising from the use of this model.
+The model is resharded for 4 GPUs.
+```