feihu.hf committed · Commit 8728e66
1 Parent(s): 343ce47

remove q2_k; q3_k_m; q4_0

Files changed:
- README.md (+1 -1)
- qwq-32b-q2_k.gguf (+0 -3)
- qwq-32b-q3_k_m.gguf (+0 -3)
- qwq-32b-q4_0.gguf (+0 -3)
README.md CHANGED
@@ -32,7 +32,7 @@ QwQ is the reasoning model of the Qwen series. Compared with conventional instru
 - Number of Layers: 64
 - Number of Attention Heads (GQA): 40 for Q and 8 for KV
 - Context Length: Full 131,072 tokens
-- Quantization:
+- Quantization: q4_K_M, q5_0, q5_K_M, q6_K, q8_0

 **Note:** For the best experience, please review the [usage guidelines](#usage-guidelines) before deploying QwQ models.
qwq-32b-q2_k.gguf DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:4146aaf6dabff8452c4c02323098898b3b59af78c0bb41c90827342ca29dcd3d
-size 12313098400
qwq-32b-q3_k_m.gguf DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:5c8abb3b760e67374f22d099f7631b16cc1c109961795fb2329a7b3b72171e54
-size 15935047840
qwq-32b-q4_0.gguf DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:8e3a6c38a4a54a60493cc1f501722bd6fb07e758d370b9b34c22d0181c7e31d9
-size 18640230560
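The deleted files above are not the model weights themselves but Git LFS pointer files: three `key value` lines (`version`, `oid`, `size`) per the git-lfs pointer spec, where `oid` is the SHA-256 of the real blob and `size` is its byte count. A minimal sketch of parsing such a pointer (the helper name `parse_lfs_pointer` is illustrative, not part of any library):

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its version, hash, and size fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", split on the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # "oid" is "<algo>:<hex digest>", e.g. "sha256:4146aa...".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size_bytes": int(fields["size"]),  # size of the actual blob, not the pointer
    }

# Pointer contents of the deleted qwq-32b-q2_k.gguf, copied from the diff above.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:4146aaf6dabff8452c4c02323098898b3b59af78c0bb41c90827342ca29dcd3d
size 12313098400
"""
info = parse_lfs_pointer(pointer)
print(info["hash_algo"], info["size_bytes"])  # sha256 12313098400
```

This is why each deletion shows only `+0 -3` in the diff: removing a roughly 12 GB quantized GGUF from the repository only removes its 3-line pointer.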