Upload folder using huggingface_hub
README.md CHANGED
@@ -20,9 +20,9 @@ A collection of optimized GGUF quantized models derived from [bloomvn-0.5b-ppo](
 
 | Variant | Use Case | Download |
 |---------|-----------|------------|
-| base | This
-| q2_k | The 2-bit quantized
-| q3_k_m | The 3-bit quantized
+| base | This FP16 format model is ideal for applications where high accuracy is crucial and storage space is not a concern, such as in data centers or high-performance computing environments. | [📥](https://huggingface.co/Vuanhngo11/bloomvn-0.5b-ppo-gguf/resolve/main/base.gguf)
+| q2_k | The 2-bit quantized model is suitable for extremely constrained environments, such as low-power edge devices or those with very limited memory, where a balance between size and performance is necessary. | [📥](https://huggingface.co/Vuanhngo11/bloomvn-0.5b-ppo-gguf/resolve/main/q2_k.gguf)
+| q3_k_m | The 3-bit quantized model offers a good balance between compression and performance, making it suitable for memory-limited devices that still require a reasonable level of accuracy, such as mid-range smartphones or embedded systems. | [📥](https://huggingface.co/Vuanhngo11/bloomvn-0.5b-ppo-gguf/resolve/main/q3_k_m.gguf)
 
 ## 🤝 Contributors
 
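The commit message references huggingface_hub, and the new table rows point at files in the Vuanhngo11/bloomvn-0.5b-ppo-gguf repo. As a minimal sketch (not part of the commit itself), one of the listed variants could be fetched with that library's `hf_hub_download` helper; the `repo_id` and `filename` below are taken from the table's download URLs:

```python
from huggingface_hub import hf_hub_download

# Download the 2-bit quantized variant listed in the table above.
# repo_id and filename are derived from the table's download URLs.
gguf_path = hf_hub_download(
    repo_id="Vuanhngo11/bloomvn-0.5b-ppo-gguf",
    filename="q2_k.gguf",
)
print(gguf_path)  # local cache path of the downloaded GGUF file
```

Swapping `filename` for `base.gguf` or `q3_k_m.gguf` fetches the other variants described in the table.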