TomoDG committed (verified) · Commit a044639 · 1 parent: de3d10e

Upload folder using huggingface_hub
.gitattributes CHANGED
@@ -33,3 +33,8 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ EtherealAurora-MN-Nemo-12B-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+ EtherealAurora-MN-Nemo-12B-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+ EtherealAurora-MN-Nemo-12B-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+ EtherealAurora-MN-Nemo-12B-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+ EtherealAurora-MN-Nemo-12B-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
EtherealAurora-MN-Nemo-12B-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:bb8aafb8835cbcfe75d39825e1600c6078fd65da7cdac9bae2d9d03c3c6a3b81
+ size 7477203872
EtherealAurora-MN-Nemo-12B-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:6ea9661dee4490ee5209881d9eb061eaab77d4b0921e105ed7a6e0691d8baf1c
+ size 7120196512
EtherealAurora-MN-Nemo-12B-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b4e93653ab9ff3e3747ddc64ad642064a2d1467fbcb667dfada4ac009aaa1cd4
+ size 8727630752
EtherealAurora-MN-Nemo-12B-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:53b9410a628c809314efb17c0aaf2ec3f37762a9e157d01f1f85c3022bff3dfb
+ size 10056209312
EtherealAurora-MN-Nemo-12B-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:1e4718741ffdc82ae054797a947f4355325a80bd8903f49fd7f2f42307ca312e
+ size 13022368672
README.me ADDED
@@ -0,0 +1,49 @@
+ ---
+ license: apache-2.0  # Or the license of the original model
+ language: en
+ library_name: llama.cpp  # Primarily for the llama.cpp ecosystem
+ tags:
+ - gguf
+ - quantized
+ - merge
+ - mergekit
+ - ties
+ - 12b
+ - text-generation
+ - etherealaurora
+ - mn-mag-mell
+ - nemomix
+ - chat
+ - roleplay
+ pipeline_tag: text-generation
+ base_model: TomoDG/EtherealAurora-MN-Nemo-12B
+ model_type: llama  # Or the appropriate architecture type
+ ---
+
+ # GGUF Quantized Models for TomoDG/EtherealAurora-MN-Nemo-12B
+
+ This repository contains GGUF-format model files for [TomoDG/EtherealAurora-MN-Nemo-12B](https://huggingface.co/TomoDG/EtherealAurora-MN-Nemo-12B).
+
+ These files were quantized using [llama.cpp](https://github.com/ggerganov/llama.cpp).
+
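+ To fetch a single quant without cloning the whole repository, one option is `huggingface_hub`. A minimal download sketch; the `repo_id` below is an assumption, so point it at wherever these files are hosted:
+
+ ```python
+ # Minimal download sketch (pip install huggingface_hub).
+ # The repo_id is an assumption; adjust it to this repository's actual id.
+ from huggingface_hub import hf_hub_download
+
+ model_path = hf_hub_download(
+     repo_id="TomoDG/EtherealAurora-MN-Nemo-12B-GGUF",  # assumed id of this GGUF repo
+     filename="EtherealAurora-MN-Nemo-12B-Q4_K_M.gguf",
+ )
+ print(model_path)  # local cache path of the downloaded GGUF file
+ ```
+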
+ ## Original Model Card
+
+ For details on the merge process, methodology, and intended use, please refer to the original model card:
+ [**TomoDG/EtherealAurora-MN-Nemo-12B**](https://huggingface.co/TomoDG/EtherealAurora-MN-Nemo-12B)
+
+ ## Available Quantizations
+
+ | File Name | Quantization Type | Size (approx.) | Recommended RAM | Use Case |
+ | :------------------------------------------ | :---------------- | :------------- | :-------------- | :------------------------------------------- |
+ | `EtherealAurora-MN-Nemo-12B-Q4_K_S.gguf` | Q4_K_S | ~7.12 GB | 9 GB+ | Smallest 4-bit K-quant, lowest RAM usage |
+ | `EtherealAurora-MN-Nemo-12B-Q4_K_M.gguf` | Q4_K_M | ~7.48 GB | 10 GB+ | Good quality/performance balance, medium RAM |
+ | `EtherealAurora-MN-Nemo-12B-Q5_K_M.gguf` | Q5_K_M | ~8.73 GB | 12 GB+ | Higher quality, higher RAM usage |
+ | `EtherealAurora-MN-Nemo-12B-Q6_K.gguf` | Q6_K | ~10.1 GB | 13 GB+ | Very high quality, close to FP16 |
+ | `EtherealAurora-MN-Nemo-12B-Q8_0.gguf` | Q8_0 | ~13.0 GB | 16 GB+ | Highest-quality GGUF quant, largest size |
+
+ **General Recommendations:**
+ * **`_K_M` quants (like Q4_K_M, Q5_K_M):** Generally recommended for a good balance of quality and resource usage.
+ * **`Q6_K`:** Offers quality close to FP16 if you have sufficient RAM.
+ * **`Q8_0`:** The highest-quality GGUF quantization, but requires the most resources.
+
+ A minimal loading example is shown below.
+
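+ ## Example Usage
+
+ A minimal inference sketch using the [llama-cpp-python](https://github.com/abetlen/llama-cpp-python) bindings; any quant from the table above works, and `n_ctx`, `n_gpu_layers`, and `max_tokens` here are placeholder values to adjust for your hardware:
+
+ ```python
+ # Minimal inference sketch (pip install llama-cpp-python).
+ # Settings below are illustrative placeholders, not tuned values.
+ from llama_cpp import Llama
+
+ llm = Llama(
+     model_path="EtherealAurora-MN-Nemo-12B-Q4_K_M.gguf",  # any quant from the table above
+     n_ctx=4096,       # placeholder context window
+     n_gpu_layers=-1,  # offload all layers if built with GPU support; use 0 for CPU-only
+ )
+
+ out = llm.create_chat_completion(
+     messages=[{"role": "user", "content": "Introduce yourself in one sentence."}],
+     max_tokens=128,
+ )
+ print(out["choices"][0]["message"]["content"])
+ ```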