DFloat11
/

Mistral-Nemo-Instruct-2407-DF11

lossless compression

70% size, 100% accuracy

Model card Files Files and versions Community

LeanQuant commited on 13 days ago

Commit

53f2587

·

verified ·

1 Parent(s): 444ac37

Add files using upload-large-folder tool

Files changed (2) hide show

README.md +1 -1
lm_head.safetensors +2 -2

README.md CHANGED Viewed

@@ -10,7 +10,7 @@ tags:
 ## DFloat11 Compressed Model: `mistralai/Mistral-Nemo-Instruct-2407`
-This is a **losslessly compressed** version of [`mistralai/Mistral-Nemo-Instruct-2407`](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) using our custom **DFloat11** format. The model size is reduced from **24.50GB to 16.14GB**. The outputs of this compressed model are **bit-for-bit identical** to the original BFloat16 model, while reducing GPU memory consumption by approximately **30%**.
 ### 🔍 How It Works

 ## DFloat11 Compressed Model: `mistralai/Mistral-Nemo-Instruct-2407`
+This is a **losslessly compressed** version of [`mistralai/Mistral-Nemo-Instruct-2407`](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) using our custom **DFloat11** format. The outputs of this compressed model are **bit-for-bit identical** to the original BFloat16 model, while reducing GPU memory consumption by approximately **30%**.
 ### 🔍 How It Works

lm_head.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:aebf578c11bf369deca610b915b0645374465e8652a1b19026a5e7880f5c2377
-size 454258018

 version https://git-lfs.github.com/spec/v1
+oid sha256:23ce2116ad823a9e01aa90f6dc3e9d996ad33066af19fa2115054d7e73b12b9e
+size 908735989