LeanQuant commited on
Commit
53f2587
·
verified ·
1 Parent(s): 444ac37

Add files using upload-large-folder tool

Browse files
Files changed (2) hide show
  1. README.md +1 -1
  2. lm_head.safetensors +2 -2
README.md CHANGED
@@ -10,7 +10,7 @@ tags:
10
 
11
  ## DFloat11 Compressed Model: `mistralai/Mistral-Nemo-Instruct-2407`
12
 
13
- This is a **losslessly compressed** version of [`mistralai/Mistral-Nemo-Instruct-2407`](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) using our custom **DFloat11** format. The model size is reduced from **24.50GB to 16.14GB**. The outputs of this compressed model are **bit-for-bit identical** to the original BFloat16 model, while reducing GPU memory consumption by approximately **30%**.
14
 
15
  ### 🔍 How It Works
16
 
 
10
 
11
  ## DFloat11 Compressed Model: `mistralai/Mistral-Nemo-Instruct-2407`
12
 
13
+ This is a **losslessly compressed** version of [`mistralai/Mistral-Nemo-Instruct-2407`](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) using our custom **DFloat11** format. The outputs of this compressed model are **bit-for-bit identical** to the original BFloat16 model, while reducing GPU memory consumption by approximately **30%**.
14
 
15
  ### 🔍 How It Works
16
 
lm_head.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:aebf578c11bf369deca610b915b0645374465e8652a1b19026a5e7880f5c2377
3
- size 454258018
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:23ce2116ad823a9e01aa90f6dc3e9d996ad33066af19fa2115054d7e73b12b9e
3
+ size 908735989