Add files using upload-large-folder tool
This view is limited to 50 files because it contains too many changes.
- .gitattributes +1 -0
- README.md +58 -0
- added_tokens.json +28 -0
- config.json +50 -0
- generation_config.json +13 -0
- lm_head.safetensors +3 -0
- merges.txt +0 -0
- model.safetensors +3 -0
- model_embed_tokens.safetensors +3 -0
- model_layers_0.safetensors +3 -0
- model_layers_1.safetensors +3 -0
- model_layers_10.safetensors +3 -0
- model_layers_11.safetensors +3 -0
- model_layers_12.safetensors +3 -0
- model_layers_13.safetensors +3 -0
- model_layers_14.safetensors +3 -0
- model_layers_15.safetensors +3 -0
- model_layers_16.safetensors +3 -0
- model_layers_17.safetensors +3 -0
- model_layers_18.safetensors +3 -0
- model_layers_19.safetensors +3 -0
- model_layers_2.safetensors +3 -0
- model_layers_20.safetensors +3 -0
- model_layers_21.safetensors +3 -0
- model_layers_22.safetensors +3 -0
- model_layers_23.safetensors +3 -0
- model_layers_24.safetensors +3 -0
- model_layers_25.safetensors +3 -0
- model_layers_26.safetensors +3 -0
- model_layers_27.safetensors +3 -0
- model_layers_28.safetensors +3 -0
- model_layers_29.safetensors +3 -0
- model_layers_3.safetensors +3 -0
- model_layers_30.safetensors +3 -0
- model_layers_31.safetensors +3 -0
- model_layers_32.safetensors +3 -0
- model_layers_33.safetensors +3 -0
- model_layers_34.safetensors +3 -0
- model_layers_35.safetensors +3 -0
- model_layers_36.safetensors +3 -0
- model_layers_37.safetensors +3 -0
- model_layers_38.safetensors +3 -0
- model_layers_39.safetensors +3 -0
- model_layers_4.safetensors +3 -0
- model_layers_5.safetensors +3 -0
- model_layers_6.safetensors +3 -0
- model_layers_7.safetensors +3 -0
- model_layers_8.safetensors +3 -0
- model_layers_9.safetensors +3 -0
- special_tokens_map.json +31 -0
.gitattributes
CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+tokenizer.json filter=lfs diff=lfs merge=lfs -text
README.md
ADDED
@@ -0,0 +1,58 @@
## DFloat11 Compressed Model: `Qwen/Qwen3-14B`

This is a **losslessly compressed** version of [`Qwen/Qwen3-14B`](https://huggingface.co/Qwen/Qwen3-14B) using our custom **DFloat11** format. The compressed model produces outputs that are **bit-for-bit identical** to those of the original BFloat16 model while reducing GPU memory consumption by approximately **30%**.

### 🔍 How It Works

DFloat11 compresses model weights using **Huffman coding** of BFloat16 exponent bits, combined with **hardware-aware algorithmic designs** that enable efficient on-the-fly decompression directly on the GPU. During inference, the weights remain compressed in GPU memory and are **decompressed just before matrix multiplications**, then **immediately discarded after use** to minimize memory footprint.
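
The exponent-only Huffman idea can be illustrated with a small standard-library sketch. It is not the DFloat11 encoder: the synthetic Gaussian weights, the truncating `bf16_bits` conversion, and all helper names are assumptions made for illustration, and the sketch only estimates code lengths rather than producing a bitstream. It also assumes that the sign bit and the 7 mantissa bits are stored unchanged.

```python
import heapq
import random
import struct
from collections import Counter


def bf16_bits(x: float) -> int:
    """Upper 16 bits of the float32 encoding (truncation; real converters usually round)."""
    return struct.unpack(">I", struct.pack(">f", x))[0] >> 16


def exponent(bits: int) -> int:
    """BFloat16 layout: 1 sign bit, 8 exponent bits, 7 mantissa bits."""
    return (bits >> 7) & 0xFF


def huffman_code_lengths(freqs):
    """Return {symbol: code length in bits} for a Huffman code over `freqs`."""
    if len(freqs) == 1:
        return {next(iter(freqs)): 1}
    # Heap entries: (count, unique tie-breaker, symbols in this subtree).
    heap = [(count, i, [sym]) for i, (sym, count) in enumerate(freqs.items())]
    heapq.heapify(heap)
    lengths = dict.fromkeys(freqs, 0)
    tie = len(heap)
    while len(heap) > 1:
        c1, _, s1 = heapq.heappop(heap)
        c2, _, s2 = heapq.heappop(heap)
        for sym in s1 + s2:
            lengths[sym] += 1  # every merge pushes these symbols one level deeper
        heapq.heappush(heap, (c1 + c2, tie, s1 + s2))
        tie += 1
    return lengths


random.seed(0)
# Synthetic stand-in for a weight matrix; real model weights will differ.
weights = [random.gauss(0.0, 0.02) for _ in range(50_000)]
freqs = Counter(exponent(bf16_bits(w)) for w in weights)
lengths = huffman_code_lengths(freqs)

avg_exp_bits = sum(freqs[s] * lengths[s] for s in freqs) / len(weights)
bits_per_weight = 1 + 7 + avg_exp_bits  # sign + mantissa kept raw, exponent Huffman-coded
print(f"average exponent code length: {avg_exp_bits:.2f} bits")
print(f"estimated bits per weight:    {bits_per_weight:.2f} (BF16 uses 16)")
```

Because only a small number of exponent values occur frequently in such weights, the average exponent code comes out well below the fixed 8 bits, which is where the saving comes from.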

Key benefits:

* **No CPU decompression or host-device data transfer** -- all operations are handled entirely on the GPU.
* **Decompression overhead is constant** per forward pass and **independent of batch size**, making DFloat11 increasingly efficient at larger batch sizes.
* DFloat11 is **much faster than CPU-offloading approaches**, enabling practical deployment in memory-constrained environments.
* At **batch size = 1**, inference is approximately **2× slower** than the original BF16 model, but the performance gap **narrows significantly** with larger batches.
* The compression is **fully lossless**, guaranteeing that the model’s outputs are **bit-for-bit identical** to those of the original model.
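
The "decompress just before use, then discard" pattern can be sketched on the CPU with a toy layer. This is only an analogy under stated assumptions: `zlib` stands in for the DFloat11 bitstream, the `CompressedLinear` class is hypothetical, and the real implementation runs as a CUDA kernel on BFloat16 weights rather than in Python on float32 values.

```python
import struct
import zlib


class CompressedLinear:
    """Toy layer that stores weights compressed and expands them only inside forward()."""

    def __init__(self, weight_rows):
        self.rows = len(weight_rows)
        self.cols = len(weight_rows[0])
        flat = [w for row in weight_rows for w in row]
        raw = struct.pack(f"<{len(flat)}f", *flat)
        self.blob = zlib.compress(raw)  # lossless stand-in for the compressed weights

    def forward(self, x):
        # Decompress right before the matrix-vector product.
        raw = zlib.decompress(self.blob)
        flat = struct.unpack(f"<{self.rows * self.cols}f", raw)
        out = [
            sum(flat[r * self.cols + c] * x[c] for c in range(self.cols))
            for r in range(self.rows)
        ]
        # Drop the temporary full-size copy; only the compressed blob stays resident.
        del flat, raw
        return out


layer = CompressedLinear([[0.5, -1.0, 2.0], [0.25, 0.0, -0.5]])
result = layer.forward([1.0, 2.0, 3.0])
print(result)
```

The decompression cost in `forward` does not depend on how many input vectors are processed afterwards, which mirrors the constant per-forward-pass overhead described in the list above.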

### 🔧 How to Use

1. Install the DFloat11 pip package *(installs the CUDA kernel automatically; requires a CUDA-compatible GPU and PyTorch installed)*:

    ```bash
    pip install dfloat11[cuda12]
    # or if you have CUDA version 11:
    # pip install dfloat11[cuda11]
    ```

2. To use the DFloat11 model, run the following example code in Python:

    ```python
    import torch
    from dfloat11 import DFloat11Model
    from transformers import AutoTokenizer

    model_id = "DFloat11/Qwen3-14B-DF11"

    # Load the DFloat11-compressed weights onto the available GPU(s).
    model = DFloat11Model.from_pretrained(model_id, device_map="auto")

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # Use the EOS token as the padding token.
    tokenizer.pad_token = tokenizer.eos_token

    prompt = "Question: What is a binary tree and its applications? Answer:"
    inputs = tokenizer(prompt, return_tensors="pt", padding=True).to(model.device)

    # Inference only, so gradient tracking is disabled.
    with torch.no_grad():
        output = model.generate(
            **inputs,
            max_new_tokens=256,
            do_sample=True,
        )

    print(tokenizer.batch_decode(output, skip_special_tokens=True))
    ```

### 📄 Learn More

* **Paper**: [70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float](https://arxiv.org/abs/2504.11651)
* **GitHub**: [https://github.com/LeanModels/DFloat11](https://github.com/LeanModels/DFloat11)
* **HuggingFace**: [https://huggingface.co/DFloat11](https://huggingface.co/DFloat11)
added_tokens.json
ADDED
@@ -0,0 +1,28 @@
{
  "</think>": 151668,
  "</tool_call>": 151658,
  "</tool_response>": 151666,
  "<think>": 151667,
  "<tool_call>": 151657,
  "<tool_response>": 151665,
  "<|box_end|>": 151649,
  "<|box_start|>": 151648,
  "<|endoftext|>": 151643,
  "<|file_sep|>": 151664,
  "<|fim_middle|>": 151660,
  "<|fim_pad|>": 151662,
  "<|fim_prefix|>": 151659,
  "<|fim_suffix|>": 151661,
  "<|im_end|>": 151645,
  "<|im_start|>": 151644,
  "<|image_pad|>": 151655,
  "<|object_ref_end|>": 151647,
  "<|object_ref_start|>": 151646,
  "<|quad_end|>": 151651,
  "<|quad_start|>": 151650,
  "<|repo_name|>": 151663,
  "<|video_pad|>": 151656,
  "<|vision_end|>": 151653,
  "<|vision_pad|>": 151654,
  "<|vision_start|>": 151652
}
config.json
ADDED
@@ -0,0 +1,50 @@
{
  "architectures": [
    "Qwen3ForCausalLM"
  ],
  "attention_bias": false,
  "attention_dropout": 0.0,
  "bos_token_id": 151643,
  "dfloat11_config": {
    "bytes_per_thread": 8,
    "pattern_dict": {
      "lm_head": [],
      "model.embed_tokens": [],
      "model.layers.\\d+": [
        "self_attn.q_proj",
        "self_attn.k_proj",
        "self_attn.v_proj",
        "self_attn.o_proj",
        "mlp.gate_proj",
        "mlp.up_proj",
        "mlp.down_proj"
      ]
    },
    "threads_per_block": [
      512
    ],
    "version": "0.2.0"
  },
  "eos_token_id": 151645,
  "head_dim": 128,
  "hidden_act": "silu",
  "hidden_size": 5120,
  "initializer_range": 0.02,
  "intermediate_size": 17408,
  "max_position_embeddings": 40960,
  "max_window_layers": 40,
  "model_type": "qwen3",
  "num_attention_heads": 40,
  "num_hidden_layers": 40,
  "num_key_value_heads": 8,
  "rms_norm_eps": 1e-06,
  "rope_scaling": null,
  "rope_theta": 1000000,
  "sliding_window": null,
  "tie_word_embeddings": false,
  "torch_dtype": "bfloat16",
  "transformers_version": "4.51.3",
  "use_cache": true,
  "use_sliding_window": false,
  "vocab_size": 151936
}
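
As a rough sanity check, the dimensions in this config can be compared against one of the per-layer file sizes in this commit. The sketch below counts only the seven projection matrices named in `pattern_dict`; it assumes the usual shapes for these projections and ignores the normalization weights and any safetensors metadata stored in the same file, so the ratio is an approximation rather than an exact compression figure.

```python
# Values copied from config.json above.
hidden_size = 5120
intermediate_size = 17408
num_attention_heads = 40
num_key_value_heads = 8
head_dim = 128

q_out = num_attention_heads * head_dim    # output width of q_proj
kv_out = num_key_value_heads * head_dim   # output width of k_proj and v_proj

params_per_layer = (
    hidden_size * q_out                      # self_attn.q_proj
    + 2 * hidden_size * kv_out               # self_attn.k_proj, self_attn.v_proj
    + q_out * hidden_size                    # self_attn.o_proj
    + 3 * hidden_size * intermediate_size    # mlp.gate_proj, mlp.up_proj, mlp.down_proj
)

bf16_bytes = 2 * params_per_layer            # BFloat16 stores 2 bytes per parameter
df11_file_bytes = 446_095_910                # size of model_layers_9.safetensors in this commit
ratio = df11_file_bytes / bf16_bytes

print(f"{params_per_layer:,} parameters per layer")
print(f"{bf16_bytes:,} bytes in BF16")
print(f"compressed / BF16 = {ratio:.3f}")
```

Under these assumptions the layer file is about 0.675 of its BF16 size, which is in line with the roughly 30% reduction stated in the README.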
generation_config.json
ADDED
@@ -0,0 +1,13 @@
{
  "bos_token_id": 151643,
  "do_sample": true,
  "eos_token_id": [
    151645,
    151643
  ],
  "pad_token_id": 151643,
  "temperature": 0.6,
  "top_k": 20,
  "top_p": 0.95,
  "transformers_version": "4.51.3"
}
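
To make the sampling settings above concrete, here is a simplified, standard-library sketch of how `temperature`, `top_k`, and `top_p` narrow a next-token distribution. The `filter_probs` function is hypothetical and is not the Transformers implementation; it assumes the filters are applied in the order temperature, top-k, top-p, and it ignores ties and other edge cases a library has to handle.

```python
import math


def filter_probs(logits, temperature=0.6, top_k=20, top_p=0.95):
    """Return {token_index: probability} after temperature, top-k and top-p filtering."""
    # Temperature below 1 sharpens the distribution.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    weights = [math.exp(x - peak) for x in scaled]  # unnormalized softmax

    # Top-k: keep only the k most likely tokens.
    ranked = sorted(range(len(weights)), key=lambda i: weights[i], reverse=True)[:top_k]
    total = sum(weights[i] for i in ranked)

    # Top-p: keep the smallest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += weights[i] / total
        if cumulative >= top_p:
            break

    norm = sum(weights[i] for i in kept)
    return {i: weights[i] / norm for i in kept}


example = filter_probs([2.0, 1.0, 0.0, -1.0])
print(example)
```

A token is then drawn at random from the returned distribution, which is why `do_sample` is set to `true` alongside these values.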
lm_head.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:2ca4ecce5d1bb4845f60bf93d7e932a43edfc4d8a2592376eae64fd715659a20
size 1055702884
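
Each `.safetensors` entry in this diff is a Git LFS pointer (version, `oid`, `size`) rather than the weights themselves. The following sketch shows one way to check a downloaded file against such a pointer. The helper names are hypothetical, the payload is a tiny made-up byte string, and a real multi-gigabyte file would need to be hashed in chunks instead of being loaded into memory.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """True if the data has the size and hash recorded in the pointer."""
    return (
        len(data) == pointer["size"]
        and hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]
    )


# Made-up payload and a pointer built for it, purely for demonstration.
payload = b"example payload"
pointer_text = (
    "version https://git-lfs.github.com/spec/v1\n"
    f"oid sha256:{hashlib.sha256(payload).hexdigest()}\n"
    f"size {len(payload)}\n"
)
pointer = parse_lfs_pointer(pointer_text)
print(matches_pointer(payload, pointer))
```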
merges.txt
ADDED
The diff for this file is too large to render.
model.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:872cea34c9b5d844d71523ff1220c492eb0807afa0acc03cc1ed1e6a64e4e5b6
size 10360
model_embed_tokens.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:99b020d1cc3f10bdf1a122260f36660b1c31c710c00ee704e551bc8d397f889e
size 1057861894
model_layers_0.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:42fef12b7c56f5d52b913ad3951f7ec1f7ea55d02d954b649a5b2bf399134cb6
size 448335915
model_layers_1.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:142ccff539212e4ab3eafee16d52750a8ba1fa84b5ab50f8d06adf08423255d6
size 487779009
model_layers_10.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b6bf55a70baed8e314f0c66118168cc5e78b429b4cc60fd8c134519eba6537f4
size 445626062
model_layers_11.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b14afe90e2fffedb4017d352345ed1b927991088aa8b993837380c33549fa7fd
size 445739053
model_layers_12.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f4e39ae35c3913d19ab4a5f16178b988d41efe2d118783cc77d08540c2430268
size 446097052
model_layers_13.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f77d37d6a1d3fe0ee9aacf216b01070ebfe877ad7fb725d0e6587a3c96bb61af
size 446478333
model_layers_14.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e02e2773623e878d9b8c66e5e56c909ba15338c69daa90ed2651eb4c54a49273
size 446577759
model_layers_15.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:30dceac03e66b7eed78d034dca0444cb2cd5d48667938b156c3f698bd28ec074
size 446456854
model_layers_16.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f4c6b01b8be77f11d8ef45a44bfdf8e36268017f4d38cd33b3aba37cbd0ffbd5
size 446723600
model_layers_17.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ef6ba15556076d0eeb95d00478442fc4a421c9ecfe21645651b4cf2dfebd8fd9
size 446940624
model_layers_18.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f780c1365ae39e3b0e9a8eca419a8ce8e7e87953ea9cf1df172be050c21e52ee
size 447226980
model_layers_19.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:25488e1a3015b6e6cb91eb40a8dc3f604c45847a713947b27dacb9d0cbe62c04
size 447302715
model_layers_2.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3782035adca132d8f3de1e8ff2e6591e84dcb965a31317162f16e6baa0401635
size 490310356
model_layers_20.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a843ae9e9224d8bba3a054d2a260d2f6a2eca4a30ba9c42e7c8c84439170d796
size 447516605
model_layers_21.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4cc434ca668684fc97f20d3412a32f88199a2abdca0aedcfa45131aa7a6c6168
size 447230016
model_layers_22.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:e017e7144603641765a54673ec847e1c2b979bfe91e3ed51df57ddb7f5a85a1d
size 447107813
model_layers_23.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:924355fe80590eb8ce583ce2b0714be15398d7a6435b3e05c80b5b1c3f2734e0
size 447026146
model_layers_24.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4f0e73494c27bdbe341aea4512632fdf45bb20268b00c5c2d0ab626f7aec34bf
size 446756247
model_layers_25.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:b1cefd1c8e85c47572fa2406effcfea09c4e530915052dfab122f9774ffacbb7
size 446673623
model_layers_26.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8eaa0efc2cb28badbf185139a73ad57b4d3c389365e3660be70e414d156bd175
size 446685292
model_layers_27.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:829b76391e8b5304408fbc0da2c87ce5a7a8ee9a84de0f136d83aead9697318c
size 446674528
model_layers_28.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:c4c1dfc3bc5a521847cbcfbeadd2910c37437984c62c79ee664abeff41c50455
size 446557884
model_layers_29.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5466a2e206b736388a7322fa275190a31276e384bbff032de03891e52c8b6dc3
size 446392941
model_layers_3.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:85a7faf199a2a2e06750576fd67b9fb7468e773c7df5fe2039ef5d1397c5b5d2
size 479236809
model_layers_30.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:25ae434b24b508abd2a23cb24c0852fdf3a9632af933d0296ec9b09d585b8556
size 446190074
model_layers_31.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:46641c1389521711f25df1e74899091ab7c4774c251b435825fafb81df9c22da
size 446182897
model_layers_32.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:f0891d17df190e93b09e93df35edecc9627e94ae0d119f974ca3411c95268bef
size 446078836
model_layers_33.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4c2f5cbaaa4cc2240c3ee955de963fd33123a8b8cb88296ce49d08615972e5aa
size 446071802
model_layers_34.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a8356d6e182283fc7d4c8a11a7247f3144dba2da986de2d61f0165165c77fe3c
size 446124779
model_layers_35.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:9aef3a26d2a2f30ff71c0a918b469a3a494fb115dd0108a32d33753a5fc8b4a4
size 446211006
model_layers_36.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8f630097733af7af4cff591fb8fd94b0af798e0c570ed35c0c262a7fb1c5e4e1
size 446160037
model_layers_37.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:07214d34285ae0ddc5798e72bc85bd427d4ce194d9731a147c0a999df200b544
size 446323194
model_layers_38.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:130c7966c5bfbbd58701f1b1709000e612df70fb3124150128730a910246b2f5
size 446456505
model_layers_39.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ae40cd2a13ea5927e0f277c5c6fc5a438063ad6cb93e6b448cf99f9c8f941192
size 447108384
model_layers_4.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a43a4016ba27fff88b79ae5d63e8f42a8fb4aa9deba46b9acfced0a510022db9
size 470529813
model_layers_5.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3eb91a7e0366bd46f6ca1ea9d324d15e16802a7ab42e084b769e0e01e12baabf
size 458885908
model_layers_6.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3bdc71d5578b341e50b76a7bc5f24ba01d6e83555e9c2c7450bb483799877788
size 449177728
model_layers_7.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4fdbca87c5ef10482a356c0346abb72afce65dd0499112fe9cd1fd5f70f59009
size 447757474
model_layers_8.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:94770c7685f23790240cb70f6dc63ffc3c06b56cae44c8c184db90272db3926a
size 446848638
model_layers_9.safetensors
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:8ecb871ec586dccc3baefe9d1fe11f60f2dfc725187a4d799700254eff397d35
size 446095910
special_tokens_map.json
ADDED
@@ -0,0 +1,31 @@
{
  "additional_special_tokens": [
    "<|im_start|>",
    "<|im_end|>",
    "<|object_ref_start|>",
    "<|object_ref_end|>",
    "<|box_start|>",
    "<|box_end|>",
    "<|quad_start|>",
    "<|quad_end|>",
    "<|vision_start|>",
    "<|vision_end|>",
    "<|vision_pad|>",
    "<|image_pad|>",
    "<|video_pad|>"
  ],
  "eos_token": {
    "content": "<|im_end|>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  },
  "pad_token": {
    "content": "<|endoftext|>",
    "lstrip": false,
    "normalized": false,
    "rstrip": false,
    "single_word": false
  }
}