Update README.md
Browse files
README.md
CHANGED
@@ -18,6 +18,9 @@ This is a test validation to see if we can prune the model according to professi
|
|
18 |
|
19 |
The total parameter is equivalent to 8B.
|
20 |
|
|
|
|
|
|
|
21 |
|
22 |
## Use with transformers
|
23 |
|
|
|
18 |
|
19 |
The total parameter is equivalent to 8B.
|
20 |
|
21 |
+
This model has the same architecture as [deepseek-ai/DeepSeek-V3](https://huggingface.co/deepseek-ai/DeepSeek-V3) model, and we will try the pruned version of the [deepseek-ai/DeepSeek-V3](https://huggingface.co/deepseek-ai/DeepSeek-V3) model.
|
22 |
+
|
23 |
+
|
24 |
|
25 |
## Use with transformers
|
26 |
|