Update README.md
Browse files
README.md
CHANGED
@@ -35,6 +35,11 @@ extra_gated_description: >-
|
|
35 |
pipeline_tag: image-text-to-text
|
36 |
---
|
37 |
|
|
|
|
|
|
|
|
|
|
|
38 |
# Model Card for Mistral-Small-3.1-24B-Instruct-2503
|
39 |
|
40 |
Building upon Mistral Small 3 (2501), Mistral Small 3.1 (2503) **adds state-of-the-art vision understanding** and enhances **long context capabilities up to 128k tokens** without compromising text performance.
|
|
|
35 |
pipeline_tag: image-text-to-text
|
36 |
---
|
37 |
|
38 |
+
Checkpoint of Mistral-Small-3.1-24B-Instruct-2503 with FP8 per-tensor quantization in the Mistral-format. Please run with vLLM like so:
|
39 |
+
```
|
40 |
+
vllm serve nm-testing/Mistral-Small-3.1-24B-Instruct-2503-FP8 --tokenizer_mode mistral --config_format mistral --load_format mistral --tool-call-parser mistral --enable-auto-tool-choice --limit_mm_per_prompt 'image=10'
|
41 |
+
```
|
42 |
+
|
43 |
# Model Card for Mistral-Small-3.1-24B-Instruct-2503
|
44 |
|
45 |
Building upon Mistral Small 3 (2501), Mistral Small 3.1 (2503) **adds state-of-the-art vision understanding** and enhances **long context capabilities up to 128k tokens** without compromising text performance.
|