mgoin commited on
Commit
67ae38e
·
verified ·
1 Parent(s): 51cfc3d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -35,6 +35,11 @@ extra_gated_description: >-
35
  pipeline_tag: image-text-to-text
36
  ---
37
 
 
 
 
 
 
38
  # Model Card for Mistral-Small-3.1-24B-Instruct-2503
39
 
40
  Building upon Mistral Small 3 (2501), Mistral Small 3.1 (2503) **adds state-of-the-art vision understanding** and enhances **long context capabilities up to 128k tokens** without compromising text performance.
 
35
  pipeline_tag: image-text-to-text
36
  ---
37
 
38
+ Checkpoint of Mistral-Small-3.1-24B-Instruct-2503 with FP8 per-tensor quantization in the Mistral-format. Please run with vLLM like so:
39
+ ```
40
+ vllm serve nm-testing/Mistral-Small-3.1-24B-Instruct-2503-FP8 --tokenizer_mode mistral --config_format mistral --load_format mistral --tool-call-parser mistral --enable-auto-tool-choice --limit_mm_per_prompt 'image=10'
41
+ ```
42
+
43
  # Model Card for Mistral-Small-3.1-24B-Instruct-2503
44
 
45
  Building upon Mistral Small 3 (2501), Mistral Small 3.1 (2503) **adds state-of-the-art vision understanding** and enhances **long context capabilities up to 128k tokens** without compromising text performance.