nm-testing
/

Mistral-Small-3.1-24B-Instruct-2503-FP8

Image-Text-to-Text

Model card Files Files and versions Community

mgoin commited on 17 days ago

Commit

67ae38e

·

verified ·

1 Parent(s): 51cfc3d

Update README.md

Files changed (1) hide show

README.md +5 -0

README.md CHANGED Viewed

@@ -35,6 +35,11 @@ extra_gated_description: >-
 pipeline_tag: image-text-to-text
 ---
 # Model Card for Mistral-Small-3.1-24B-Instruct-2503
 Building upon Mistral Small 3 (2501), Mistral Small 3.1 (2503) **adds state-of-the-art vision understanding** and enhances **long context capabilities up to 128k tokens** without compromising text performance.

 pipeline_tag: image-text-to-text
 ---
+Checkpoint of Mistral-Small-3.1-24B-Instruct-2503 with FP8 per-tensor quantization in the Mistral-format. Please run with vLLM like so:
+```
+vllm serve nm-testing/Mistral-Small-3.1-24B-Instruct-2503-FP8 --tokenizer_mode mistral --config_format mistral --load_format mistral --tool-call-parser mistral --enable-auto-tool-choice --limit_mm_per_prompt 'image=10'
+```
 # Model Card for Mistral-Small-3.1-24B-Instruct-2503
 Building upon Mistral Small 3 (2501), Mistral Small 3.1 (2503) **adds state-of-the-art vision understanding** and enhances **long context capabilities up to 128k tokens** without compromising text performance.