Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Replicate
Fireworks
Cerebras
Hyperbolic
fal
Nebius AI Studio
Novita
Together AI
Cohere
SambaNova
HF Inference API
Misc
Reset Misc
multimodal
Inference Endpoints
text-generation-inference
custom_code
AutoTrain Compatible
4-bit precision
Eval Results
Merge
8-bit precision
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
843
Full-text search
Edit filters
Sort: Trending
Active filters:
multimodal
Clear all
OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448
Video-Text-to-Text
•
Updated
Mar 16
•
963
•
18
OpenGVLab/VideoChat-Flash-Qwen2-7B_res224
Video-Text-to-Text
•
Updated
Mar 16
•
77
•
6
OpenGVLab/VideoChat-Flash-Qwen2-7B_res448
Video-Text-to-Text
•
Updated
Mar 16
•
691
•
12
osunlp/UGround-V1-72B
Image-Text-to-Text
•
Updated
Jan 23
•
136
•
4
tahamajs/plamma
Updated
Feb 9
•
1
•
3
XelotX/Qwen2-VL-7B-Instruct-GGUF
Image-Text-to-Text
•
Updated
Jan 16
•
203
•
1
Minthy/ToriiGate-v0.4-7B
Image-Text-to-Text
•
Updated
Jan 22
•
485
•
35
Minthy/ToriiGate-v0.4-2B
Image-Text-to-Text
•
Updated
Jan 19
•
149
•
10
Minthy/ToriiGate-v0.4-7B-exl2-8bpw
Updated
Jan 19
•
3
•
1
ByteDance-Seed/UI-TARS-2B-SFT
Image-Text-to-Text
•
Updated
Jan 25
•
5.58k
•
19
ByteDance-Seed/UI-TARS-72B-SFT
Image-Text-to-Text
•
Updated
Jan 25
•
131
•
17
mradermacher/UI-TARS-7B-DPO-GGUF
Updated
16 days ago
•
361
•
9
mradermacher/Qwen2-VL-7B-Instruct-GGUF
Updated
Jan 21
•
51
•
1
OpenGVLab/InternVideo2_5_Chat_8B
Video-Text-to-Text
•
Updated
Feb 18
•
7.79k
•
61
ByteDance-Seed/UI-TARS-7B-DPO
Image-Text-to-Text
•
Updated
Jan 25
•
51.7k
•
207
OpenGVLab/InternVL_2_5_HiCo_R16
Video-Text-to-Text
•
Updated
Feb 13
•
1.2k
•
3
OpenGVLab/InternVL_2_5_HiCo_R64
Video-Text-to-Text
•
Updated
Feb 13
•
258
•
2
lmstudio-community/UI-TARS-7B-DPO-GGUF
Image-Text-to-Text
•
Updated
Jan 23
•
715
•
6
lmstudio-community/UI-TARS-2B-SFT-GGUF
Image-Text-to-Text
•
Updated
Jan 23
•
218
•
3
lmstudio-community/UI-TARS-72B-DPO-GGUF
Image-Text-to-Text
•
Updated
Jan 23
•
141
•
1
3ib0n/Qwen2-VL-2B-rkllm
Image-Text-to-Text
•
Updated
Jan 23
•
3
Sci-fi-vy/Llama-3.2-11B-Vision-Instruct-finetuned
Image-Text-to-Text
•
Updated
Jan 26
•
10
•
1
mlx-community/Qwen2.5-VL-3B-Instruct-4bit
Image-Text-to-Text
•
Updated
Feb 25
•
1.57k
•
2
mlx-community/Qwen2.5-VL-3B-Instruct-8bit
Image-Text-to-Text
•
Updated
Feb 26
•
369
•
7
mlx-community/Qwen2.5-VL-3B-Instruct-bf16
Image-Text-to-Text
•
Updated
Feb 26
•
69
•
2
mlx-community/Qwen2.5-VL-7B-Instruct-6bit
Image-Text-to-Text
•
Updated
Feb 25
•
65
•
3
mlx-community/Qwen2.5-VL-7B-Instruct-8bit
Image-Text-to-Text
•
Updated
Feb 25
•
2.35k
•
16
mlx-community/Qwen2.5-VL-7B-Instruct-bf16
Image-Text-to-Text
•
Updated
Feb 26
•
93
•
3
mlx-community/Qwen2.5-VL-72B-Instruct-4bit
Image-Text-to-Text
•
Updated
Feb 25
•
228
•
6
jarvisvasu/Qwen2.5-VL-3B-Instruct-4bit
Image-Text-to-Text
•
Updated
Jan 29
•
177
•
3
Previous
1
...
3
4
5
6
7
...
29
Next