Hugging Face
Models: 843
Sort: Trending
Active filters: multimodal
huihui-ai/Qwen2-VL-72B-Instruct-abliterated • Image-Text-to-Text • Updated Nov 19, 2024 • 51 • 4
Cylingo/XinYuan-VL-2B-GGUF • Image-Text-to-Text • Updated Nov 23, 2024 • 22 • 3
unsloth/llava-v1.6-mistral-7b-hf-bnb-4bit • Image-Text-to-Text • Updated Feb 13 • 2.47k • 6
erax-ai/EraX-VL-7B-V1.5 • Visual Question Answering • Updated about 1 month ago • 409 • 8
NCSOFT/VARCO-VISION-14B-HF • Image-Text-to-Text • Updated Mar 17 • 560 • 24
CogACT/CogACT-Base • Robotics • Updated Dec 4, 2024 • 6.04k • 12
Flex-Data/bm-v1 • Audio-Text-to-Text • Updated Dec 4, 2024 • 3
CogACT/CogACT-Large • Robotics • Updated Dec 4, 2024 • 124 • 3
CogACT/CogACT-Small • Robotics • Updated Dec 4, 2024 • 501 • 4
rhymes-ai/Aria-Base-64K • Image-Text-to-Text • Updated Dec 1, 2024 • 21 • 14
rhymes-ai/Aria-Chat • Image-Text-to-Text • Updated Dec 15, 2024 • 82 • 11
rhymes-ai/Aria-Base-8K • Image-Text-to-Text • Updated Dec 1, 2024 • 15 • 9
AnyModal/LaTeX-OCR-Llama-3.2-1B • Updated Dec 23, 2024 • 6
Qwen/Qwen2-VL-72B • Image-Text-to-Text • Updated Dec 6, 2024 • 78.8k • 77
unsloth/Pixtral-12B-2409-unsloth-bnb-4bit • Image-Text-to-Text • Updated Dec 4, 2024 • 5.33k • 10
lmstudio-community/Qwen2-VL-7B-Instruct-GGUF • Image-Text-to-Text • Updated Jan 6 • 4.41k • 5
second-state/Qwen2-VL-7B-Instruct-GGUF • Image-Text-to-Text • Updated Jan 11 • 195 • 5
second-state/Qwen2-VL-2B-Instruct-GGUF • Image-Text-to-Text • Updated Jan 11 • 111 • 3
gaianet/Qwen2-VL-2B-Instruct-GGUF • Image-Text-to-Text • Updated Dec 15, 2024 • 91 • 1
mradermacher/Qwen2-VL-2B-Instruct-GGUF • Updated Jan 21 • 197 • 1
mradermacher/Qwen2-VL-72B-Instruct-abliterated-i1-GGUF • Updated Dec 15, 2024 • 68 • 1
GoodiesHere/Apollo-LMMs-Apollo-1_5B-t32 • Video-Text-to-Text • Updated Dec 18, 2024 • 37 • 10
GoodiesHere/Apollo-LMMs-Apollo-3B-t32 • Text Generation • Updated Dec 18, 2024 • 69 • 20
mlx-community/Llama-3.2-90B-Vision-Instruct-4bit • Image-Text-to-Text • Updated Dec 21, 2024 • 57 • 3
Sri-Vigneshwar-DJ/Apollo-LMMs-Apollo-7B-t32 • Video-Text-to-Text • Updated Jan 1 • 11 • 1
mradermacher/UGround-V1-7B-GGUF • Updated Jan 4 • 48 • 1
osunlp/UGround-V1-72B-Preview • Image-Text-to-Text • Updated Jan 12 • 21 • 2
nintwentydo/Razorback-12B-v0.1 • Image-Text-to-Text • Updated Jan 10 • 8 • 3
nintwentydo/Razorback-12B-v0.2 • Image-Text-to-Text • Updated Jan 10 • 11 • 3
erax-ai/EraX-VL-7B-V2.0-Preview • Visual Question Answering • Updated Jan 21 • 354 • 22