Hugging Face
Models: 9,611
Sort: Trending
Active filters: image-text-to-text
Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • Updated Feb 6 • 1.01M downloads • 1.18k likes
HuggingFaceTB/SmolVLM-256M-Instruct • Image-Text-to-Text • Updated 25 days ago • 412k downloads • 219 likes
Qwen/Qwen2.5-VL-7B-Instruct-AWQ • Image-Text-to-Text • Updated 27 days ago • 96.2k downloads • 60 likes
unsloth/gemma-3-4b-it-GGUF • Image-Text-to-Text • Updated 7 days ago • 34.3k downloads • 79 likes
OpenGVLab/InternVL3-14B • Image-Text-to-Text • Updated 9 days ago • 58.7k downloads • 48 likes
google/gemma-3-27b-it-qat-q4_0-unquantized • Image-Text-to-Text • Updated 18 days ago • 31.8k downloads • 26 likes
Skywork/SkyCaptioner-V1 • Video-Text-to-Text • Updated 9 days ago • 459 downloads • 32 likes
unsloth/gemma-3-12b-it-qat-GGUF • Image-Text-to-Text • Updated 7 days ago • 6.18k downloads • 7 likes
microsoft/trocr-base-handwritten • Image-to-Text • Updated Feb 11 • 219k downloads • 407 likes
google/paligemma-3b-pt-224 • Image-Text-to-Text • Updated Sep 21, 2024 • 25.1k downloads • 320 likes
openbmb/MiniCPM-V-2_6 • Image-Text-to-Text • Updated Jan 15 • 68k downloads • 969 likes
HuggingFaceTB/SmolVLM-Instruct • Image-Text-to-Text • Updated 25 days ago • 77.1k downloads • 433 likes
prithivMLmods/Qwen2-VL-OCR-2B-Instruct • Image-Text-to-Text • Updated about 21 hours ago • 51.9k downloads • 65 likes
nvidia/Eagle2-1B • Image-Text-to-Text • Updated 6 days ago • 2.6k downloads • 23 likes
ByteDance-Seed/UI-TARS-72B-DPO • Image-Text-to-Text • Updated Jan 25 • 14.2k downloads • 123 likes
unsloth/gemma-3-12b-it-GGUF • Image-Text-to-Text • Updated 7 days ago • 34.8k downloads • 65 likes
unsloth/gemma-3-4b-it-unsloth-bnb-4bit • Image-Text-to-Text • Updated 9 days ago • 282k downloads • 14 likes
Qwen/Qwen2.5-VL-32B-Instruct-AWQ • Image-Text-to-Text • Updated 27 days ago • 43.1k downloads • 40 likes
Tesslate/Synthia-S1-27b • Image-Text-to-Text • Updated 25 days ago • 715 downloads • 60 likes
OpenGVLab/InternVL3-38B • Image-Text-to-Text • Updated 9 days ago • 10.3k downloads • 24 likes
tngtech/olmOCR-7B-faithful • Image-Text-to-Text • Updated 15 days ago • 498 downloads • 10 likes
mlx-community/gemma-3-27b-it-qat-4bit • Image-Text-to-Text • Updated 14 days ago • 2.64k downloads • 15 likes
remyxai/SpaceThinker-Qwen2.5VL-3B • Image-Text-to-Text • Updated 7 days ago • 840 downloads • 8 likes
bartowski/google_gemma-3-27b-it-qat-GGUF • Image-Text-to-Text • Updated 12 days ago • 16.2k downloads • 32 likes
chancharikm/qwen2.5-vl-7b-cam-motion-preview • Video-Text-to-Text • Updated 1 day ago • 646 downloads • 3 likes
microsoft/trocr-small-handwritten • Image-to-Text • Updated May 27, 2024 • 531k downloads • 47 likes
Salesforce/blip2-opt-2.7b • Image-Text-to-Text • Updated Feb 3 • 895k downloads • 360 likes
liuhaotian/llava-v1.5-7b • Image-Text-to-Text • Updated May 8, 2024 • 1.23M downloads • 449 likes
openvla/openvla-7b • Image-Text-to-Text • Updated Sep 16, 2024 • 1.71M downloads • 110 likes
onnx-community/Florence-2-base-ft • Image-Text-to-Text • Updated Feb 15 • 43.5k downloads • 30 likes