Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
blanchefort
's Collections
Datasets for Embodied
Ru text encoders
Text2Image
VLMs
VLMs
updated
Feb 14
Upvote
-
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
Updated
Feb 6
•
1.22M
•
•
1.16k
NVEagle/Eagle-X5-13B-Chat
Image-Text-to-Text
•
Updated
Sep 16, 2024
•
54
•
28
internlm/internlm-xcomposer2d5-7b
Visual Question Answering
•
Updated
Jul 22, 2024
•
6.39k
•
204
AIRI-Institute/OmniFusion
Updated
Apr 10, 2024
•
56
OpenGVLab/InternVideo2_chat_8B_HD
Video-Text-to-Text
•
Updated
Dec 18, 2024
•
191
•
16
OpenGVLab/InternVideo2-Chat-8B
Video-Text-to-Text
•
Updated
Oct 10, 2024
•
1.73k
•
22
THUDM/cogvlm2-video-llama3-chat
Text Generation
•
Updated
Jul 24, 2024
•
1.29k
•
45
nyu-visionx/cambrian-34b
Text Generation
•
Updated
Jun 28, 2024
•
357
•
28
THUDM/cogvlm-base-490-hf
Text Generation
•
Updated
Nov 20, 2023
•
64
•
7
THUDM/cogvlm-chat-hf
Text Generation
•
Updated
Dec 19, 2023
•
5.51k
•
193
THUDM/cogvlm-grounding-generalist-hf
Text Generation
•
Updated
Dec 11, 2023
•
568
•
15
Qwen/Qwen-VL
Text Generation
•
Updated
Jan 25, 2024
•
18.3k
•
234
liuhaotian/llava-v1.5-7b
Image-Text-to-Text
•
Updated
May 8, 2024
•
1M
•
435
LanguageBind/MoE-LLaVA-Phi2-2.7B-4e-384
Text Generation
•
Updated
Feb 1, 2024
•
262
•
32
LanguageBind/Video-LLaVA-7B-hf
Updated
May 16, 2024
•
24k
•
41
openvla/openvla-7b-prismatic
Image-Text-to-Text
•
Updated
Jul 9, 2024
•
65
•
5
openvla/openvla-7b-finetuned-libero-object
Image-Text-to-Text
•
Updated
Oct 9, 2024
•
1.06k
•
1
openvla/openvla-7b-finetuned-libero-10
Image-Text-to-Text
•
Updated
Oct 9, 2024
•
1.2k
•
2
IntelLabs/LlavaOLMoBitnet1B
Updated
Aug 30, 2024
•
26
•
30
mistral-community/pixtral-12b-240910
Image-Text-to-Text
•
Updated
Oct 1, 2024
•
383
LanguageBind/MoE-LLaVA-StableLM-1.6B-4e
Text Generation
•
Updated
Feb 1, 2024
•
1.33k
•
8
llava-hf/LLaVA-NeXT-Video-7B-hf
Video-Text-to-Text
•
Updated
Jan 27
•
60.6k
•
73
Qwen/Qwen-VL-Chat
Text Generation
•
Updated
Jan 25, 2024
•
34.5k
•
359
LanguageBind/Video-LLaVA-7B
Text Generation
•
Updated
Apr 9, 2024
•
2.44k
•
83
LanguageBind/LanguageBind_Image
Zero-Shot Image Classification
•
Updated
Feb 1, 2024
•
46.6k
•
11
LanguageBind/LanguageBind_Video
Zero-Shot Image Classification
•
Updated
Feb 1, 2024
•
291
•
2
llava-hf/llava-1.5-13b-hf
Image-Text-to-Text
•
Updated
Jan 27
•
20.5k
•
•
30
llava-hf/llava-1.5-7b-hf
Image-Text-to-Text
•
Updated
Jan 27
•
808k
•
•
240
FreedomIntelligence/LongLLaVA-53B-A13B
Image-Text-to-Text
•
Updated
Nov 28, 2024
•
237
•
20
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
•
Updated
Sep 27, 2024
•
37.7k
•
484
BAAI/Emu3-VisionTokenizer
Feature Extraction
•
Updated
Oct 8, 2024
•
16.6k
•
56
openbmb/MiniCPM-V-2_6
Image-Text-to-Text
•
Updated
Jan 15
•
73.4k
•
957
openbmb/MiniCPM-V
Visual Question Answering
•
Updated
Jan 15
•
56.6k
•
165
openbmb/MiniCPM-V-2
Visual Question Answering
•
Updated
Jan 15
•
5.03k
•
450
openbmb/MiniCPM-Llama3-V-2_5
Image-Text-to-Text
•
Updated
Jan 15
•
26.3k
•
1.39k
nvidia/NVLM-D-72B
Image-Text-to-Text
•
Updated
Jan 14
•
21.3k
•
765
vikhyatk/moondream2
Image-Text-to-Text
•
Updated
Jan 9
•
149k
•
1.07k
allenai/Molmo-72B-0924
Image-Text-to-Text
•
Updated
Oct 10, 2024
•
4.32k
•
283
allenai/MolmoE-1B-0924
Image-Text-to-Text
•
Updated
Oct 10, 2024
•
45.8k
•
138
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
Updated
Oct 10, 2024
•
148k
•
515
allenai/Molmo-7B-O-0924
Image-Text-to-Text
•
Updated
Nov 15, 2024
•
18.6k
•
155
deepseek-ai/Janus-1.3B
Any-to-Any
•
Updated
Jan 27
•
14.9k
•
580
neulab/Pangea-7B
Updated
Oct 24, 2024
•
11.6k
•
126
neulab/Pangea-7B-hf
Updated
Oct 28, 2024
•
865
•
8
BAAI/Aquila-VL-2B-llava-qwen
Visual Question Answering
•
Updated
Nov 25, 2024
•
1.37k
•
56
mistralai/Pixtral-Large-Instruct-2411
Image-Text-to-Text
•
Updated
3 days ago
•
33
•
400
google/paligemma2-10b-pt-224
Image-Text-to-Text
•
Updated
Dec 5, 2024
•
1.96k
•
8
google/paligemma2-3b-pt-224
Image-Text-to-Text
•
Updated
Dec 5, 2024
•
113k
•
147
vidore/colqwen2-v1.0
Visual Document Retrieval
•
Updated
5 days ago
•
76.1k
•
78
deepseek-ai/Janus-Pro-7B
Any-to-Any
•
Updated
Feb 1
•
242k
•
3.23k
deepseek-ai/Janus-Pro-1B
Any-to-Any
•
Updated
Feb 1
•
80.9k
•
405
nvidia/Eagle2-9B
Image-Text-to-Text
•
Updated
Jan 28
•
6.06k
•
45
openbmb/MiniCPM-o-2_6
Any-to-Any
•
Updated
16 days ago
•
263k
•
1.05k
DAMO-NLP-SG/VideoLLaMA3-7B
Visual Question Answering
•
Updated
7 days ago
•
23.1k
•
40
DAMO-NLP-SG/VideoLLaMA3-2B
Visual Question Answering
•
Updated
7 days ago
•
5.63k
•
10
AIDC-AI/Ovis2-8B
Image-Text-to-Text
•
Updated
20 days ago
•
7.45k
•
57
Upvote
-
Share collection
View history
Collection guide
Browse collections