VLMs - a blanchefort Collection

Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

blanchefort 's Collections

Datasets for Embodied

Ru text encoders

VLMs

VLMs

updated Feb 14

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • Updated Feb 6 • 1.22M • • 1.16k
NVEagle/Eagle-X5-13B-Chat

Image-Text-to-Text • Updated Sep 16, 2024 • 54 • 28
internlm/internlm-xcomposer2d5-7b

Visual Question Answering • Updated Jul 22, 2024 • 6.39k • 204
AIRI-Institute/OmniFusion

Updated Apr 10, 2024 • 56
OpenGVLab/InternVideo2_chat_8B_HD

Video-Text-to-Text • Updated Dec 18, 2024 • 191 • 16
OpenGVLab/InternVideo2-Chat-8B

Video-Text-to-Text • Updated Oct 10, 2024 • 1.73k • 22
THUDM/cogvlm2-video-llama3-chat

Text Generation • Updated Jul 24, 2024 • 1.29k • 45
nyu-visionx/cambrian-34b

Text Generation • Updated Jun 28, 2024 • 357 • 28
THUDM/cogvlm-base-490-hf

Text Generation • Updated Nov 20, 2023 • 64 • 7
THUDM/cogvlm-chat-hf

Text Generation • Updated Dec 19, 2023 • 5.51k • 193
THUDM/cogvlm-grounding-generalist-hf

Text Generation • Updated Dec 11, 2023 • 568 • 15
Qwen/Qwen-VL

Text Generation • Updated Jan 25, 2024 • 18.3k • 234
liuhaotian/llava-v1.5-7b

Image-Text-to-Text • Updated May 8, 2024 • 1M • 435
LanguageBind/MoE-LLaVA-Phi2-2.7B-4e-384

Text Generation • Updated Feb 1, 2024 • 262 • 32
LanguageBind/Video-LLaVA-7B-hf

Updated May 16, 2024 • 24k • 41
openvla/openvla-7b-prismatic

Image-Text-to-Text • Updated Jul 9, 2024 • 65 • 5
openvla/openvla-7b-finetuned-libero-object

Image-Text-to-Text • Updated Oct 9, 2024 • 1.06k • 1
openvla/openvla-7b-finetuned-libero-10

Image-Text-to-Text • Updated Oct 9, 2024 • 1.2k • 2
IntelLabs/LlavaOLMoBitnet1B

Updated Aug 30, 2024 • 26 • 30
mistral-community/pixtral-12b-240910

Image-Text-to-Text • Updated Oct 1, 2024 • 383
LanguageBind/MoE-LLaVA-StableLM-1.6B-4e

Text Generation • Updated Feb 1, 2024 • 1.33k • 8
llava-hf/LLaVA-NeXT-Video-7B-hf

Video-Text-to-Text • Updated Jan 27 • 60.6k • 73
Qwen/Qwen-VL-Chat

Text Generation • Updated Jan 25, 2024 • 34.5k • 359
LanguageBind/Video-LLaVA-7B

Text Generation • Updated Apr 9, 2024 • 2.44k • 83
LanguageBind/LanguageBind_Image

Zero-Shot Image Classification • Updated Feb 1, 2024 • 46.6k • 11
LanguageBind/LanguageBind_Video

Zero-Shot Image Classification • Updated Feb 1, 2024 • 291 • 2
llava-hf/llava-1.5-13b-hf

Image-Text-to-Text • Updated Jan 27 • 20.5k • • 30
llava-hf/llava-1.5-7b-hf

Image-Text-to-Text • Updated Jan 27 • 808k • • 240
FreedomIntelligence/LongLLaVA-53B-A13B

Image-Text-to-Text • Updated Nov 28, 2024 • 237 • 20
meta-llama/Llama-3.2-11B-Vision

Image-Text-to-Text • Updated Sep 27, 2024 • 37.7k • 484
BAAI/Emu3-VisionTokenizer

Feature Extraction • Updated Oct 8, 2024 • 16.6k • 56
openbmb/MiniCPM-V-2_6

Image-Text-to-Text • Updated Jan 15 • 73.4k • 957
openbmb/MiniCPM-V

Visual Question Answering • Updated Jan 15 • 56.6k • 165
openbmb/MiniCPM-V-2

Visual Question Answering • Updated Jan 15 • 5.03k • 450
openbmb/MiniCPM-Llama3-V-2_5

Image-Text-to-Text • Updated Jan 15 • 26.3k • 1.39k
nvidia/NVLM-D-72B

Image-Text-to-Text • Updated Jan 14 • 21.3k • 765
vikhyatk/moondream2

Image-Text-to-Text • Updated Jan 9 • 149k • 1.07k
allenai/Molmo-72B-0924

Image-Text-to-Text • Updated Oct 10, 2024 • 4.32k • 283
allenai/MolmoE-1B-0924

Image-Text-to-Text • Updated Oct 10, 2024 • 45.8k • 138
allenai/Molmo-7B-D-0924

Image-Text-to-Text • Updated Oct 10, 2024 • 148k • 515
allenai/Molmo-7B-O-0924

Image-Text-to-Text • Updated Nov 15, 2024 • 18.6k • 155
deepseek-ai/Janus-1.3B

Any-to-Any • Updated Jan 27 • 14.9k • 580
neulab/Pangea-7B

Updated Oct 24, 2024 • 11.6k • 126
neulab/Pangea-7B-hf

Updated Oct 28, 2024 • 865 • 8
BAAI/Aquila-VL-2B-llava-qwen

Visual Question Answering • Updated Nov 25, 2024 • 1.37k • 56
mistralai/Pixtral-Large-Instruct-2411

Image-Text-to-Text • Updated 3 days ago • 33 • 400
google/paligemma2-10b-pt-224

Image-Text-to-Text • Updated Dec 5, 2024 • 1.96k • 8
google/paligemma2-3b-pt-224

Image-Text-to-Text • Updated Dec 5, 2024 • 113k • 147
vidore/colqwen2-v1.0

Visual Document Retrieval • Updated 5 days ago • 76.1k • 78
deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1 • 242k • 3.23k
deepseek-ai/Janus-Pro-1B

Any-to-Any • Updated Feb 1 • 80.9k • 405
nvidia/Eagle2-9B

Image-Text-to-Text • Updated Jan 28 • 6.06k • 45
openbmb/MiniCPM-o-2_6

Any-to-Any • Updated 16 days ago • 263k • 1.05k
DAMO-NLP-SG/VideoLLaMA3-7B

Visual Question Answering • Updated 7 days ago • 23.1k • 40
DAMO-NLP-SG/VideoLLaMA3-2B

Visual Question Answering • Updated 7 days ago • 5.63k • 10
AIDC-AI/Ovis2-8B

Image-Text-to-Text • Updated 20 days ago • 7.45k • 57

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs