lmms-lab's Collections
LLaVA-OneVision
Updated Oct 5, 2024
A model good at arbitrary types of visual input.
LLaVA-OneVision: Easy Visual Task Transfer
Paper • 2408.03326 • Published Aug 6, 2024 • 61
lmms-lab/LLaVA-OneVision-Mid-Data
Viewer • Updated Aug 26, 2024 • 563k • 530 • 19
lmms-lab/LLaVA-OneVision-Data
Viewer • Updated Oct 22, 2024 • 3.72M • 17.6k • 184
lmms-lab/LLaVA-NeXT-Data
Viewer • Updated Aug 30, 2024 • 779k • 2.26k • 32
lmms-lab/llavanext-qwen-siglip-tokenizer
Text Generation • Updated Jul 11, 2024 • 71 • 3
lmms-lab/llava-onevision-qwen2-0.5b-si
Text Generation • Updated Sep 2, 2024 • 13k • 13
lmms-lab/llava-onevision-qwen2-0.5b-ov
Text Generation • Updated Sep 2, 2024 • 36.3k • 18
lmms-lab/llava-onevision-qwen2-7b-si
Text Generation • Updated Sep 2, 2024 • 5.9k • 12
lmms-lab/llava-onevision-qwen2-7b-ov
Text Generation • Updated Sep 2, 2024 • 83.3k • 50
lmms-lab/llava-onevision-qwen2-72b-si
Text Generation • Updated Sep 2, 2024 • 127 • 1
lmms-lab/llava-onevision-qwen2-72b-ov-sft
Text Generation • Updated Sep 2, 2024 • 2.27k • 14
lmms-lab/llava-onevision-qwen2-72b-ov-chat
Image-Text-to-Text • Updated Oct 9, 2024 • 712 • 8
lmms-lab/llava-onevision-projectors
Updated Aug 14, 2024 • 3
lmms-lab/llava-onevision-qwen2-0.5b-mid-stage-a4
Updated Aug 6, 2024 • 817
lmms-lab/llava-onevision-qwen2-7b-mid-stage-a4
Updated Aug 6, 2024 • 42