Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
merve
's Collections
April 16 Releases
Multimodal DSE Retrievers
April 11 Releases
March 28 Releases
March 21 Releases
TΓΌrkΓ§e VLMler
Feb 14 Releases π
Feb 7 Releases π§£
January 31 Releases π§€
Models, Jan 27
Jan 24 Releases
Jan 17 Releases βοΈ
Jan 10 Releases π¨οΈ
Dec 6 Releases π
Nov 29 Releases π²π²
Nov 22 Releases βοΈ
Nov 15 Releases π
Nov 1 Releases
MIT Talk 31/10 Papers
October 25 Releases
LOTUS πͺ·
New Depth Models
BRAVE Models π¦
Computer Vision Backbones π§©
Image Classification Models πΆ π±
Object Detection Models π₯₯
Image Segmentation Models π
Zero-shot Image Classification Models πΌοΈ
Image-to-Image Models π¨
Video Classification Models πΊ
Image-to-Text Models π
Text-to-Image Models π₯
Foundation Models for Vision π§©
Segment Anything Model
OWL-series π¦
SigLIP
Awesome Document AI
SegGPT
Vision Language Models Papers πΌοΈπ¬π
gvhf/owl
gv-hf/owl
merve/owl2
Depth Anything v2 Release
Document VLM Papers
Vision Language Leaderboards
Video Language Models
SAM2
NVEagle
Multimodal RAG
Zero-shot Segmentation
Feb 14 Releases π
updated
Feb 14
Upvote
7
OpenGVLab/InternVideo2_5_Chat_8B
Video-Text-to-Text
β’
Updated
Feb 18
β’
8.38k
β’
60
AIDC-AI/Ovis2-34B
Image-Text-to-Text
β’
Updated
Feb 27
β’
1.41k
β’
148
open-r1/OpenR1-Qwen-7B
Text Generation
β’
Updated
Feb 11
β’
4.9k
β’
48
nomic-ai/nomic-embed-text-v2-moe
Sentence Similarity
β’
Updated
29 days ago
β’
258k
β’
358
Zyphra/Zonos-v0.1-hybrid
Text-to-Speech
β’
Updated
Feb 15
β’
10.6k
β’
1.06k
agentica-org/DeepScaleR-1.5B-Preview
Text Generation
β’
Updated
21 days ago
β’
70.8k
β’
549
open-r1/OpenR1-Math-Raw
Viewer
β’
Updated
Feb 24
β’
516k
β’
757
β’
73
open-r1/OpenR1-Math-220k
Viewer
β’
Updated
Feb 18
β’
450k
β’
32k
β’
563
Zyphra/Zonos-v0.1-transformer
Text-to-Speech
β’
Updated
Feb 15
β’
58.3k
β’
393
AIDC-AI/Ovis2-1B
Image-Text-to-Text
β’
Updated
Feb 27
β’
3.62k
β’
81
AIDC-AI/Ovis2-16B
Image-Text-to-Text
β’
Updated
Feb 27
β’
5.27k
β’
91
AIDC-AI/Ovis2-2B
Image-Text-to-Text
β’
Updated
Feb 27
β’
2.3k
β’
53
AIDC-AI/Ovis2-8B
Image-Text-to-Text
β’
Updated
Feb 27
β’
19.3k
β’
63
AIDC-AI/Ovis2-4B
Image-Text-to-Text
β’
Updated
Feb 27
β’
13.9k
β’
54
sbintuitions/modernbert-ja-130m
Fill-Mask
β’
Updated
Feb 27
β’
4.6k
β’
41
Zyphra/Zonos-v0.1-speaker-embedding
Updated
Feb 12
β’
27
GAIR/LIMO
Updated
Feb 6
β’
9.06k
β’
40
prithivMLmods/Hoags-2B-Exp
Image-Text-to-Text
β’
Updated
Feb 15
β’
7
β’
3
Metric-AI/ColQwenStella-2b-multilingual
Visual Document Retrieval
β’
Updated
Mar 25
β’
183
β’
7
apple/DepthPro-hf
Depth Estimation
β’
Updated
Feb 28
β’
15k
β’
52
Liberata/illustrious-xl-v1.0
Text-to-Image
β’
Updated
Feb 12
β’
132
OpenGVLab/InternVL_2_5_HiCo_R16
Video-Text-to-Text
β’
Updated
Feb 13
β’
1.32k
β’
3
OpenGVLab/InternVL_2_5_HiCo_R64
Video-Text-to-Text
β’
Updated
Feb 13
β’
246
β’
2
Upvote
7
+3
Share collection
View history
Collection guide
Browse collections