Clip Vision - a rovo Collection

rovo 's Collections

3D Mesh

Audio

Text Generation

Dataset

codellm

Diffusion LORAs

Papers

Flux

Clip Vision

updated 12 days ago

InvokeAI/ip_adapter_sd_image_encoder

Updated Sep 23, 2023 • 9.67k • 12
InvokeAI/ip_adapter_sdxl_image_encoder

Updated Sep 23, 2023 • 7.26k • 14
vikhyatk/moondream2

Image-Text-to-Text • Updated Jan 9 • 149k • 1.07k
Running

426

426

moondream2

🌔

a tiny vision language model
Running on T4

1.26k

1.26k

CLIP Interrogator 2

🕵

Generate text descriptions from images
Running on A10G

2.85k

2.85k

CLIP Interrogator

🕵

Analyze image to generate descriptive prompt
Running on Zero

88

88

Llava Llama-3 8B

🔥

Meta Llama3 8b with Llava Multimodal capabilities
Running on Zero

1.19k

1.19k

FLUX Prompt Generator

😻

Display a user interface for various tasks
apple/MobileCLIP-S2-OpenCLIP

Zero-Shot Image Classification • Updated 19 days ago • 216k • 6
openai/clip-vit-large-patch14

Zero-Shot Image Classification • Updated Sep 15, 2023 • 45.8M • • 1.67k
zer0int/CLIP-GmP-ViT-L-14

Zero-Shot Image Classification • Updated Sep 23, 2024 • 6.16k • 402
Running on Zero

182

182

OmniParser

😻

Convert GUI screen to structured elements
meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • Updated Dec 4, 2024 • 1.43M • • 1.38k
huihui-ai/Llama-3.2-11B-Vision-Instruct-abliterated

Image-Text-to-Text • Updated Oct 22, 2024 • 2.02k • 25
Running on Zero

69

69

Prompt Enhancer with WD Tagger & Florence 2 Flux/SD3 Captioner

🏃

Generate detailed image descriptions for prompts
Running on Zero

72

72

Florence 2 Flux

🦀

Generate detailed image descriptions
apple/DepthPro

Depth Estimation • Updated 19 days ago • 2.06k • 413
Running

79

79

Omnivlm Dpo Demo

👁

Upload images and get detailed descriptions
guozinan/PuLID

Updated Oct 31, 2024 • 158
Running on Zero

1.84k

1.84k

PuLID-FLUX

🤗

Generate customized images using text and an ID image