
InvokeAI/ip_adapter_sd_image_encoder
A tiny vision language model
Generate text descriptions from images
Analyze image to generate descriptive prompt
Meta Llama 3 8B with LLaVA multimodal capabilities
Display a user interface for various tasks
Convert GUI screen to structured elements
Generate detailed image descriptions for prompts
Generate detailed image descriptions
Upload images and get detailed descriptions
Generate customized images using text and an ID image