DreamO: A Unified Framework for Image Customization Paper โข 2504.16915 โข Published 15 days ago โข 20
Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation Paper โข 2504.17207 โข Published 14 days ago โข 29
Step1X-Edit: A Practical Framework for General Image Editing Paper โข 2504.17761 โข Published 14 days ago โข 86
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 โข 11 items โข Updated 9 days ago โข 463
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 โข 15 items โข Updated Dec 6, 2024 โข 601