Running 294 294 Qwen2.5 Omni 7B Demo 🏆 Generate text and speech responses from text, images, or audio input
Sleeping 97 97 CountGD_Multi-Modal_Open-World_Counting 🚀 Count objects in images using text or visual examples