Aurelien Lac

uminaty

AI & ML interests

Computer vision, image generation, image translation, LLMs, multimodal AI

Recent Activity

upvoted a paper 29 days ago

SmolVLM: Redefining small and efficient multimodal models

liked a Space 30 days ago

vidore/vidore-leaderboard

liked a model 3 months ago

Zyphra/Zonos-v0.1-hybrid

uminaty's activity

upvoted a paper 29 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published about 1 month ago • 180

liked a Space 30 days ago

133

Vidore Leaderboard

🥇

Browse and submit visual document retrieval benchmark results

liked a model 3 months ago

Zyphra/Zonos-v0.1-hybrid

Text-to-Speech • Updated 1 day ago • 11.5k • 1.06k

upvoted a collection 3 months ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 11 items • Updated 9 days ago • 463

liked 2 models 5 months ago

answerdotai/ModernBERT-base

Fill-Mask • Updated Jan 15 • 475k • 840

answerdotai/ModernBERT-large

Fill-Mask • Updated Jan 15 • 89.5k • 390

upvoted a paper 5 months ago

Arbitrary-steps Image Super-resolution via Diffusion Inversion

Paper • 2412.09013 • Published Dec 12, 2024 • 13

New activity in lightonai/MonoQwen2-VL-v0.1 6 months ago

Model loading issues

#1 opened 6 months ago by MaxJeblick

updated a model 6 months ago

lightonai/MonoQwen2-VL-v0.1

Updated Jan 9 • 866 • 35

liked a model 6 months ago

lightonai/MonoQwen2-VL-v0.1

Updated Jan 9 • 866 • 35

liked a Space 7 months ago

Vision Pipeline

🌍

Query an image index to get answers

upvoted a paper 10 months ago

ColPali: Efficient Document Retrieval with Vision Language Models

Paper • 2407.01449 • Published Jun 27, 2024 • 48

liked a model 11 months ago

stabilityai/stable-audio-open-1.0

Text-to-Audio • Updated Apr 1 • 39.2k • 1.18k

liked a model 12 months ago

microsoft/Phi-3-medium-128k-instruct

Text Generation • Updated Aug 20, 2024 • 23k • 381

upvoted 2 papers about 1 year ago

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30, 2024 • 113

World Model on Million-Length Video And Language With RingAttention

Paper • 2402.08268 • Published Feb 13, 2024 • 39

liked a Space over 1 year ago

13k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots

upvoted a paper over 1 year ago

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

Paper • 2312.09390 • Published Dec 14, 2023 • 33