Victor Mustar's picture

Victor Mustar PRO

victor

·

victormustar

AI & ML interests

Building the UX of this website

Recent Activity

liked a Space 1 day ago

Sony/genwarp

liked a model 1 day ago

Sony/AKI-4B-phi-3.5-mini

liked a Space 1 day ago

CharlieAmalet/Tools3ox_PixelArt_SpriteSheet_GeneratorArt_Api

View all activity

Organizations

victor's activity

upvoted 3 papers 4 days ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published 5 days ago • 56

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published 6 days ago • 75

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published 6 days ago • 91

upvoted a collection 5 days ago

Gemma 3

All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 29 items • Updated about 18 hours ago • 38

upvoted 2 articles 5 days ago

Article

Getting Started With Hugging Face in 10 Minutes

By

•

7 days ago

• 4

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

5 days ago

• 279

upvoted a collection 5 days ago

Gemma 3 Release

9 items • Updated 3 days ago • 257

upvoted 2 papers 6 days ago

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published 12 days ago • 210

Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning

Paper • 2502.18080 • Published 20 days ago • 2

upvoted a collection 7 days ago

YuLan-Mini

A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 5 items • Updated 24 days ago • 15

upvoted a collection 11 days ago

Instella ✨

Announcing Instella, a series of 3 billion parameter language models developed by AMD, trained from scratch on 128 Instinct MI300X GPUs. • 5 items • Updated 11 days ago • 5

upvoted a paper 12 days ago

Multi-Turn Code Generation Through Single-Step Rewards

Paper • 2502.20380 • Published 17 days ago • 30

upvoted a paper 14 days ago

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Paper • 2310.11441 • Published Oct 17, 2023 • 28

upvoted a paper 21 days ago

YOLOv12: Attention-Centric Real-Time Object Detectors

Paper • 2502.12524 • Published 27 days ago • 10

upvoted a paper 22 days ago

AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO

Paper • 2502.14669 • Published 25 days ago • 11

upvoted a collection 23 days ago

Ola

Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment • 4 items • Updated 24 days ago • 2

upvoted a paper 24 days ago

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

Paper • 2502.14846 • Published 24 days ago • 13

upvoted an article 25 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

25 days ago

• 207

upvoted a paper 26 days ago

Rethinking Diverse Human Preference Learning through Principal Component Analysis

Paper • 2502.13131 • Published 26 days ago • 35

upvoted a collection 26 days ago

LipSync and Face Operations

16 items • Updated 19 days ago • 44