YuE: Scaling Open Foundation Models for Long-Form Music Generation Paper • 2503.08638 • Published 5 days ago • 56
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL Paper • 2503.07536 • Published 6 days ago • 75
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published 6 days ago • 90
Gemma 3 Collection All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 29 items • Updated about 2 hours ago • 37
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 5 days ago • 268
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published 11 days ago • 210
Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning Paper • 2502.18080 • Published 19 days ago • 2
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 5 items • Updated 23 days ago • 15
Instella ✨ Collection Announcing Instella, a series of 3 billion parameter language models developed by AMD, trained from scratch on 128 Instinct MI300X GPUs. • 5 items • Updated 11 days ago • 5
Multi-Turn Code Generation Through Single-Step Rewards Paper • 2502.20380 • Published 17 days ago • 30
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V Paper • 2310.11441 • Published Oct 17, 2023 • 28
YOLOv12: Attention-Centric Real-Time Object Detectors Paper • 2502.12524 • Published 26 days ago • 10
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO Paper • 2502.14669 • Published 24 days ago • 11
Ola Collection Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment • 4 items • Updated 23 days ago • 2
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation Paper • 2502.14846 • Published 24 days ago • 13
Rethinking Diverse Human Preference Learning through Principal Component Analysis Paper • 2502.13131 • Published 26 days ago • 35