Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published 4 days ago • 32
Qwen3 Collection Qwen's new Qwen3 models, in Unsloth Dynamic 2.0, GGUF, 4-bit, and 16-bit Safetensors formats. Includes 128K context length variants. • 65 items • Updated 3 days ago • 130
Skywork-R1V2 Collection Multimodal Hybrid Reinforcement Learning for Reasoning • 4 items • Updated 5 days ago • 10
Pleias-RAG Collection New generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated 9 days ago • 26
OpenMathReasoning Collection Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated 9 days ago • 34
AceMath-RL Collection Math reasoning models trained through reinforcement learning (RL) • 1 item • Updated 10 days ago • 3
Describe Anything Collection Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 9 days ago • 47
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published 19 days ago • 252
Gemma 3 QAT Collection Quantization-Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using a third of the memory. • 19 items • Updated 15 days ago • 26
MAI-DS-R1 Collection MAI-DS-R1 is a DeepSeek-R1 reasoning model that has been post-trained by the Microsoft AI team. • 2 items • Updated 3 days ago • 11
BitNet Collection 🔥 BitNet family of large language models (1-bit LLMs). • 7 items • Updated 3 days ago • 32