AI - a kiozheng Collection

kiozheng 's Collections

AI

AI

updated 5 days ago

Scaling Law for Quantization-Aware Training

Paper • 2505.14302 • Published 11 days ago • 69
Reward Reasoning Model

Paper • 2505.14674 • Published 11 days ago • 33
Qwen3 Technical Report

Paper • 2505.09388 • Published 17 days ago • 168
AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published 12 days ago • 74
Thinkless: LLM Learns When to Think

Paper • 2505.13379 • Published 12 days ago • 47
Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published 25 days ago • 163
Seed1.5-VL Technical Report

Paper • 2505.07062 • Published 20 days ago • 139
MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published 19 days ago • 119
Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published 14 days ago • 109
Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published 11 days ago • 124
Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published 16 days ago • 75
Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published 23 days ago • 76
RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published 26 days ago • 73
ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published 24 days ago • 63
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published 11 days ago • 58
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

Paper • 2505.15778 • Published 10 days ago • 15
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification

Paper • 2505.16938 • Published 9 days ago • 110
Learning to Reason via Mixture-of-Thought for Logical Reasoning

Paper • 2505.15817 • Published 10 days ago • 17
One RL to See Them All: Visual Triple Unified Reinforcement Learning

Paper • 2505.18129 • Published 8 days ago • 56