Zhixiang Zhou

SuperposedWave

AI & ML interests

None yet

Organizations

None yet

SuperposedWave's activity

authored 2 papers 14 days ago

CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models

Paper • 2505.12504 • Published 16 days ago • 23

MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision

Paper • 2505.13427 • Published 15 days ago • 25

upvoted 2 papers 14 days ago

MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision

Paper • 2505.13427 • Published 15 days ago • 25

CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models

Paper • 2505.12504 • Published 16 days ago • 23

liked a dataset 24 days ago

yeliudev/VideoMind-Dataset

Preview • Updated 17 days ago • 6.67k • 4

upvoted a paper 3 months ago

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published Mar 10 • 62

liked 2 models 3 months ago

FanqingM/MM-Eureka-8B

Updated 27 days ago • 894 • 6

FanqingM/MM-Eureka-Zero-38B

Updated Mar 7 • 3

liked 2 datasets 3 months ago

AI-MO/NuminaMath-1.5

Viewer • Updated Feb 10 • 896k • 2k • 145

SynthLabsAI/Big-Math-RL-Verified

Viewer • Updated Mar 25 • 251k • 5.57k • 181

upvoted a collection 3 months ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated 15 days ago • 146

liked a dataset 3 months ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18 • 450k • 26.8k • 585

liked a model 6 months ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Jan 12 • 28.7k • 1.73k