Qiaosheng ZHANG
Domingo12
AI & ML interests
None yet
Recent Activity
authored
a paper
14 days ago
CPGD: Toward Stable Rule-based Reinforcement Learning for Language
Models
authored
a paper
14 days ago
MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable
Step-Level Supervision
authored
a paper
3 months ago
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale
Reinforcement Learning
Organizations
None yet
Domingo12's activity
No public activity