Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Qiaosheng ZHANG
Domingo12
Follow
AI & ML interests
None yet
Recent Activity
authored
a paper
13 days ago
CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models
authored
a paper
13 days ago
MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision
authored
a paper
3 months ago
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning
View all activity
Organizations
None yet
Papers
3
arxiv:
2505.13427
arxiv:
2505.12504
arxiv:
2503.07365
models
0
None public yet
datasets
0
None public yet