kkkai's picture

2 5 1

kkkai

Zkkkai

·

AI & ML interests

None yet

Recent Activity

authored a paper 13 days ago

MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision

updated a model 13 days ago

Zkkkai/CPGD-7B

new activity 13 days ago

Zkkkai/CPGD-7B:Add pipeline tag, library name and paper information

View all activity

Organizations

None yet

Zkkkai's activity

New activity in Zkkkai/CPGD-7B 13 days ago

Add pipeline tag, library name and paper information

#1 opened 14 days ago by

commented a paper 14 days ago

CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models

Paper • 2505.12504 • Published 16 days ago • 23 •