2 5

Lingxiao Du

Cierra0506

AI & ML interests

None yet

Recent Activity

updated a model 7 days ago

Cierra0506/MM-PRM

updated a dataset 7 days ago

Cierra0506/MM-K12

published a model 8 days ago

Cierra0506/MM-PRM

View all activity

Organizations

None yet

Cierra0506's activity

updated a model 7 days ago

Cierra0506/MM-PRM

Updated 7 days ago • 1

updated a dataset 7 days ago

Cierra0506/MM-K12

Viewer • Updated 7 days ago • 10k • 64

published a model 8 days ago

Cierra0506/MM-PRM

Updated 7 days ago • 1

published a dataset 8 days ago

Cierra0506/MM-K12

Viewer • Updated 7 days ago • 10k • 64

authored 2 papers 14 days ago

CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models

Paper • 2505.12504 • Published 16 days ago • 23

MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision

Paper • 2505.13427 • Published 15 days ago • 25

upvoted 2 papers 14 days ago

CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models

Paper • 2505.12504 • Published 16 days ago • 23

MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision

Paper • 2505.13427 • Published 15 days ago • 25

commented a paper 14 days ago

MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision

Paper • 2505.13427 • Published 15 days ago • 25 •

authored a paper 3 months ago

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published Mar 10 • 62

upvoted 2 papers 3 months ago

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published Mar 10 • 62

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published Feb 25 • 73

upvoted a paper 12 months ago

GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices

Paper • 2406.08451 • Published Jun 12, 2024 • 26