kevinpro's Collections
R-PRM • updated Mar 31
R-PRM: Reasoning-Driven Process Reward Modeling
kevinpro/R-PRM-7B-DPO
Text Generation • Updated Mar 28 • 7
R-PRM: Reasoning-Driven Process Reward Modeling
Paper • 2503.21295 • Published Mar 27
kevinpro/R-PRM
Viewer • Updated Mar 28 • 594k • 383