Jina Chris

jinachris

Organizations

None yet

jinachris's activity

updated a model 5 days ago

jinachris/PURE-PRM-7B

Token Classification • Updated 5 days ago • 55 • 4

upvoted 2 papers 5 days ago

Sherlock: Self-Correcting Reasoning in Vision-Language Models

Paper • 2505.22651 • Published 6 days ago • 50

Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining

Paper • 2410.00564 • Published Oct 1, 2024 • 1

authored a paper 5 days ago

Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning

Paper • 2504.15275 • Published Apr 21 • 1

liked a model 5 days ago

deepseek-ai/DeepSeek-R1-0528

Text Generation • Updated 5 days ago • 47.9k • 1.68k

authored a paper 5 days ago

Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining

Paper • 2410.00564 • Published Oct 1, 2024 • 1

upvoted 2 papers 5 days ago

Skywork Open Reasoner 1 Technical Report

Paper • 2505.22312 • Published 6 days ago • 50

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published 6 days ago • 109

upvoted a paper 6 days ago

Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning

Paper • 2504.15275 • Published Apr 21 • 1

updated a collection 12 days ago

PURE

Collection

PRM and fine-tuned LLM used in our PURE GitHub repo: https://github.com/CJReinforce/PURE • 5 items • Updated 12 days ago • 2

upvoted a paper 2 months ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published Mar 31 • 62

upvoted an article 3 months ago

Open R1: Update #3

Article • and 9 others • Mar 11 • 291

upvoted a collection 3 months ago

PURE

Collection

PRM and fine-tuned LLM used in our PURE GitHub repo: https://github.com/CJReinforce/PURE • 5 items • Updated 12 days ago • 2

upvoted a paper 3 months ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 84

updated 3 models 3 months ago

liked 3 models 3 months ago

jinachris/Qwen2.5-7B-PURE-PRM

Text Generation • Updated Feb 23 • 7 • 1

jinachris/Qwen2.5-7B-PURE-VR

Text Generation • Updated Feb 23 • 11 • 1

jinachris/Qwen2.5-7B-PURE-PRMVR

Text Generation • Updated Feb 23 • 5 • 1