Junjie Lu's picture

2 3

Junjie Lu

Lux0926

·

Lux0926

AI & ML interests

None yet

Recent Activity

upvoted a paper about 10 hours ago

Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model

upvoted a paper 24 days ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

authored a paper about 2 months ago

ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks

View all activity

Organizations

Papers 2

arxiv:2502.13943

arxiv:2311.09835

models 10

Lux0926/ASPRM-D-ORM

Lux0926/LCD-DS

Updated Feb 7 • 1

Lux0926/ASPRM-Training-Evaluation-Environment

Lux0926/ASPRM-MATHCODE-DeepSeek

Updated Jan 30 • 2

Lux0926/ASPRM-MATHCODE-Mistral

Updated Jan 30 • 4

Lux0926/ASPRM-D

Updated Jan 18 • 2

Lux0926/ASPRM-L

Updated Jan 18 • 8 • 1

Lux0926/ASPRM-M

Updated Jan 18 • 9 • 1

Lux0926/metamath_mistral_7b

Updated Jan 18 • 3

Lux0926/metamath_llama31_8b

Updated Jan 18 • 5 • 1

datasets 10

Lux0926/ASPRM-BON-Evaluation-Dataset-Code

Preview • Updated Feb 21 • 98

Lux0926/ASPRM-BON-Evaluation-Dataset-Math

Preview • Updated Feb 21 • 124

Lux0926/ASPRM-Math-Rollout-Result

Viewer • Updated Feb 7 • 215k • 38

Lux0926/ASPRM-MATHCODE-DeepSeek-Training-Dataset

Viewer • Updated Feb 7 • 99.8k • 17

Lux0926/ASPRM-MATHCODE-Mistral-Training-Dataset

Viewer • Updated Feb 7 • 438k • 14

Lux0926/ASPRM-D-Training-Dataset

Viewer • Updated Feb 7 • 49.9k • 36

Lux0926/ASPRM-L-Training-Dataset

Viewer • Updated Feb 7 • 372k • 17

Lux0926/ASPRM-D-Training-Dataset-ORM

Viewer • Updated Feb 7 • 49.9k • 15

Lux0926/ASPRM-M-Training-Dataset

Viewer • Updated Feb 7 • 388k • 22

Lux0926/ASPRM-Code-Rollout-Result

Updated Feb 7 • 6