Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
3
Junjie Lu
Lux0926
Follow
yuliang03181's profile picture
kylebrovloski's profile picture
2 followers
·
1 following
Lux0926
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 10 hours ago
Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model
upvoted
a
paper
24 days ago
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
authored
a paper
about 2 months ago
ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks
View all activity
Organizations
Papers
2
arxiv:
2502.13943
arxiv:
2311.09835
models
10
Sort: Recently updated
Lux0926/ASPRM-D-ORM
Updated
Feb 7
Lux0926/LCD-DS
Updated
Feb 7
•
1
Lux0926/ASPRM-Training-Evaluation-Environment
Updated
Feb 6
Lux0926/ASPRM-MATHCODE-DeepSeek
Updated
Jan 30
•
2
Lux0926/ASPRM-MATHCODE-Mistral
Updated
Jan 30
•
4
Lux0926/ASPRM-D
Updated
Jan 18
•
2
Lux0926/ASPRM-L
Updated
Jan 18
•
8
•
1
Lux0926/ASPRM-M
Updated
Jan 18
•
9
•
1
Lux0926/metamath_mistral_7b
Updated
Jan 18
•
3
Lux0926/metamath_llama31_8b
Updated
Jan 18
•
5
•
1
datasets
10
Sort: Recently updated
Lux0926/ASPRM-BON-Evaluation-Dataset-Code
Preview
•
Updated
Feb 21
•
98
Lux0926/ASPRM-BON-Evaluation-Dataset-Math
Preview
•
Updated
Feb 21
•
124
Lux0926/ASPRM-Math-Rollout-Result
Viewer
•
Updated
Feb 7
•
215k
•
38
Lux0926/ASPRM-MATHCODE-DeepSeek-Training-Dataset
Viewer
•
Updated
Feb 7
•
99.8k
•
17
Lux0926/ASPRM-MATHCODE-Mistral-Training-Dataset
Viewer
•
Updated
Feb 7
•
438k
•
14
Lux0926/ASPRM-D-Training-Dataset
Viewer
•
Updated
Feb 7
•
49.9k
•
36
Lux0926/ASPRM-L-Training-Dataset
Viewer
•
Updated
Feb 7
•
372k
•
17
Lux0926/ASPRM-D-Training-Dataset-ORM
Viewer
•
Updated
Feb 7
•
49.9k
•
15
Lux0926/ASPRM-M-Training-Dataset
Viewer
•
Updated
Feb 7
•
388k
•
22
Lux0926/ASPRM-Code-Rollout-Result
Updated
Feb 7
•
6