ymh233
ymh233
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for
Alignment with Human Values
upvoted
a
paper
about 2 months ago
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for
Open Base Models in the Wild
upvoted
a
paper
2 months ago
Process-based Self-Rewarding Language Models
Organizations
models
0
None public yet
datasets
0
None public yet