chris
emott
AI & ML interests
LLM, NLP
Recent Activity
upvoted a paper about 5 hours ago
Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model
upvoted a paper 24 days ago
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
Organizations
None yet
Models
None public yet
Datasets
None public yet