- Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement
  Paper • 2501.12273 • Published • 14
- CritiQ: Mining Data Quality Criteria from Human Preferences
  Paper • 2502.19279 • Published • 9
- Instruction Pre-Training: Language Models are Supervised Multitask Learners
  Paper • 2406.14491 • Published • 94
Eric NG
Eric108
AI & ML interests
NLP
Recent Activity
upvoted a paper 4 days ago
Process Reinforcement through Implicit Rewards
upvoted a paper 4 days ago
UltraIF: Advancing Instruction Following from the Wild
upvoted a collection 5 days ago
Qwen3
Organizations
None yet
Collections
2
- Large Language Models Can Self-Improve in Long-context Reasoning
  Paper • 2411.08147 • Published • 67
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
  Paper • 2411.11504 • Published • 23
- Auto-Evolve: Enhancing Large Language Model's Performance via Self-Reasoning Framework
  Paper • 2410.06328 • Published • 2
- Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability
  Paper • 2411.19943 • Published • 64
Models
0
None public yet
Datasets
0
None public yet