zhang
prvmax
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 10 hours ago
SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning
upvoted
a
paper
2 months ago
S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement
Learning
Organizations
None yet
models
0
None public yet
datasets
0
None public yet