Not All Correct Answers Are Equal: Why Your Distillation Source Matters Paper • 2505.14464 • Published 14 days ago • 8
Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking Paper • 2503.19855 • Published Mar 25 • 28
high-quality Chinese training datasets Collection A suite of high-quality Chinese datasets for pretraining, fine-tuning, or preference alignment, along with the models trained on them. • 13 items • Updated 13 days ago • 17