Model Merging in Pre-training of Large Language Models Paper • 2505.12082 • Published 17 days ago • 35
CFSP: An Efficient Structured Pruning Framework for LLMs with Coarse-to-Fine Activation Information Paper • 2409.13199 • Published Sep 20, 2024
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper • 2501.09686 • Published Jan 16 • 41
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14 • 266