Softpick: No Attention Sink, No Massive Activations with Rectified Softmax Paper • 2504.20966 • Published 4 days ago • 19
MediAug: Exploring Visual Augmentation in Medical Imaging Paper • 2504.18983 • Published 7 days ago • 5
LLMs for Engineering: Teaching Models to Design High Powered Rockets Paper • 2504.19394 • Published 6 days ago • 6
DeepCritic: Deliberate Critique with Large Language Models Paper • 2505.00662 • Published 2 days ago • 35
Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think Paper • 2504.20708 • Published 4 days ago • 19
Taming the Titans: A Survey of Efficient LLM Inference Serving Paper • 2504.19720 • Published 5 days ago • 9
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published 4 days ago • 32
X-Fusion: Introducing New Modality to Frozen Large Language Models Paper • 2504.20996 • Published 4 days ago • 10
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning Paper • 2504.20073 • Published 9 days ago • 8
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published 4 days ago • 74
SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning Paper • 2504.19162 • Published 6 days ago • 14
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Paper • 2504.18415 • Published 8 days ago • 40
Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark Paper • 2504.16427 • Published 11 days ago • 16
DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs Paper • 2504.17040 • Published 10 days ago • 13
Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation Paper • 2504.17207 • Published 10 days ago • 28