CoIn: Counting the Invisible Reasoning Tokens in Commercial Opaque LLM APIs • Paper • 2505.13778 • Published 15 days ago • 4
Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers • Paper • 2410.13184 • Published Oct 17, 2024 • 3
What Matters in Transformers? Not All Attention is Needed • Paper • 2406.15786 • Published Jun 22, 2024 • 32