CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition Paper • 2505.13380 • Published 15 days ago • 5
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs Paper • 2504.11536 • Published Apr 15 • 61
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking Paper • 2503.00955 • Published Mar 2 • 27
LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models Paper • 2411.00918 • Published Nov 1, 2024 • 8
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning Paper • 2410.02884 • Published Oct 3, 2024 • 55
CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding Capabilities of CodeLLMs Paper • 2410.01999 • Published Oct 2, 2024 • 10