Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper • 2504.21233 • Published 8 days ago • 37
Phi-4 Collection Phi-4 family of small language, multi-modal and reasoning models. • 13 items • Updated 6 days ago • 141
YaRN: Efficient Context Window Extension of Large Language Models Paper • 2309.00071 • Published Aug 31, 2023 • 71
Learning Adaptive Parallel Reasoning with Language Models Paper • 2504.15466 • Published 16 days ago • 42
Hyper-RAG: Combating LLM Hallucinations using Hypergraph-Driven Retrieval-Augmented Generation Paper • 2504.08758 • Published Mar 30 • 3
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves quality similar to half precision while using 3x less memory • 15 items • Updated 20 days ago • 187
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge Paper • 2407.00088 • Published Jun 25, 2024 • 12
1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs Paper • 2410.16144 • Published Oct 21, 2024 • 5
BitNet Collection 🔥BitNet family of large language models (1-bit LLMs). • 7 items • Updated 7 days ago • 36
Cognition Collection Perception and abstraction. Each modality is tokenized and embedded into vectors for the model to comprehend. • 200 items • Updated 23 days ago • 5
Training Software Engineering Agents and Verifiers with SWE-Gym Paper • 2412.21139 • Published Dec 30, 2024 • 23
Mirostat: A Neural Text Decoding Algorithm that Directly Controls Perplexity Paper • 2007.14966 • Published Jul 29, 2020 • 1