BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Paper • 2504.18415 • Published 5 days ago • 38
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention Paper • 2504.16083 • Published 8 days ago • 8
BitNet Collection 🔥BitNet family of large language models (1-bit LLMs). • 7 items • Updated about 21 hours ago • 32
Language Model Fine-Tuning on Scaled Survey Data for Predicting Distributions of Public Opinions Paper • 2502.16761 • Published Feb 24
Inference-Time Scaling for Complex Tasks: Where We Stand and What Lies Ahead Paper • 2504.00294 • Published 29 days ago • 10
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks Paper • 2503.21696 • Published Mar 27 • 22