Gemma 3 QAT Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory Collection by google 15 days ago 182 google/gemma-3-4b-it-qat-q4_0-gguf Image-Text-to-Text • Updated 22 days ago • 20.9k • 123 google/gemma-3-4b-pt-qat-q4_0-gguf Image-Text-to-Text • Updated 29 days ago • 1.03k • 16 google/gemma-3-1b-it-qat-q4_0-gguf Text Generation • Updated 22 days ago • 5.23k • 38 google/gemma-3-1b-pt-qat-q4_0-gguf Text Generation • Updated 29 days ago • 279 • 6
DeepSeek-Prover DeepSeek-Prover-Series Collection by deepseek-ai 3 days ago 42 deepseek-ai/DeepSeek-Prover-V2-671B Text Generation • Updated 3 days ago • 1.71k • • 599 deepseek-ai/DeepSeek-Prover-V2-7B Updated 3 days ago • 819 • 63 deepseek-ai/DeepSeek-ProverBench Viewer • Updated 3 days ago • 325 • 387 • 18 deepseek-ai/DeepSeek-Prover-V1.5-Base Updated Aug 29, 2024 • 1.25k • 17
Qwen3-abliterated Collection by huihui-ai 2 days ago 12 huihui-ai/Qwen3-0.6B-abliterated Text Generation • Updated about 7 hours ago • 43 • 7 huihui-ai/Qwen3-1.7B-abliterated Text Generation • Updated about 7 hours ago • 22 • 4 huihui-ai/Qwen3-4B-abliterated Text Generation • Updated about 7 hours ago • 18 • 4 huihui-ai/Qwen3-8B-abliterated Text Generation • Updated about 7 hours ago • 35 • 6
Perception LM Collection by facebook 16 days ago 38 facebook/Perception-LM-1B Image-Text-to-Text • Updated 3 days ago • 1.33k • 17 facebook/Perception-LM-3B Image-Text-to-Text • Updated 3 days ago • 76 • 15 facebook/Perception-LM-8B Image-Text-to-Text • Updated 3 days ago • 935 • 30 facebook/PLM-VideoBench Viewer • Updated 10 days ago • 44k • 1.79k • 8
Qwen2.5-VL Vision-language model series based on Qwen2.5 Collection by Qwen 4 days ago 460 Running 98 98 Qwen2.5 VL 32B Instruct Demo 🏃 Chat with images and videos using Qwen Running 232 232 Qwen2.5 VL 72B Instruct 💻 Chat with an AI that understands text and images Qwen2.5-VL Technical Report Paper • 2502.13923 • Published Feb 19 • 184 Qwen/Qwen2.5-VL-32B-Instruct Image-Text-to-Text • Updated 19 days ago • 422k • 351
OpenMathReasoning Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" Collection by nvidia 8 days ago 34 nvidia/OpenMathReasoning Viewer • Updated 9 days ago • 5.47M • 19.5k • 157 nvidia/OpenMath-Nemotron-1.5B Text Generation • Updated 2 days ago • 783 • 19 nvidia/OpenMath-Nemotron-7B Text Generation • Updated 2 days ago • 229 • 7 nvidia/OpenMath-Nemotron-14B Text Generation • Updated 2 days ago • 282 • 10
GLM-4-0414 GLM-4-0414 series model Collection by THUDM 18 days ago 121 THUDM/GLM-Z1-32B-0414 Text Generation • Updated 5 days ago • 4.18k • • 153 THUDM/GLM-4-32B-0414 Text Generation • Updated 1 day ago • 14.6k • • 363 THUDM/GLM-Z1-Rumination-32B-0414 Text Generation • Updated 18 days ago • 1.49k • • 95 THUDM/GLM-Z1-9B-0414 Text Generation • Updated 5 days ago • 4.2k • • 61
Meta's Llama 3.1 models & evals Collection by meta-llama Dec 13, 2024 142 meta-llama/Llama-3.1-8B Text Generation • Updated Oct 16, 2024 • 1.17M • 1.59k meta-llama/Llama-3.1-70B Text Generation • Updated Sep 25, 2024 • 98.2k • 358 meta-llama/Llama-3.1-405B Text Generation • Updated Sep 25, 2024 • 20.7k • 929 meta-llama/Llama-3.1-70B-Instruct Text Generation • Updated Dec 15, 2024 • 1.19M • • 807
Gemma 3 Release Collection by google 15 days ago 351 google/gemma-3-4b-it Image-Text-to-Text • Updated Mar 21 • 619k • 487 google/gemma-3-4b-pt Image-Text-to-Text • Updated Mar 21 • 56k • 69 google/gemma-3-1b-pt Text Generation • Updated Mar 21 • 168k • 110 google/gemma-3-1b-it Text Generation • Updated 29 days ago • 2.42M • 358
Kimi-Audio-7B Kimi audio 7B models Collection by moonshotai 5 days ago 8 moonshotai/Kimi-Audio-7B Text-to-Speech • Updated 6 days ago • 166 • 34 moonshotai/Kimi-Audio-7B-Instruct Text-to-Speech • Updated 5 days ago • 3.09k • 270 moonshotai/Kimi-Audio-GenTest Viewer • Updated 5 days ago • 191 • 455 • 2