Reasoning Datasets Collection by ryanmarten 2 days ago 7 bespokelabs/Bespoke-Stratos-17k Viewer • Updated Jan 31 • 16.7k • 15.5k • 308 open-thoughts/OpenThoughts-114k Viewer • Updated Apr 6 • 228k • 19.7k • 704 AI-MO/NuminaMath-CoT Viewer • Updated Nov 25, 2024 • 860k • 2.75k • 447 AI-MO/NuminaMath-1.5 Viewer • Updated Feb 10 • 896k • 2.14k • 137
Mistral Small 3 (All Versions) A collection of Mistral's new Small 3.1 and 3 models including GGUF, 4-bit and more! Collection by unsloth 11 days ago 11 unsloth/Mistral-Small-3.1-24B-Instruct-2503-GGUF Updated 2 days ago • 26.4k • 52 unsloth/Mistral-Small-3.1-24B-Instruct-2503-unsloth-bnb-4bit Updated 4 days ago • 6.62k • 4 unsloth/Mistral-Small-3.1-24B-Instruct-2503-bnb-4bit Updated 4 days ago • 3.51k • 1 unsloth/Mistral-Small-3.1-24B-Instruct-2503 Updated 2 days ago • 2.38k • 13
🧠 Reasoning datasets Datasets with reasoning traces for math and code released by the community Collection by open-r1 4 days ago 137 bespokelabs/Bespoke-Stratos-17k Viewer • Updated Jan 31 • 16.7k • 15.5k • 308 open-thoughts/OpenThoughts-114k Viewer • Updated Apr 6 • 228k • 19.7k • 704 open-r1/OpenThoughts-114k-math Viewer • Updated Jan 30 • 89.1k • 1.08k • 81 PrimeIntellect/NuminaMath-QwQ-CoT-5M Viewer • Updated Jan 22 • 5.14M • 1.38k • 48
DeepSeek R1 (All Versions) DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. Collection by unsloth 11 days ago 222 unsloth/DeepSeek-R1-GGUF-UD Text Generation • Updated 14 days ago • 6.18k • 13 unsloth/DeepSeek-R1-GGUF Text Generation • Updated 18 days ago • 127k • 1.06k unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF Updated Jan 25 • 18.3k • 132 unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF Text Generation • Updated 1 day ago • 36.8k • 264
AceMath We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. Collection by nvidia 2 days ago 14 nvidia/AceMath-1.5B-Instruct Text Generation • Updated Jan 17 • 5.49k • 11 nvidia/AceMath-7B-Instruct Text Generation • Updated Jan 17 • 4.25k • 23 nvidia/AceMath-72B-Instruct Text Generation • Updated Jan 17 • 2.98k • 17 nvidia/AceMath-7B-RM Text Generation • Updated Jan 17 • 6.57k • 6
Dolphin 3.0 Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. Collection by cognitivecomputations Feb 7 145 cognitivecomputations/Dolphin3.0-Mistral-24B Text Generation • Updated 17 days ago • 1.31k • 58 cognitivecomputations/Dolphin3.0-R1-Mistral-24B Text Generation • Updated 16 days ago • 1.72k • 191 cognitivecomputations/Dolphin3.0-Llama3.2-1B Updated 16 days ago • 628 • 25 cognitivecomputations/Dolphin3.0-Llama3.2-3B Updated 16 days ago • 448 • 43
Google's Gemma models family Collection by google 24 days ago 180 google/gemma-2b Text Generation • Updated Sep 27, 2024 • 583k • 996 google/gemma-2b-it Text Generation • Updated Sep 27, 2024 • 117k • 737 google/gemma-7b Text Generation • Updated Jun 27, 2024 • 69.3k • 3.17k google/gemma-7b-it Text Generation • Updated Aug 14, 2024 • 99.3k • 1.17k
Meta's Llama 3.1 models & evals Collection by meta-llama Dec 13, 2024 146 meta-llama/Llama-3.1-8B Text Generation • Updated Oct 16, 2024 • 1.32M • 1.6k meta-llama/Llama-3.1-70B Text Generation • Updated Sep 25, 2024 • 106k • 361 meta-llama/Llama-3.1-405B Text Generation • Updated Sep 25, 2024 • 14.8k • 932 meta-llama/Llama-3.1-70B-Instruct Text Generation • Updated Dec 15, 2024 • 1.2M • 809
EHRSHOT Model trained in the paper: EHRSHOT: An EHR Benchmark for Few-Shot Evaluation of Foundation Models (https://arxiv.org/abs/2307.02028) Collection by StanfordShahLab Dec 10, 2024 2 StanfordShahLab/clmbr-t-base Updated Mar 7 • 769 • 64 StanfordShahLab/clmbr-t-base-random Updated Dec 13, 2023 • 3 • 4
Common Models The first generation of models pretrained on Common Corpus. Collection by PleIAs Dec 5, 2024 37 PleIAs/Pleias-350m-Preview Updated Feb 14 • 520 • 22 PleIAs/Pleias-Pico Updated Feb 14 • 171 • 33 PleIAs/Pleias-1.2b-Preview Updated Dec 5, 2024 • 367 • 19 PleIAs/Pleias-Nano Updated Dec 5, 2024 • 427 • 37