Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections trending this week

Reasoning Datasets

bespokelabs/Bespoke-Stratos-17k

Viewer • Updated Jan 31 • 16.7k • 15.5k • 308
open-thoughts/OpenThoughts-114k

Viewer • Updated Apr 6 • 228k • 19.7k • 704
AI-MO/NuminaMath-CoT

Viewer • Updated Nov 25, 2024 • 860k • 2.75k • 447
AI-MO/NuminaMath-1.5

Viewer • Updated Feb 10 • 896k • 2.14k • 137

Mistral Small 3 (All Versions)

A collection of Mistral's new Small 3.1 and 3 models including GGUF, 4-bit and more!

unsloth/Mistral-Small-3.1-24B-Instruct-2503-GGUF

Updated 2 days ago • 26.4k • 52
unsloth/Mistral-Small-3.1-24B-Instruct-2503-unsloth-bnb-4bit

Updated 4 days ago • 6.62k • 4
unsloth/Mistral-Small-3.1-24B-Instruct-2503-bnb-4bit

Updated 4 days ago • 3.51k • 1
unsloth/Mistral-Small-3.1-24B-Instruct-2503

Updated 2 days ago • 2.38k • 13

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community

bespokelabs/Bespoke-Stratos-17k

Viewer • Updated Jan 31 • 16.7k • 15.5k • 308
open-thoughts/OpenThoughts-114k

Viewer • Updated Apr 6 • 228k • 19.7k • 704
open-r1/OpenThoughts-114k-math

Viewer • Updated Jan 30 • 89.1k • 1.08k • 81
PrimeIntellect/NuminaMath-QwQ-CoT-5M

Viewer • Updated Jan 22 • 5.14M • 1.38k • 48

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models.

unsloth/DeepSeek-R1-GGUF-UD

Text Generation • Updated 14 days ago • 6.18k • 13
unsloth/DeepSeek-R1-GGUF

Text Generation • Updated 18 days ago • 127k • 1.06k
unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF

Updated Jan 25 • 18.3k • 132
unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF

Text Generation • Updated 1 day ago • 36.8k • 264

We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark.

nvidia/AceMath-1.5B-Instruct

Text Generation • Updated Jan 17 • 5.49k • 11
nvidia/AceMath-7B-Instruct

Text Generation • Updated Jan 17 • 4.25k • • 23
nvidia/AceMath-72B-Instruct

Text Generation • Updated Jan 17 • 2.98k • 17
nvidia/AceMath-7B-RM

Text Generation • Updated Jan 17 • 6.57k • 6

Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model.

cognitivecomputations/Dolphin3.0-Mistral-24B

Text Generation • Updated 17 days ago • 1.31k • 58
cognitivecomputations/Dolphin3.0-R1-Mistral-24B

Text Generation • Updated 16 days ago • 1.72k • 191
cognitivecomputations/Dolphin3.0-Llama3.2-1B

Updated 16 days ago • 628 • 25
cognitivecomputations/Dolphin3.0-Llama3.2-3B

Updated 16 days ago • 448 • 43

Google's Gemma models family

google/gemma-2b

Text Generation • Updated Sep 27, 2024 • 583k • 996
google/gemma-2b-it

Text Generation • Updated Sep 27, 2024 • 117k • • 737
google/gemma-7b

Text Generation • Updated Jun 27, 2024 • 69.3k • 3.17k
google/gemma-7b-it

Text Generation • Updated Aug 14, 2024 • 99.3k • 1.17k

Meta's Llama 3.1 models & evals

meta-llama/Llama-3.1-8B

Text Generation • Updated Oct 16, 2024 • 1.32M • • 1.6k
meta-llama/Llama-3.1-70B

Text Generation • Updated Sep 25, 2024 • 106k • • 361
meta-llama/Llama-3.1-405B

Text Generation • Updated Sep 25, 2024 • 14.8k • 932
meta-llama/Llama-3.1-70B-Instruct

Text Generation • Updated Dec 15, 2024 • 1.2M • • 809

Model trained in the paper: EHRSHOT: An EHR Benchmark for Few-Shot Evaluation of Foundation Models (https://arxiv.org/abs/2307.02028)

StanfordShahLab/clmbr-t-base

Updated Mar 7 • 769 • 64
StanfordShahLab/clmbr-t-base-random

Updated Dec 13, 2023 • 3 • 4

The first generation of models pretrained on Common Corpus.

PleIAs/Pleias-350m-Preview

Updated Feb 14 • 520 • 22
PleIAs/Pleias-Pico

Updated Feb 14 • 171 • 33
PleIAs/Pleias-1.2b-Preview

Updated Dec 5, 2024 • 367 • 19
PleIAs/Pleias-Nano

Updated Dec 5, 2024 • 427 • 37

Previous
1
...
8
9
10
11
12
...
11,780
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs