Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Skier8402
's Collections
Realtime-apps
Leaderboards
Quantization tools
3Dmodels
Reasoning-models
Embedding models
Swahili models
multimodal
Diffusion model tools
metrics
RAG-agents
Speech apps
Prompts
Interesting finds
Chat-agents
Datasets
LLM-transparency-tools
Data creation
Computer vision
Datasets
updated
about 24 hours ago
Interesting datasets to help train LLMs and beyond
Upvote
-
Open-Orca/OpenOrca
Viewer
•
Updated
28 days ago
•
2.94M
•
11.2k
•
1.38k
NeelNanda/pile-10k
Viewer
•
Updated
Oct 14, 2022
•
10k
•
7.04k
•
20
legacy-datasets/mc4
Updated
Mar 5, 2024
•
29.1k
•
151
oscar-corpus/oscar
Updated
Mar 21, 2024
•
38.7k
•
184
deepset/prompt-injections
Viewer
•
Updated
Jul 30, 2024
•
662
•
1.54k
•
55
epfl-llm/guidelines
Viewer
•
Updated
Mar 7, 2024
•
38k
•
1.48k
•
123
wanng/midjourney-v5-202304-clean
Viewer
•
Updated
May 24, 2024
•
1.7M
•
168
•
89
CohereForAI/aya_dataset
Viewer
•
Updated
Jun 28, 2024
•
206k
•
2.56k
•
300
google/fleurs
Updated
Aug 25, 2024
•
29.8k
•
276
HuggingFaceTB/cosmopedia
Viewer
•
Updated
Aug 12, 2024
•
31.1M
•
29.6k
•
593
microsoft/orca-math-word-problems-200k
Viewer
•
Updated
Mar 4, 2024
•
200k
•
2.08k
•
443
HuggingFaceFW/fineweb
Viewer
•
Updated
Jan 31
•
25B
•
297k
•
2.04k
proj-persona/PersonaHub
Viewer
•
Updated
15 days ago
•
375k
•
11.3k
•
547
nyu-visionx/Cambrian-10M
Preview
•
Updated
Jul 8, 2024
•
15.9k
•
108
BAAI/Infinity-Instruct
Viewer
•
Updated
22 days ago
•
20.4M
•
5.23k
•
602
NousResearch/hermes-function-calling-v1
Viewer
•
Updated
Aug 30, 2024
•
11.6k
•
2.18k
•
272
meta-llama/Llama-3.1-405B-Instruct
Text Generation
•
Updated
Sep 25, 2024
•
40k
•
•
568
OpenAssistant/oasst2
Viewer
•
Updated
Jan 11, 2024
•
135k
•
2k
•
245
OpenAssistant/oasst1
Viewer
•
Updated
May 2, 2023
•
88.8k
•
9.49k
•
1.36k
HuggingFaceTB/smoltalk
Viewer
•
Updated
Feb 10
•
2.2M
•
8.6k
•
316
NovaSky-AI/Sky-T1_data_17k
Viewer
•
Updated
Jan 14
•
16.4k
•
1.53k
•
179
cognitivecomputations/dolphin-r1
Viewer
•
Updated
Jan 30
•
814k
•
4.42k
•
273
HuggingFaceFW/fineweb-2
Viewer
•
Updated
Jan 8
•
12.5B
•
71.9k
•
448
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
Jan 31
•
3.3B
•
492k
•
652
open-thoughts/OpenThoughts-114k
Viewer
•
Updated
27 days ago
•
228k
•
63.8k
•
656
open-r1/OpenR1-Math-220k
Viewer
•
Updated
29 days ago
•
450k
•
53.7k
•
515
lelapa/kenspeech
Viewer
•
Updated
Jan 27
•
1.06k
•
269
•
2
lelapa/Inkuba-Mono
Viewer
•
Updated
Sep 5, 2024
•
68.8M
•
32
•
12
lelapa/Inkuba-instruct
Viewer
•
Updated
Sep 5, 2024
•
212M
•
439
•
8
mozilla-foundation/common_voice_17_0
Viewer
•
Updated
Jun 16, 2024
•
13M
•
34.4k
•
234
intronhealth/afrimedqa_v2
Viewer
•
Updated
Feb 10
•
15.3k
•
87
•
8
intronhealth/afrispeech-dialog
Preview
•
Updated
Oct 28, 2024
•
237
•
2
intronhealth/afrispeech-200
Updated
Nov 20, 2023
•
1.43k
•
24
arcinstitute/opengenome2
Preview
•
Updated
29 days ago
•
9.05k
•
72
facebook/natural_reasoning
Viewer
•
Updated
26 days ago
•
1.15M
•
12.9k
•
426
Jofthomas/hermes-function-calling-thinking-V1
Viewer
•
Updated
Feb 16
•
3.57k
•
6.9k
•
26
CohereForAI/Global-MMLU
Viewer
•
Updated
16 days ago
•
487k
•
16.9k
•
112
FreedomIntelligence/medical-o1-reasoning-SFT
Viewer
•
Updated
25 days ago
•
50.1k
•
29.3k
•
502
Upvote
-
Share collection
View history
Collection guide
Browse collections