ubermenchh/llama3.1-8B-gsm8k-grpo
Tags: PyTorch, Safetensors, GGUF, llama, unsloth, trl, grpo, conversational
License: mit
Files and versions
Branch: main
1 contributor. History: 5 commits.
Latest commit: "(Trained with Unsloth)", e6b220a (verified), by ubermenchh, 3 months ago.
| File | Security scan | Size | LFS | Last commit message | Updated |
|---|---|---|---|---|---|
| .gitattributes | Safe | 1.62 kB | | (Trained with Unsloth) | 3 months ago |
| README.md | | 57 Bytes | | Trained with Unsloth | 3 months ago |
| adapter_config.json | Safe | 814 Bytes | | Trained with Unsloth | 3 months ago |
| adapter_model.safetensors | Safe | 336 MB | LFS | Trained with Unsloth | 3 months ago |
| config.json | Safe | 989 Bytes | | Trained with Unsloth | 3 months ago |
| generation_config.json | Safe | 166 Bytes | | Trained with Unsloth | 3 months ago |
| pytorch_model-00001-of-00004.bin | Safe (pickle) | 4.98 GB | LFS | Trained with Unsloth | 3 months ago |
| pytorch_model-00002-of-00004.bin | Safe (pickle) | 5 GB | LFS | Trained with Unsloth | 3 months ago |
| pytorch_model-00003-of-00004.bin | Safe (pickle) | 4.92 GB | LFS | Trained with Unsloth | 3 months ago |
| pytorch_model-00004-of-00004.bin | Safe (pickle) | 1.17 GB | LFS | Trained with Unsloth | 3 months ago |
| pytorch_model.bin.index.json | Safe | 24 kB | | Trained with Unsloth | 3 months ago |
| special_tokens_map.json | Safe | 454 Bytes | | Upload tokenizer | 3 months ago |
| tokenizer.json | Safe | 17.2 MB | LFS | Upload tokenizer | 3 months ago |
| tokenizer_config.json | Safe | 55.5 kB | | Upload tokenizer | 3 months ago |
| unsloth.Q8_0.gguf | | 3.52 GB | LFS | (Trained with Unsloth) | 3 months ago |

Detected pickle imports (3) in each of the four pytorch_model-*.bin shards: "torch.HalfStorage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2".
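
The pickle imports detected for the pytorch_model-*.bin shards are the globals that each shard's pickle stream references. As a rough illustration of how such a list can be derived, the sketch below walks a pickle's opcodes with the standard-library pickletools module without executing them. It is not the Hub's actual scanner: the function name scan_pickle_imports and the sample payload are hypothetical, and only the GLOBAL opcode (pickle protocols 0 to 3) is handled.

```python
import collections
import pickle
import pickletools


def scan_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals referenced by a pickle byte stream.

    Simplification (assumption of this sketch): only the GLOBAL opcode used by
    pickle protocols 0-3 is handled; protocol 4+ emits STACK_GLOBAL, which is
    ignored here.
    """
    found = set()
    # genops only decodes opcodes; it never imports or calls anything.
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # For GLOBAL, arg is a string such as "collections OrderedDict".
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
    return found


# Illustrative payload: an OrderedDict pickled with protocol 2.
payload = pickle.dumps(collections.OrderedDict(a=1), protocol=2)
print(sorted(scan_pickle_imports(payload)))  # ['collections.OrderedDict']
```

Because the stream is only decoded and never loaded, a scan like this can flag unexpected globals before any untrusted code has a chance to run.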