Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Nebius AI Studio
SambaNova
fal
Novita
Cohere
Hyperbolic
Cerebras
Fireworks
Replicate
Together AI
Nscale
HF Inference API
Misc
Reset Misc
rlvr
Inference Endpoints
Misc with no match
text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
6
Full-text search
Edit filters
Sort: Trending
Active filters:
rlvr
Clear all
SultanR/SmolTulu-1.7b-Reinforced-GGUF
Text Generation
•
Updated
Dec 17, 2024
•
4
•
1
thuml/rt1-world-model-multi-step-rlvr
Updated
8 days ago
•
8
thuml/rt1-world-model-single-step-rlvr
Updated
8 days ago
•
6
thuml/webarena-world-model-rlvr
Updated
8 days ago
•
5
thuml/bytesized32-world-model-rlvr-binary-reward
Updated
8 days ago
•
4
thuml/bytesized32-world-model-rlvr-task-specific-reward
Updated
8 days ago
•
3