Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Inference Providers
Nebius AI Studio
SambaNova
fal
Novita
Cohere
Hyperbolic
Cerebras
Fireworks
Replicate
Together AI
Nscale
HF Inference API
Misc
rlvr
Inference Endpoints

Misc with no match

text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts

Models

6
Full-text search
Active filters: rlvr

SultanR/SmolTulu-1.7b-Reinforced-GGUF

Text Generation • Updated Dec 17, 2024 • 4 • 1

thuml/rt1-world-model-multi-step-rlvr

Updated 8 days ago • 8

thuml/rt1-world-model-single-step-rlvr

Updated 8 days ago • 6

thuml/webarena-world-model-rlvr

Updated 8 days ago • 5

thuml/bytesized32-world-model-rlvr-binary-reward

Updated 8 days ago • 4

thuml/bytesized32-world-model-rlvr-task-specific-reward

Updated 8 days ago • 3
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs