Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Inference Providers
Fireworks
Together AI
Novita
Nebius AI Studio
Hyperbolic
SambaNova
fal
Cohere
Replicate
Cerebras
HF Inference API
Misc
Proximal Policy Optimization
Inference Endpoints
text-generation-inference

Misc with no match

Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts

Models

2
Full-text search
Active filters: Proximal Policy Optimization

LilHairdy/cleanrl_memory_gym

Reinforcement Learning • Updated Sep 17, 2024

estnafinema0/smolLM-variation-ppo

Text Generation • Updated Mar 30 • 3
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs