Ryukijano's Collections

Deep Reinforcement Learning

updated Mar 19, 2024

Features implementations of, and papers on, popular RL algorithms and new paradigms across a variety of environments.

  • Ryukijano/rl_course_vizdoom_health_gathering_supreme

    Reinforcement Learning • Updated Mar 21, 2023

  • Ryukijano/Mujoco_rl_halfcheetah_Decision_Trasformer

    Reinforcement Learning • Updated Jul 22, 2023 • 5

  • Ryukijano/poca-SoccerTwos

    Reinforcement Learning • Updated Jul 18, 2023 • 32

  • AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

    Paper • 2308.03526 • Published Aug 7, 2023 • 26

  • RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

    Paper • 2309.00267 • Published Sep 1, 2023 • 50

  • PERL: Parameter Efficient Reinforcement Learning from Human Feedback

    Paper • 2403.10704 • Published Mar 15, 2024 • 60