alpaca_farm
RL simulator
A framework for simulating and evaluating reinforcement learning from human feedback methods
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
782 stars
9 watching
59 forks
Language: Python
Last commit: 5 months ago
Linked from 1 awesome list
deep-learning, instruction-following, large-language-models, natural-language-processing, reinforcement-learning-from-human-feedback
Related projects:
Repository | Description | Stars |
---|---|---|
horizonrobotics/alf | A reinforcement learning framework designed to implement complex algorithms with flexibility and ease of use | 302 |
layssi/carla_ray_rlib | An open-source reinforcement learning framework for autonomous driving tasks using the CARLA simulator environment and the Ray/RLlib libraries. | 35 |
gokulnc/setting-up-carla-reinforcement-learning | Provides a framework for using CARLA as a reinforcement learning environment | 95 |
kunqian2025/reinforcement-learning | A collection of implementations of reinforcement learning algorithms in MATLAB | 60 |
carla-simulator/reinforcement-learning | An implementation of an actor-critic reinforcement learning algorithm in Python. | 243 |
enlite-ai/maze | An RL framework for building and training reinforcement learning models in Python | 265 |
xrsrke/instructgoose | A framework for training language models using human feedback and reinforcement learning | 169 |
luchris429/purejaxrl | A high-performance implementation of reinforcement learning training pipelines using JAX and PyTorch-like functionality | 722 |
sjtu-marl/malib | A framework for parallel population-based reinforcement learning | 497 |
kaixhin/rainbow | A Python implementation of a deep reinforcement learning algorithm combining multiple techniques for improved performance in Atari games | 1,585 |
eleurent/rl-agents | A collection of implementations of Reinforcement Learning and planning algorithms in Python. | 589 |
alex-petrenko/sample-factory | A high-throughput reinforcement learning library with optimized synchronous and asynchronous implementations of policy gradients. | 828 |
zhangfuyang/rl_carla | A reinforcement learning project to train an autonomous driving agent in a simulated environment using a deep learning approach | 228 |
coax-dev/coax | A modular framework for building reinforcement learning agents in Python using Gymnasium and JAX. | 167 |
nadavbh12/retro-learning-environment | A framework for training AI algorithms using game consoles as input | 185 |