alpaca_farm
RL simulator
A framework for simulating and evaluating reinforcement learning from human feedback methods
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
786 stars
9 watching
59 forks
Language: Python
last commit: 8 months ago
Linked from 1 awesome list
deep-learninginstruction-followinglarge-language-modelsnatural-language-processingreinforcement-learning-from-human-feedback
Related projects:
Repository | Description | Stars |
---|---|---|
| A framework for implementing complex reinforcement learning algorithms with flexibility and ease of implementation | 306 |
| An open-source reinforcement learning framework for autonomous driving tasks using the Carla-Simulator environment and Ray/Rllib libraries. | 35 |
| Provides a framework for using CARLA as a reinforcement learning environment | 95 |
| A collection of implementations of reinforcement learning algorithms in MATLAB | 61 |
| An implementation of an actor-critic reinforcement learning algorithm in Python. | 245 |
| An RL framework for building and training reinforcement learning models in Python | 266 |
| A framework for training language models using human feedback and reinforcement learning | 171 |
| A high-performance implementation of reinforcement learning training pipelines using JAX and PyTorch-like functionality | 755 |
| A framework for parallel population-based reinforcement learning | 507 |
| A Python implementation of a deep reinforcement learning algorithm combining multiple techniques for improved performance in Atari games | 1,591 |
| A collection of implementations of Reinforcement Learning and planning algorithms in Python. | 596 |
| A high-throughput reinforcement learning library with optimized synchronous and asynchronous implementations of policy gradients. | 839 |
| A reinforcement learning project to train an autonomous driving agent in a simulated environment using a deep learning approach | 230 |
| A modular framework for building reinforcement learning agents in Python using Gymnasium and JAX. | 168 |
| A framework for training AI algorithms using game consoles as input | 185 |