alpaca_farm

RL simulator

A framework for simulating and evaluating reinforcement learning from human feedback (RLHF) methods.

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

782 stars
9 watching
59 forks
Language: Python
Last commit: 5 months ago
Linked from 1 awesome list

deep-learning, instruction-following, large-language-models, natural-language-processing, reinforcement-learning-from-human-feedback

Related projects:

Repository | Description | Stars
horizonrobotics/alf | A reinforcement learning framework designed to implement complex algorithms with flexibility and ease of use | 302
layssi/carla_ray_rlib | An open-source reinforcement learning framework for autonomous driving tasks using the Carla-Simulator environment and Ray/Rllib libraries | 35
gokulnc/setting-up-carla-reinforcement-learning | Provides a framework for using CARLA as a reinforcement learning environment | 95
kunqian2025/reinforcement-learning | A collection of implementations of reinforcement learning algorithms in MATLAB | 60
carla-simulator/reinforcement-learning | An implementation of an actor-critic reinforcement learning algorithm in Python | 243
enlite-ai/maze | An RL framework for building and training reinforcement learning models in Python | 265
xrsrke/instructgoose | A framework for training language models using human feedback and reinforcement learning | 169
luchris429/purejaxrl | A high-performance implementation of reinforcement learning training pipelines using JAX and PyTorch-like functionality | 722
sjtu-marl/malib | A framework for parallel population-based reinforcement learning | 497
kaixhin/rainbow | A Python implementation of a deep reinforcement learning algorithm combining multiple techniques for improved performance in Atari games | 1,585
eleurent/rl-agents | A collection of implementations of Reinforcement Learning and planning algorithms in Python | 589
alex-petrenko/sample-factory | A high-throughput reinforcement learning library with optimized synchronous and asynchronous implementations of policy gradients | 828
zhangfuyang/rl_carla | A reinforcement learning project to train an autonomous driving agent in a simulated environment using a deep learning approach | 228
coax-dev/coax | A modular framework for building reinforcement learning agents in Python using Gymnasium and JAX | 167
nadavbh12/retro-learning-environment | A framework for training AI algorithms using game consoles as input | 185