alpaca_farm

RL simulator

A framework for simulating and evaluating reinforcement learning from human feedback methods

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

GitHub

786 stars
9 watching
59 forks
Language: Python
Last commit: 7 months ago
Linked from 1 awesome list

Topics: deep-learning, instruction-following, large-language-models, natural-language-processing, reinforcement-learning-from-human-feedback

Related projects:

Repository | Description | Stars
horizonrobotics/alf | A framework for implementing complex reinforcement learning algorithms with flexibility and ease of implementation | 306
layssi/carla_ray_rlib | An open-source reinforcement learning framework for autonomous driving tasks using the CARLA simulator environment and the Ray/RLlib libraries | 35
gokulnc/setting-up-carla-reinforcement-learning | Provides a framework for using CARLA as a reinforcement learning environment | 95
kunqian2025/reinforcement-learning | A collection of implementations of reinforcement learning algorithms in MATLAB | 61
carla-simulator/reinforcement-learning | An implementation of an actor-critic reinforcement learning algorithm in Python | 245
enlite-ai/maze | An RL framework for building and training reinforcement learning models in Python | 266
xrsrke/instructgoose | A framework for training language models using human feedback and reinforcement learning | 171
luchris429/purejaxrl | A high-performance implementation of reinforcement learning training pipelines using JAX and PyTorch-like functionality | 755
sjtu-marl/malib | A framework for parallel population-based reinforcement learning | 507
kaixhin/rainbow | A Python implementation of a deep reinforcement learning algorithm combining multiple techniques for improved performance in Atari games | 1,591
eleurent/rl-agents | A collection of implementations of reinforcement learning and planning algorithms in Python | 596
alex-petrenko/sample-factory | A high-throughput reinforcement learning library with optimized synchronous and asynchronous implementations of policy gradients | 839
zhangfuyang/rl_carla | A reinforcement learning project to train an autonomous driving agent in a simulated environment using a deep learning approach | 230
coax-dev/coax | A modular framework for building reinforcement learning agents in Python using Gymnasium and JAX | 168
nadavbh12/retro-learning-environment | A framework for training AI algorithms using game consoles as input | 185