alpaca_farm

RL simulator

A framework for simulating and evaluating reinforcement learning from human feedback (RLHF) methods

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.


786 stars

9 watching

59 forks

Language: Python

Last commit: about 1 year ago

Linked from 1 awesome list

deep-learning, instruction-following, large-language-models, natural-language-processing, reinforcement-learning-from-human-feedback


arxiv.org/abs/2305.14387

Backlinks from these awesome lists:

ethicalml/awesome-production-machine-learning

Related projects:

Repository	Description	Stars
horizonrobotics/alf	A framework for implementing complex reinforcement learning algorithms with flexibility and ease of implementation	306
layssi/carla_ray_rlib	An open-source reinforcement learning framework for autonomous driving tasks using the Carla-Simulator environment and Ray/Rllib libraries.	35
gokulnc/setting-up-carla-reinforcement-learning	Provides a framework for using CARLA as a reinforcement learning environment	95
kunqian2025/reinforcement-learning	A collection of implementations of reinforcement learning algorithms in MATLAB	61
carla-simulator/reinforcement-learning	An implementation of an actor-critic reinforcement learning algorithm in Python.	245
enlite-ai/maze	An RL framework for building and training reinforcement learning models in Python	266
xrsrke/instructgoose	A framework for training language models using human feedback and reinforcement learning	171
luchris429/purejaxrl	A high-performance implementation of reinforcement learning training pipelines using JAX and PyTorch-like functionality	755
sjtu-marl/malib	A framework for parallel population-based reinforcement learning	507
kaixhin/rainbow	A Python implementation of a deep reinforcement learning algorithm combining multiple techniques for improved performance in Atari games	1,591
eleurent/rl-agents	A collection of implementations of Reinforcement Learning and planning algorithms in Python.	596
alex-petrenko/sample-factory	A high-throughput reinforcement learning library with optimized synchronous and asynchronous implementations of policy gradients.	839
zhangfuyang/rl_carla	A reinforcement learning project to train an autonomous driving agent in a simulated environment using a deep learning approach	230
coax-dev/coax	A modular framework for building reinforcement learning agents in Python using Gymnasium and JAX.	168
nadavbh12/retro-learning-environment	A framework for training AI algorithms on console video games	185