instructGOOSE
RLHF framework
A framework for training language models using human feedback and reinforcement learning
Implementation of Reinforcement Learning from Human Feedback (RLHF)
169 stars
4 watching
21 forks
Language: Jupyter Notebook
last commit: over 1 year ago chatgpthuman-feedbackinstructgptreinforcement-learningrlhf
Related projects:
Repository | Description | Stars |
---|---|---|
tatsu-lab/alpaca_farm | A framework for simulating and evaluating reinforcement learning from human feedback methods | 782 |
luchris429/purejaxrl | A high-performance implementation of reinforcement learning training pipelines using JAX and PyTorch-like functionality | 722 |
horizonrobotics/alf | A reinforcement learning framework designed to implement complex algorithms with flexibility and ease of use | 302 |
rlhf-v/rlhf-v | Aligns large language models' behavior through fine-grained correctional human feedback to improve trustworthiness and accuracy. | 233 |
layssi/carla_ray_rlib | An open-source reinforcement learning framework for autonomous driving tasks using the Carla-Simulator environment and Ray/Rllib libraries. | 35 |
ethanyanjiali/minchatgpt | This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2. | 213 |
kunqian2025/reinforcement-learning | A collection of implementations of reinforcement learning algorithms in MATLAB | 60 |
sjtu-marl/malib | A framework for parallel population-based reinforcement learning | 497 |
volcengine/verl | A flexible and efficient reinforcement learning framework designed for large language models. | 315 |
flint-xf-fan/byzantine-federated-rl | Provides a framework and theoretical foundation for Federated Reinforcement Learning with Byzantine Resilience in distributed systems | 85 |
kaixhin/rainbow | A Python implementation of a deep reinforcement learning algorithm combining multiple techniques for improved performance in Atari games | 1,585 |
matthiasplappert/keras-rl | A Python library implementing state-of-the-art deep reinforcement learning algorithms for Keras and OpenAI Gym environments. | 7 |
gokulnc/setting-up-carla-reinforcement-learning | Provides a framework for using CARLA as a reinforcement learning environment | 95 |
enlite-ai/maze | An RL framework for building and training reinforcement learning models in Python | 265 |
rle-foundation/rlexplore | Provides a unified toolkit for constructing, computing, and optimizing intrinsic reward modules in reinforcement learning | 366 |