PaLM-rlhf-pytorch

RLHF framework

An implementation of RLHF on top of the PaLM architecture, for training large language models with reinforcement learning from human feedback.

Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT, but with PaLM.

GitHub

8k stars
143 watching
671 forks
Language: Python
Last commit: 11 months ago
artificial-intelligence, attention-mechanisms, deep-learning, human-feedback, reinforcement-learning, transformers

Related projects:

Repository | Description | Stars
carperai/trlx | A framework for distributed reinforcement learning of large language models with human feedback | 4,537
p-christ/deep-reinforcement-learning-algorithms-with-pytorch | PyTorch implementations of popular deep reinforcement learning algorithms and environments | 5,669
lucidrains/imagen-pytorch | Implements Google's text-to-image neural network in PyTorch using a cascading DDPM architecture with dynamic clipping and noise level conditioning | 8,127
lucidrains/musiclm-pytorch | Implementation of Google's MusicLM model for music generation using attention networks and text-conditioning | 3,189
iffix/machin | An open-source reinforcement learning library for PyTorch, providing a simple and clear implementation of various algorithms | 402
ethanyanjiali/minchatgpt | Demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2 | 214
luchris429/purejaxrl | A high-performance implementation of reinforcement learning training pipelines using JAX and PyTorch-like functionality | 755
tristandeleu/pytorch-maml-rl | Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks | 830
thu-ml/tianshou | A high-performance reinforcement learning library with modular interfaces and user-friendly APIs for building deep learning agents | 8,069
lucidrains/dalle2-pytorch | An implementation of DALL-E 2's text-to-image synthesis neural network in PyTorch | 11,184
freedomintelligence/llmzoo | A platform providing data, models, and evaluation benchmarks for large language models to promote accessibility and democratization of AI technology | 2,938
huggingface/alignment-handbook | Provides recipes and guidelines for training language models to align with human preferences and AI goals | 4,800
huggingface/trl | A library designed to train transformer language models with reinforcement learning using various optimization techniques and fine-tuning methods | 10,308
tju-drl-lab/ai-optimizer | A next-generation deep reinforcement learning toolkit with libraries for multiagent, self-supervised, offline, and transfer/reinforcement learning | 4,848
xrsrke/instructgoose | A framework for training language models using human feedback and reinforcement learning | 171