trlx

RL Framework

A framework for distributed reinforcement learning of large language models with human feedback

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

GitHub

5k stars
50 watching
472 forks
Language: Python
last commit: 11 months ago
Linked from 2 awesome lists

machine-learningpytorchreinforcement-learning

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
huggingface/trl A library designed to train transformer language models with reinforcement learning using various optimization techniques and fine-tuning methods. 10,053
lucidrains/palm-rlhf-pytorch An implementation of RLHF on top of the PaLM architecture to enable human feedback in reinforcement learning for large language models. 7,705
tju-drl-lab/ai-optimizer A next-generation deep reinforcement learning toolkit with libraries for multiagent, self-supervised, offline, and transfer/reinforcement learning 4,755
google-deepmind/trfl Provides building blocks for Reinforcement Learning agents using TensorFlow 3,134
paddlepaddle/parl A high-performance distributed training framework for Reinforcement Learning 3,273
thu-ml/tianshou A high-performance reinforcement learning library with modular interfaces and user-friendly APIs for building deep learning agents. 7,968
iffix/machin An open-source reinforcement learning library for PyTorch, providing a simple and clear implementation of various algorithms. 401
p-christ/deep-reinforcement-learning-algorithms-with-pytorch PyTorch implementations of popular deep reinforcement learning algorithms and environments. 5,640
rle-foundation/rlexplore Provides a unified toolkit for constructing, computing, and optimizing intrinsic reward modules in reinforcement learning 366
tristandeleu/pytorch-maml-rl Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks 827
luchris429/purejaxrl A high-performance implementation of reinforcement learning training pipelines using JAX and PyTorch-like functionality 722
eleutherai/gpt-neox Provides a framework for training large-scale language models on GPUs with advanced features and optimizations. 6,941
tatsu-lab/alpaca_farm A framework for simulating and evaluating reinforcement learning from human feedback methods 782
rlcode/reinforcement-learning A collection of clean and minimal examples for various reinforcement learning algorithms 3,373
yandexdataschool/practical_rl An educational resource teaching practical reinforcement learning skills in Python using popular deep learning frameworks. 5,918