trl

Transformer model trainer

A library for training and fine-tuning transformer language models with reinforcement learning.

GitHub

10k stars
76 watching
1k forks
Language: Python
Last commit: 6 days ago
Linked from 1 awesome list


Related projects:

Repository | Description | Stars
carperai/trlx | A framework for distributed reinforcement learning of large language models with human feedback | 4,502
huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects | 135,022
google-deepmind/trfl | Provides building blocks for reinforcement learning agents using TensorFlow | 3,134
huggingface/deep-rl-class | A course repository containing teaching materials and resources for learning deep reinforcement learning | 3,902
thu-ml/tianshou | A high-performance reinforcement learning library with modular interfaces and user-friendly APIs for building deep learning agents | 7,968
tristandeleu/pytorch-maml-rl | Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks | 827
huggingface/peft | An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters | 16,437
huggingface/transformers.js | An API for using pre-trained machine learning models in web browsers without the need for a server | 12,085
tju-drl-lab/ai-optimizer | A next-generation deep reinforcement learning toolkit with libraries for multiagent, self-supervised, offline, and transfer/reinforcement learning | 4,755
google-research/vision_transformer | Provides pre-trained models and code for training vision transformers and mixers using JAX/Flax | 10,450
p-christ/deep-reinforcement-learning-algorithms-with-pytorch | PyTorch implementations of popular deep reinforcement learning algorithms and environments | 5,640
google/trax | An end-to-end deep learning library with clear code and speed | 8,096
huggingface/lerobot | A platform providing pre-trained models, datasets, and tools for robotics with a focus on imitation learning and reinforcement learning | 7,518
huggingface/alignment-handbook | Provides training recipes and resources to align language models with human preferences | 4,677
kimiyoung/transformer-xl | Implementations of a neural network architecture for language modeling | 3,611