trl

Transformer model trainer

A library designed to train transformer language models with reinforcement learning using various optimization techniques and fine-tuning methods.

Train transformer language models with reinforcement learning.

GitHub

10k stars

77 watching

1k forks

Language: Python

last commit: 8 months ago

Linked from 1 awesome list

hf.co/docs/trl

Backlinks from these awesome lists:

ethicalml/awesome-production-machine-learning

Related projects:

Repository	Description	Stars
carperai/trlx	A framework for distributed reinforcement learning of large language models with human feedback	4,537
huggingface/transformers	A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects.	136,357
google-deepmind/trfl	Provides building blocks for Reinforcement Learning agents using TensorFlow	3,136
huggingface/deep-rl-class	A course repository containing teaching materials and resources for learning deep reinforcement learning	3,931
thu-ml/tianshou	A high-performance reinforcement learning library with modular interfaces and user-friendly APIs for building deep learning agents.	8,069
tristandeleu/pytorch-maml-rl	Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks	830
huggingface/peft	An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters	16,699
huggingface/transformers.js	An open-source JavaScript library for running machine learning models in the browser without a server.	12,363
tju-drl-lab/ai-optimizer	A next-generation deep reinforcement learning toolkit with libraries for multiagent, self-supervised, offline, and transfer/reinforcement learning	4,848
google-research/vision_transformer	Provides pre-trained models and code for training vision transformers and mixers using JAX/Flax	10,620
p-christ/deep-reinforcement-learning-algorithms-with-pytorch	PyTorch implementations of popular deep reinforcement learning algorithms and environments.	5,669
google/trax	An end-to-end deep learning library with clear code and speed	8,114
huggingface/lerobot	A platform providing pre-trained models, datasets, and tools for robotics with focus on imitation learning and reinforcement learning.	7,874
huggingface/alignment-handbook	Provides recipes and guidelines for training language models to align with human preferences and AI goals	4,800
kimiyoung/transformer-xl	Implementations of a neural network architecture for language modeling	3,619