trl
Transformer model trainer
A library designed to train transformer language models with reinforcement learning using various optimization techniques and fine-tuning methods.
Train transformer language models with reinforcement learning.
10k stars
76 watching
1k forks
Language: Python
last commit: 6 days ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
carperai/trlx | A framework for distributed reinforcement learning of large language models with human feedback | 4,502 |
huggingface/transformers | A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 135,022 |
google-deepmind/trfl | Provides building blocks for Reinforcement Learning agents using TensorFlow | 3,134 |
huggingface/deep-rl-class | A course repository containing teaching materials and resources for learning deep reinforcement learning | 3,902 |
thu-ml/tianshou | A high-performance reinforcement learning library with modular interfaces and user-friendly APIs for building deep learning agents. | 7,968 |
tristandeleu/pytorch-maml-rl | Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks | 827 |
huggingface/peft | An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters | 16,437 |
huggingface/transformers.js | An API for using pre-trained machine learning models in web browsers without the need for a server | 12,085 |
tju-drl-lab/ai-optimizer | A next-generation deep reinforcement learning toolkit with libraries for multiagent, self-supervised, offline, and transfer/reinforcement learning | 4,755 |
google-research/vision_transformer | Provides pre-trained models and code for training vision transformers and mixers using JAX/Flax | 10,450 |
p-christ/deep-reinforcement-learning-algorithms-with-pytorch | PyTorch implementations of popular deep reinforcement learning algorithms and environments. | 5,640 |
google/trax | An end-to-end deep learning library with clear code and speed | 8,096 |
huggingface/lerobot | A platform providing pre-trained models, datasets, and tools for robotics with focus on imitation learning and reinforcement learning. | 7,518 |
huggingface/alignment-handbook | Provides training recipes and resources to align language models with human preferences | 4,677 |
kimiyoung/transformer-xl | Implementations of a neural network architecture for language modeling | 3,611 |