trl
Transformer model trainer
A library designed to train transformer language models with reinforcement learning using various optimization techniques and fine-tuning methods.
Train transformer language models with reinforcement learning.
10k stars
77 watching
1k forks
Language: Python
last commit: 2 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| A framework for distributed reinforcement learning of large language models with human feedback | 4,537 |
| A collection of pre-trained machine learning models for various natural language and computer vision tasks, enabling developers to fine-tune and deploy these models on their own projects. | 136,357 |
| Provides building blocks for Reinforcement Learning agents using TensorFlow | 3,136 |
| A course repository containing teaching materials and resources for learning deep reinforcement learning | 3,931 |
| A high-performance reinforcement learning library with modular interfaces and user-friendly APIs for building deep learning agents. | 8,069 |
| Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks | 830 |
| An efficient method for fine-tuning large pre-trained models by adapting only a small fraction of their parameters | 16,699 |
| An open-source JavaScript library for running machine learning models in the browser without a server. | 12,363 |
| A next-generation deep reinforcement learning toolkit with libraries for multiagent, self-supervised, offline, and transfer/reinforcement learning | 4,848 |
| Provides pre-trained models and code for training vision transformers and mixers using JAX/Flax | 10,620 |
| PyTorch implementations of popular deep reinforcement learning algorithms and environments. | 5,669 |
| An end-to-end deep learning library with clear code and speed | 8,114 |
| A platform providing pre-trained models, datasets, and tools for robotics with focus on imitation learning and reinforcement learning. | 7,874 |
| Provides recipes and guidelines for training language models to align with human preferences and AI goals | 4,800 |
| Implementations of a neural network architecture for language modeling | 3,619 |