verl
LLM trainer
A flexible RL training framework designed for large language models
veRL: Volcano Engine Reinforcement Learning for LLM
427 stars
8 watching
29 forks
Language: Python
last commit: 4 months ago Related projects:
Repository | Description | Stars |
---|---|---|
| A PyTorch-based framework for training large language models in parallel on multiple devices | 679 |
| A high-performance implementation of reinforcement learning training pipelines using JAX and PyTorch-like functionality | 755 |
| A modular reinforcement learning library with support for various environments and frameworks | 588 |
| A Python library implementing state-of-the-art deep reinforcement learning algorithms for Keras and OpenAI Gym environments. | 8 |
| Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks | 830 |
| Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs | 270 |
| Provides a unified toolkit for constructing, computing, and optimizing intrinsic reward modules in reinforcement learning | 373 |
| A modular and unified framework for implementing common deep reinforcement learning algorithms in PyTorch | 2,236 |
| A Python implementation of a deep reinforcement learning algorithm combining multiple techniques for improved performance in Atari games | 1,591 |
| An RL framework for building and training reinforcement learning models in Python | 266 |
| A collection of modular deep learning components that can be easily configured and reused in various applications. | 276 |
| A collection of implementations of reinforcement learning algorithms in MATLAB | 61 |
| A modular toolkit for rapid prototyping of reinforcement learning algorithms | 373 |
| A Python library for reinforcement learning algorithms and environments. | 824 |
| An open-source reinforcement learning library for PyTorch, providing a simple and clear implementation of various algorithms. | 402 |