verl
LLM trainer
A flexible RL training framework designed for large language models
veRL: Volcano Engine Reinforcement Learning for LLM
427 stars
8 watching
29 forks
Language: Python
last commit: about 1 month ago Related projects:
Repository | Description | Stars |
---|---|---|
volcengine/vescale | A PyTorch-based framework for training large language models in parallel on multiple devices | 679 |
luchris429/purejaxrl | A high-performance implementation of reinforcement learning training pipelines using JAX and PyTorch-like functionality | 755 |
toni-sm/skrl | A modular reinforcement learning library with support for various environments and frameworks | 588 |
matthiasplappert/keras-rl | A Python library implementing state-of-the-art deep reinforcement learning algorithms for Keras and OpenAI Gym environments. | 8 |
tristandeleu/pytorch-maml-rl | Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks | 830 |
vpgtrans/vpgtrans | Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs | 270 |
rle-foundation/rlexplore | Provides a unified toolkit for constructing, computing, and optimizing intrinsic reward modules in reinforcement learning | 373 |
astooke/rlpyt | A modular and unified framework for implementing common deep reinforcement learning algorithms in PyTorch | 2,236 |
kaixhin/rainbow | A Python implementation of a deep reinforcement learning algorithm combining multiple techniques for improved performance in Atari games | 1,591 |
enlite-ai/maze | An RL framework for building and training reinforcement learning models in Python | 266 |
millionintegrals/vel | A collection of modular deep learning components that can be easily configured and reused in various applications. | 276 |
kunqian2025/reinforcement-learning | A collection of implementations of reinforcement learning algorithms in MATLAB | 61 |
zuoxingdong/lagom | A modular toolkit for rapid prototyping of reinforcement learning algorithms | 373 |
mushroomrl/mushroom-rl | A Python library for reinforcement learning algorithms and environments. | 824 |
iffix/machin | An open-source reinforcement learning library for PyTorch, providing a simple and clear implementation of various algorithms. | 402 |