verl
LLM RL framework
A flexible and efficient reinforcement learning framework designed for large language models.
veRL: Volcano Engine Reinforcement Learning for LLM
315 stars
7 watching
15 forks
Language: Python
last commit: 4 days ago Related projects:
Repository | Description | Stars |
---|---|---|
volcengine/vescale | A PyTorch-based framework for training large language models in parallel on multiple devices | 663 |
luchris429/purejaxrl | A high-performance implementation of reinforcement learning training pipelines using JAX and PyTorch-like functionality | 722 |
toni-sm/skrl | A modular reinforcement learning library with support for various environments and frameworks | 560 |
matthiasplappert/keras-rl | A Python library implementing state-of-the-art deep reinforcement learning algorithms for Keras and OpenAI Gym environments. | 7 |
tristandeleu/pytorch-maml-rl | Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks | 827 |
vpgtrans/vpgtrans | Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs | 269 |
rle-foundation/rlexplore | Provides a unified toolkit for constructing, computing, and optimizing intrinsic reward modules in reinforcement learning | 366 |
astooke/rlpyt | A modular and unified framework for implementing common deep reinforcement learning algorithms in PyTorch | 2,232 |
kaixhin/rainbow | A Python implementation of a deep reinforcement learning algorithm combining multiple techniques for improved performance in Atari games | 1,585 |
enlite-ai/maze | An RL framework for building and training reinforcement learning models in Python | 265 |
millionintegrals/vel | A collection of modular deep learning components that can be easily configured and reused in various applications. | 276 |
kunqian2025/reinforcement-learning | A collection of implementations of reinforcement learning algorithms in MATLAB | 60 |
zuoxingdong/lagom | A modular toolkit for rapid prototyping of reinforcement learning algorithms | 373 |
mushroomrl/mushroom-rl | A Python library for reinforcement learning algorithms and environments. | 822 |
iffix/machin | An open-source reinforcement learning library for PyTorch, providing a simple and clear implementation of various algorithms. | 401 |