verl

LLM trainer

A flexible RL training framework designed for large language models

veRL: Volcano Engine Reinforcement Learning for LLM

GitHub

427 stars
8 watching
29 forks
Language: Python
last commit: about 1 month ago

Related projects:

Repository Description Stars
volcengine/vescale A PyTorch-based framework for training large language models in parallel on multiple devices 679
luchris429/purejaxrl A high-performance implementation of reinforcement learning training pipelines using JAX and PyTorch-like functionality 755
toni-sm/skrl A modular reinforcement learning library with support for various environments and frameworks 588
matthiasplappert/keras-rl A Python library implementing state-of-the-art deep reinforcement learning algorithms for Keras and OpenAI Gym environments. 8
tristandeleu/pytorch-maml-rl Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks 830
vpgtrans/vpgtrans Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs 270
rle-foundation/rlexplore Provides a unified toolkit for constructing, computing, and optimizing intrinsic reward modules in reinforcement learning 373
astooke/rlpyt A modular and unified framework for implementing common deep reinforcement learning algorithms in PyTorch 2,236
kaixhin/rainbow A Python implementation of a deep reinforcement learning algorithm combining multiple techniques for improved performance in Atari games 1,591
enlite-ai/maze An RL framework for building and training reinforcement learning models in Python 266
millionintegrals/vel A collection of modular deep learning components that can be easily configured and reused in various applications. 276
kunqian2025/reinforcement-learning A collection of implementations of reinforcement learning algorithms in MATLAB 61
zuoxingdong/lagom A modular toolkit for rapid prototyping of reinforcement learning algorithms 373
mushroomrl/mushroom-rl A Python library for reinforcement learning algorithms and environments. 824
iffix/machin An open-source reinforcement learning library for PyTorch, providing a simple and clear implementation of various algorithms. 402