verl

LLM trainer

A flexible RL training framework designed for large language models

veRL: Volcano Engine Reinforcement Learning for LLM

427 stars

8 watching

29 forks

Language: Python

last commit: 7 months ago

Screenshot of volcengine/verl website

verl.readthedocs.io/en/latest/index.html

Related projects:

Repository	Description	Stars
volcengine/vescale	A PyTorch-based framework for training large language models in parallel on multiple devices	679
luchris429/purejaxrl	A high-performance implementation of reinforcement learning training pipelines using JAX and PyTorch-like functionality	755
toni-sm/skrl	A modular reinforcement learning library with support for various environments and frameworks	588
matthiasplappert/keras-rl	A Python library implementing state-of-the-art deep reinforcement learning algorithms for Keras and OpenAI Gym environments.	8
tristandeleu/pytorch-maml-rl	Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks	830
vpgtrans/vpgtrans	Transfers visual prompt generators across large language models to reduce training costs and enable customization of multimodal LLMs	270
rle-foundation/rlexplore	Provides a unified toolkit for constructing, computing, and optimizing intrinsic reward modules in reinforcement learning	373
astooke/rlpyt	A modular and unified framework for implementing common deep reinforcement learning algorithms in PyTorch	2,236
kaixhin/rainbow	A Python implementation of a deep reinforcement learning algorithm combining multiple techniques for improved performance in Atari games	1,591
enlite-ai/maze	An RL framework for building and training reinforcement learning models in Python	266
millionintegrals/vel	A collection of modular deep learning components that can be easily configured and reused in various applications.	276
kunqian2025/reinforcement-learning	A collection of implementations of reinforcement learning algorithms in MATLAB	61
zuoxingdong/lagom	A modular toolkit for rapid prototyping of reinforcement learning algorithms	373
mushroomrl/mushroom-rl	A Python library for reinforcement learning algorithms and environments.	824
iffix/machin	An open-source reinforcement learning library for PyTorch, providing a simple and clear implementation of various algorithms.	402