pytorch-trpo

Optimization algorithm

A PyTorch implementation of an optimization algorithm for continuous control and reinforcement learning tasks

PyTorch implementation of Trust Region Policy Optimization

GitHub

435 stars

12 watching

91 forks

Language: Python

last commit: almost 7 years ago

continuous-controldeep-learningdeep-reinforcement-learningmujocopytorchreinforcement-learningtrpotrust-region-policy-optimization

Related projects:

Repository	Description	Stars
ikostrikov/pytorch-meta-optimizer	A PyTorch implementation of meta-learning using gradient descent to adapt to new tasks.	312
ikostrikov/pytorch-ddpg-naf	An implementation of reinforcement learning algorithms for continuous control tasks using deep neural networks.	307
moskomule/eve.pytorch	An implementation of an optimization algorithm inspired by a 2016 research paper	33
alexis-jacq/pytorch-dppo	A PyTorch implementation of Distributed Proximal Policy Optimization algorithm	180
ikostrikov/pytorch-flows	PyTorch implementations of algorithms for density estimation using invertible transformations.	577
jiangoforit/yellowfin_pytorch	An optimizer that automatically tunes momentum and learning rate using local quadratic approximation.	287
dgriff777/rl_a3c_pytorch	An implementation of an A3C algorithm for reinforcement learning in Pytorch, with various optimizations and extensions to accelerate training.	562
pemami4911/neural-combinatorial-rl-pytorch	An implementation of Neural Combinatorial Optimization with Reinforcement Learning using PyTorch.	562
rentruewang/koila	A lightweight wrapper around PyTorch to prevent CUDA out-of-memory errors and optimize model execution	1,823
kaiyangzhou/dassl.pytorch	A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision.	1,236
locuslab/optnet	A PyTorch module that adds differentiable optimization as a layer to neural networks	517
tristandeleu/pytorch-maml-rl	Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks	830
metaopt/torchopt	An efficient library for differentiable optimization built on top of PyTorch.	554
chingyaoc/pytorch-reinforce	A PyTorch implementation of the REINFORCE algorithm for reinforcement learning in continuous and discrete environments.	266
osqp/osqpth	A PyTorch module that wraps the OSQP solver for differentiable optimization problems	59