pytorch-trpo

Optimization algorithm

A PyTorch implementation of an optimization algorithm for continuous control and reinforcement learning tasks

PyTorch implementation of Trust Region Policy Optimization

GitHub

433 stars
12 watching
91 forks
Language: Python
last commit: about 6 years ago
continuous-controldeep-learningdeep-reinforcement-learningmujocopytorchreinforcement-learningtrpotrust-region-policy-optimization

Related projects:

Repository Description Stars
ikostrikov/pytorch-meta-optimizer A PyTorch implementation of meta-learning using gradient descent to adapt to new tasks. 312
ikostrikov/pytorch-ddpg-naf An implementation of reinforcement learning algorithms for continuous control tasks using deep neural networks. 307
moskomule/eve.pytorch An implementation of an optimization algorithm inspired by a 2016 research paper 33
alexis-jacq/pytorch-dppo A PyTorch implementation of Distributed Proximal Policy Optimization algorithm 180
ikostrikov/pytorch-flows PyTorch implementations of algorithms for density estimation using invertible transformations. 576
jiangoforit/yellowfin_pytorch An optimizer that automatically tunes momentum and learning rate using local quadratic approximation. 287
dgriff777/rl_a3c_pytorch An implementation of an A3C algorithm for reinforcement learning in Pytorch, with various optimizations and extensions to accelerate training. 562
pemami4911/neural-combinatorial-rl-pytorch An implementation of Neural Combinatorial Optimization with Reinforcement Learning using PyTorch. 558
rentruewang/koila A lightweight wrapper around PyTorch to prevent CUDA out-of-memory errors and optimize model execution 1,821
kaiyangzhou/dassl.pytorch A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. 1,217
locuslab/optnet A PyTorch module that adds differentiable optimization as a layer to neural networks 513
tristandeleu/pytorch-maml-rl Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks 827
metaopt/torchopt An efficient library for differentiable optimization built on top of PyTorch. 544
chingyaoc/pytorch-reinforce A PyTorch implementation of the REINFORCE algorithm for reinforcement learning in continuous and discrete environments. 264
osqp/osqpth A PyTorch module that wraps the OSQP solver for differentiable optimization problems 59