pytorch-trpo
Optimization algorithm
A PyTorch implementation of Trust Region Policy Optimization (TRPO) for continuous control and reinforcement learning tasks
433 stars
12 watching
91 forks
Language: Python
Last commit: about 6 years ago
Topics: continuous-control, deep-learning, deep-reinforcement-learning, mujoco, pytorch, reinforcement-learning, trpo, trust-region-policy-optimization
Related projects:
Repository | Description | Stars |
---|---|---|
ikostrikov/pytorch-meta-optimizer | A PyTorch implementation of meta-learning using gradient descent to adapt to new tasks. | 312 |
ikostrikov/pytorch-ddpg-naf | An implementation of reinforcement learning algorithms for continuous control tasks using deep neural networks. | 307 |
moskomule/eve.pytorch | An implementation of the Eve optimization algorithm, proposed in a 2016 research paper | 33 |
alexis-jacq/pytorch-dppo | A PyTorch implementation of the Distributed Proximal Policy Optimization (DPPO) algorithm | 180 |
ikostrikov/pytorch-flows | PyTorch implementations of algorithms for density estimation using invertible transformations. | 576 |
jiangoforit/yellowfin_pytorch | An optimizer that automatically tunes momentum and learning rate using local quadratic approximation. | 287 |
dgriff777/rl_a3c_pytorch | An implementation of the A3C algorithm for reinforcement learning in PyTorch, with various optimizations and extensions to accelerate training. | 562 |
pemami4911/neural-combinatorial-rl-pytorch | An implementation of Neural Combinatorial Optimization with Reinforcement Learning using PyTorch. | 558 |
rentruewang/koila | A lightweight wrapper around PyTorch to prevent CUDA out-of-memory errors and optimize model execution | 1,821 |
kaiyangzhou/dassl.pytorch | A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. | 1,217 |
locuslab/optnet | A PyTorch module that adds differentiable optimization as a layer to neural networks | 513 |
tristandeleu/pytorch-maml-rl | Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks | 827 |
metaopt/torchopt | An efficient library for differentiable optimization built on top of PyTorch. | 544 |
chingyaoc/pytorch-reinforce | A PyTorch implementation of the REINFORCE algorithm for reinforcement learning in continuous and discrete environments. | 264 |
osqp/osqpth | A PyTorch module that wraps the OSQP solver for differentiable optimization problems | 59 |