pytorch-a3c-mujoco
Actor-Critic algorithm
An implementation of the Actor-Critic algorithm for continuous control tasks in MuJoCo environments using PyTorch.
Implement A3C for Mujoco gym envs
73 stars
6 watching
19 forks
Language: Python
last commit: about 7 years ago a3cactor-criticcontinuous-controlmujocopytorchreinforcement-learning
Related projects:
Repository | Description | Stars |
---|---|---|
rarilurelo/pytorch_a3c | An implementation of Asynchronous Advantage Actor-Critic in PyTorch for training AI models on reinforcement learning tasks | 38 |
onlytailei/a3c-pytorch | An implementation of Advantage async Actor-Critic Algorithms in PyTorch for Deep Reinforcement Learning | 114 |
dgriff777/rl_a3c_pytorch | An implementation of an A3C algorithm for reinforcement learning in Pytorch, with various optimizations and extensions to accelerate training. | 562 |
chingyaoc/pytorch-reinforce | A PyTorch implementation of the REINFORCE algorithm for reinforcement learning in continuous and discrete environments. | 264 |
ruotianluo/self-critical.pytorch | An implementation of Self-critical Sequence Training for Image Captioning and related techniques. | 997 |
ikostrikov/pytorch-trpo | A PyTorch implementation of an optimization algorithm for continuous control and reinforcement learning tasks | 433 |
ikostrikov/pytorch-a2c-ppo-acktr-gail | An open-source implementation of several reinforcement learning algorithms in PyTorch | 3,605 |
atgambardella/pytorch-es | An implementation of an optimization algorithm for training neural networks in machine learning environments. | 351 |
yukkyo/pytorch-filterresponsenormalizationlayer | An implementation of Filter Response Normalization Layer in PyTorch to improve the training of deep neural networks by eliminating batch dependence. | 85 |
alexis-jacq/pytorch-dppo | A PyTorch implementation of Distributed Proximal Policy Optimization algorithm | 180 |
tianhongdai/integrated-gradient-pytorch | A PyTorch implementation of attributing the impact of inputs on deep neural network outputs | 181 |
fducau/aae_pytorch | An implementation of Adversarial Autoencoders using PyTorch for training neural networks on structured data. | 198 |
packtpublishing/hands-on-intelligent-agents-with-openai-gym | Teaching software developers to build intelligent agents using deep reinforcement learning and OpenAI Gym | 373 |
moskomule/eve.pytorch | An implementation of an optimization algorithm inspired by a 2016 research paper | 33 |
ethanluoyc/e2c-pytorch | An implementation of the E2C control policy in PyTorch, allowing customization and comparison with different neural network architectures. | 43 |