pytorch-a3c-mujoco
Actor-Critic algorithm
An implementation of the Actor-Critic algorithm for continuous control tasks in MuJoCo environments using PyTorch.
Implement A3C for Mujoco gym envs
73 stars
6 watching
19 forks
Language: Python
last commit: over 7 years ago a3cactor-criticcontinuous-controlmujocopytorchreinforcement-learning
Related projects:
Repository | Description | Stars |
---|---|---|
| An implementation of Asynchronous Advantage Actor-Critic in PyTorch for training AI models on reinforcement learning tasks | 38 |
| An implementation of Advantage async Actor-Critic Algorithms in PyTorch for Deep Reinforcement Learning | 114 |
| An implementation of an A3C algorithm for reinforcement learning in Pytorch, with various optimizations and extensions to accelerate training. | 562 |
| A PyTorch implementation of the REINFORCE algorithm for reinforcement learning in continuous and discrete environments. | 266 |
| An implementation of Self-critical Sequence Training for Image Captioning and related techniques. | 998 |
| A PyTorch implementation of an optimization algorithm for continuous control and reinforcement learning tasks | 435 |
| An open-source implementation of several reinforcement learning algorithms in PyTorch | 3,644 |
| An implementation of an optimization algorithm for training neural networks in machine learning environments. | 351 |
| An implementation of Filter Response Normalization Layer in PyTorch to improve the training of deep neural networks by eliminating batch dependence. | 86 |
| A PyTorch implementation of Distributed Proximal Policy Optimization algorithm | 180 |
| A PyTorch implementation of attributing the impact of inputs on deep neural network outputs | 182 |
| An implementation of Adversarial Autoencoders using PyTorch for training neural networks on structured data. | 199 |
| Teaching software developers to build intelligent agents using deep reinforcement learning and OpenAI Gym | 374 |
| An implementation of an optimization algorithm inspired by a 2016 research paper | 33 |
| An implementation of the E2C control policy in PyTorch, allowing customization and comparison with different neural network architectures. | 43 |