rl_a3c_pytorch
A3C algorithm
An implementation of an A3C algorithm for reinforcement learning in Pytorch, with various optimizations and extensions to accelerate training.
A3C LSTM Atari with Pytorch plus A3G design
562 stars
20 watching
119 forks
Language: Python
last commit: over 1 year ago a3ca3c-gpua3gactor-criticasynchronous-advantage-actor-criticatarideep-reinforcement-learningopenai-gympythonpytorchpytorch-a3creinforcement-learning
Related projects:
Repository | Description | Stars |
---|---|---|
rarilurelo/pytorch_a3c | An implementation of Asynchronous Advantage Actor-Critic in PyTorch for training AI models on reinforcement learning tasks | 38 |
onlytailei/a3c-pytorch | An implementation of Advantage async Actor-Critic Algorithms in PyTorch for Deep Reinforcement Learning | 114 |
andrewliao11/pytorch-a3c-mujoco | An implementation of the Actor-Critic algorithm for continuous control tasks in MuJoCo environments using PyTorch. | 73 |
chingyaoc/pytorch-reinforce | A PyTorch implementation of the REINFORCE algorithm for reinforcement learning in continuous and discrete environments. | 266 |
tristandeleu/pytorch-maml-rl | Replication of Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks in PyTorch for reinforcement learning tasks | 830 |
ikostrikov/pytorch-trpo | A PyTorch implementation of an optimization algorithm for continuous control and reinforcement learning tasks | 435 |
xxradon/igcv3-pytorch | Reimplements MobileNet-V2 and IGCV3 using PyTorch for efficient deep learning. | 19 |
pemami4911/neural-combinatorial-rl-pytorch | An implementation of Neural Combinatorial Optimization with Reinforcement Learning using PyTorch. | 562 |
graal-research/poutyne | A PyTorch framework simplifying neural network training with automated boilerplate code and callback utilities | 572 |
atgambardella/pytorch-es | An implementation of an optimization algorithm for training neural networks in machine learning environments. | 351 |
devendrachaplot/deeprl-grounding | Trains an RL agent to execute natural language instructions in a 3D environment using a combination of A3C and gated attention mechanisms. | 237 |
kaiyangzhou/dassl.pytorch | A PyTorch toolbox for supporting research and development of domain adaptation, generalization, and semi-supervised learning methods in computer vision. | 1,236 |
prabhuomkar/pytorch-cpp | A C++ implementation of PyTorch tutorials | 1,978 |
iffix/machin | An open-source reinforcement learning library for PyTorch, providing a simple and clear implementation of various algorithms. | 402 |
jingweiz/pytorch-rl | A comprehensive project that provides an implementation of deep reinforcement learning algorithms using PyTorch and Visdom. | 798 |