pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

GitHub

4k stars
67 watching
829 forks
Language: Python
last commit: over 2 years ago
a2cacktractor-criticadvantage-actor-criticaleataricontinuous-controldeep-learningdeep-reinforcement-learninghessiankfackronecker-factored-approximationmujoconatural-gradientsppoproximal-policy-optimizationpytorchreinforcement-learningroboschoolsecond-order