gps
Policy optimizer
An implementation of guided policy search and LQG-based trajectory optimization for reinforcement learning
Guided Policy Search
599 stars
46 watching
241 forks
Language: Python
Last commit: almost 5 years ago
Linked from 1 awesome list
deep-learning, deep-reinforcement-learning, reinforcement-learning, reinforcement-learning-algorithms, robotics
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A software toolkit implementing a novel reinforcement learning framework for portfolio management with policy optimization and financial-model-based algorithms. | 1,755 |
| | A reinforcement learning-based framework for optimizing hyperparameters in distributed machine learning environments. | 15 |
| | A reinforcement learning-based system for optimizing multi-cell selection in wireless networks | 58 |
| | A reinforcement learning environment library for compiler optimization tasks | 917 |
| | Reinforcement learning-based algorithm for optimizing stock trading and portfolio management | 182 |
| | A tool for exploring and optimizing the architecture of Convolutional Neural Networks using a Genetic Algorithm | 218 |
| | An open-source project providing hardware accelerated, batchable and differentiable optimizers in JAX for deep learning. | 941 |
| | A flexible framework for optimizing model parameters in computational neuroscience and related fields. | 204 |
| | An algorithm that optimizes portfolio allocation using Reinforcement Learning and Supervised learning. | 168 |
| | An implementation of a reinforcement learning algorithm using multi-branch architecture and Deep Deterministic Policy Gradients (DDPG) to control autonomous vehicles in simulation environments. | 81 |
| | A framework for training reinforcement learning models to optimize traffic control and simulation | 1,078 |
| | An agent-based traffic management system using model-free reinforcement learning to optimize traffic signal control. | 48 |
| | An optimization framework using genetic algorithms to train and improve the performance of Backpropagation Neural Networks for traffic flow prediction | 157 |
| | Automates cost modeling and optimization for indexers in blockchain networks using reinforcement learning and GraphQL APIs. | 11 |
| | Environments and data for training reinforcement learning agents in a kitchen simulator | 108 |