KR-DL-UCT
Game strategy trainer
A framework for training deep neural networks to learn strategies in continuous action spaces using a kernel-based Monte Carlo tree search.
[ICML 2018] Deep Reinforcement Learning in Continuous Action Spaces: a Case Study in the Game of Simulated Curling
35 stars
4 watching
3 forks
Language: Python
last commit: over 6 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
liamconnell/deep-algotrading | A project demonstrating the application of deep learning techniques to financial data and algorithmic trading | 234 |
ikostrikov/jaxrl | Provides JAX implementations of various reinforcement learning algorithms with continuous action spaces. | 630 |
kh-kim/stock_market_reinforcement_learning | Provides a general environment for stock market trading simulation using OpenAI Gym and reinforcement learning algorithms | 791 |
ikostrikov/pytorch-ddpg-naf | An implementation of reinforcement learning algorithms for continuous control tasks using deep neural networks. | 307 |
kristjankorjus/replicating-deepmind | Reproducing DeepMind's Atari game-playing system using C++ and GPU acceleration | 653 |
vict0rsch/deep_learning | A collection of tutorials and resources on implementing deep learning models using Python libraries such as Keras and Lasagne. | 426 |
thedimlebowski/trading-gym | A reinforcement learning framework for developing trading strategies | 550 |
zhengwang100/rect | A deep learning framework for graph representation learning with partially labeled data | 18 |
intelligent-machine-learning/dlrover | An automatic distributed deep learning system that simplifies the training of large AI models | 1,270 |
rksltnl/deep-metric-learning-cvpr16 | A software framework for building deep metric learning models using lifted structured feature embedding | 342 |
pucklaj/dynareadout | A C/C++ library for parsing binary output files and key files of LS Dyna simulations. | 16 |
eckucukoglu/cengball | A soccer simulation game with artificial intelligence and stochastic structure, designed to be programmed by users. | 33 |
anirudhjoshi/fluid_table_tennis | A web application that combines a game of table tennis with a fluid dynamics simulation | 78 |
chrisnc/tangaroa | A toy implementation of the Raft protocol with an experimental BFT variant in Haskell. | 111 |
tdeboissiere/deeplearningimplementations | A collection of implementations of recent deep learning papers in Python | 1,815 |