AlphaZero_Gomoku

Gomoku AI model

An implementation of the AlphaZero algorithm for playing Gomoku from pure self-play training

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

GitHub

3k stars
102 watching
972 forks
Language: Python
last commit: 9 months ago
alphagoalphago-zeroalphazeroboard-gamegobanggomokumctsmonte-carlo-tree-searchpytorchreinforcement-learningrlself-learningtensorflow

Related projects:

Repository Description Stars
zeta36/chess-alpha-zero An implementation of AlphaGo Zero's reinforcement learning approach to master the game of chess 2,137
gorgonia/gorgonia A low-level machine learning and graph computation library for Go. 5,582
leela-zero/leela-zero A Go program implementing a neural network-based AI system designed to play the game of Go without human-provided knowledge. 5,368
packtpublishing/hands-on-intelligent-agents-with-openai-gym Teaching software developers to build intelligent agents using deep reinforcement learning and OpenAI Gym 374
ikostrikov/pytorch-a2c-ppo-acktr-gail An open-source implementation of several reinforcement learning algorithms in PyTorch 3,644
tju-drl-lab/ai-optimizer A next-generation deep reinforcement learning toolkit with libraries for multiagent, self-supervised, offline, and transfer/reinforcement learning 4,848
farama-foundation/gymnasium Develops and compares reinforcement learning algorithms by providing a standard API to communicate between learning algorithms and environments 7,613
inancgumus/learngo A repository of thousands of Go examples and exercises to help developers learn the language by fixing and solving problems. 18,987
thu-ml/tianshou A high-performance reinforcement learning library with modular interfaces and user-friendly APIs for building deep learning agents. 8,069
dvyukov/go-fuzz A tool for generating and testing random inputs to ensure software reliability 4,790
hubfire/muti-branch-ddpg-carla An implementation of a reinforcement learning algorithm using multi-branch architecture and Deep Deterministic Policy Gradients (DDPG) to control autonomous vehicles in simulation environments. 81
erikbern/deep-pink An AI system designed to play chess using deep learning techniques 813
alexis-jacq/lola_dice PyTorch implementation of LOLA using DiCE for decision-making in game-playing AI 91
joonspk-research/generative_agents A research project simulating human behavior in interactive environments. 17,888
golang/go A programming language designed to build simple, reliable, and efficient software 124,564