AlphaZero_Gomoku

Gomoku AI model

An implementation of the AlphaZero algorithm for playing Gomoku (also called Gobang or Five in a Row), trained purely through self-play

GitHub

Stars: 3k
Watching: 101
Forks: 969
Language: Python
Last commit: 7 months ago
Tags: alphago, alphago-zero, alphazero, board-game, gobang, gomoku, mcts, monte-carlo-tree-search, pytorch, reinforcement-learning, rl, self-learning, tensorflow

Related projects:

Repository | Description | Stars
zeta36/chess-alpha-zero | An implementation of AlphaGo Zero's reinforcement learning approach to master the game of chess | 2,133
gorgonia/gorgonia | A low-level machine learning and graph computation library for Go | 5,555
leela-zero/leela-zero | A Go program implementing a neural network-based AI system designed to play the game of Go without human-provided knowledge | 5,360
packtpublishing/hands-on-intelligent-agents-with-openai-gym | Teaching software developers to build intelligent agents using deep reinforcement learning and OpenAI Gym | 373
ikostrikov/pytorch-a2c-ppo-acktr-gail | An open-source implementation of several reinforcement learning algorithms in PyTorch | 3,605
tju-drl-lab/ai-optimizer | A next-generation deep reinforcement learning toolkit with libraries for multiagent, self-supervised, offline, and transfer/reinforcement learning | 4,755
farama-foundation/gymnasium | Develops and compares reinforcement learning algorithms by providing a standard API to communicate between learning algorithms and environments | 7,374
inancgumus/learngo | A repository of thousands of Go examples and exercises to help developers learn the language by fixing and solving problems | 18,903
thu-ml/tianshou | A high-performance reinforcement learning library with modular interfaces and user-friendly APIs for building deep learning agents | 7,968
dvyukov/go-fuzz | A tool for generating and testing random inputs to ensure software reliability | 4,780
hubfire/muti-branch-ddpg-carla | An implementation of a reinforcement learning algorithm using multi-branch architecture and Deep Deterministic Policy Gradients (DDPG) to control autonomous vehicles in simulation environments | 79
erikbern/deep-pink | An AI system designed to play chess using deep learning techniques | 813
alexis-jacq/lola_dice | PyTorch implementation of LOLA using DiCE for decision-making in game-playing AI | 90
joonspk-research/generative_agents | A research project simulating human behavior in interactive environments | 17,558
golang/go | A programming language designed to build simple, reliable, and efficient software | 124,071