AlphaZero_Gomoku

Gomoku AI model

An implementation of the AlphaZero algorithm for playing Gomoku from pure self-play training

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

GitHub

3k stars

102 watching

972 forks

Language: Python

last commit: over 1 year ago

alphagoalphago-zeroalphazeroboard-gamegobanggomokumctsmonte-carlo-tree-searchpytorchreinforcement-learningrlself-learningtensorflow

Related projects:

Repository	Description	Stars
zeta36/chess-alpha-zero	An implementation of AlphaGo Zero's reinforcement learning approach to master the game of chess	2,137
gorgonia/gorgonia	A low-level machine learning and graph computation library for Go.	5,582
leela-zero/leela-zero	A Go program implementing a neural network-based AI system designed to play the game of Go without human-provided knowledge.	5,368
packtpublishing/hands-on-intelligent-agents-with-openai-gym	Teaching software developers to build intelligent agents using deep reinforcement learning and OpenAI Gym	374
ikostrikov/pytorch-a2c-ppo-acktr-gail	An open-source implementation of several reinforcement learning algorithms in PyTorch	3,644
tju-drl-lab/ai-optimizer	A next-generation deep reinforcement learning toolkit with libraries for multiagent, self-supervised, offline, and transfer/reinforcement learning	4,848
farama-foundation/gymnasium	Develops and compares reinforcement learning algorithms by providing a standard API to communicate between learning algorithms and environments	7,613
inancgumus/learngo	A repository of thousands of Go examples and exercises to help developers learn the language by fixing and solving problems.	18,987
thu-ml/tianshou	A high-performance reinforcement learning library with modular interfaces and user-friendly APIs for building deep learning agents.	8,069
dvyukov/go-fuzz	A tool for generating and testing random inputs to ensure software reliability	4,790
hubfire/muti-branch-ddpg-carla	An implementation of a reinforcement learning algorithm using multi-branch architecture and Deep Deterministic Policy Gradients (DDPG) to control autonomous vehicles in simulation environments.	81
erikbern/deep-pink	An AI system designed to play chess using deep learning techniques	813
alexis-jacq/lola_dice	PyTorch implementation of LOLA using DiCE for decision-making in game-playing AI	91
joonspk-research/generative_agents	A research project simulating human behavior in interactive environments.	17,888
golang/go	A programming language designed to build simple, reliable, and efficient software	124,564