opponent

Opponent modeler

This implementation learns adaptive strategies against different opponents in reinforcement learning using a Deep Q-Network framework.

Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"

GitHub

71 stars

7 watching

18 forks

Language: Lua

last commit: almost 10 years ago

Linked from 1 awesome list

Backlinks from these awesome lists:

carpedm20/awesome-torch

Related projects:

Repository	Description	Stars
01-ai/yi	A series of large language models trained from scratch to excel in multiple NLP tasks	7,743
yiyangzhou/lure	Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability.	136
keplr-io/hera	Enables real-time training and evaluation of machine learning models with remote dashboard visualization.	487
itijyou/ademxapp	This is an open-source implementation of a deep learning model for semantic image segmentation and image classification.	340
eryixie/planerecnet	An implementation of a deep learning model for instance segmentation and monocular depth estimation.	79
yuyang-huang/keras-inception-resnet-v2	Represents an implementation of the Inception-ResNet v2 deep learning model in Keras.	180
locuslab/e2e-model-learning	Develops an approach to learning probabilistic models in stochastic optimization problems	201
lonepatient/nezha_chinese_pytorch	An implementation of a Chinese language model using PyTorch and transformer architecture.	262
lxtgh/omg-seg	Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model.	1,336
jiasenlu/cdssm	An implementation of a specific deep learning model in PyTorch for natural language processing tasks.	41
5vision/darqn	An implementation of a deep reinforcement learning model for continuous control tasks	116
h2oai/h2o-llm-eval	An evaluation framework for large language models with Elo rating system and A/B testing capabilities	50
kaixhin/atari	A deep reinforcement learning framework for Atari games using persistent advantage learning and dueling double DQN architecture.	265
jnhwkim/nips-mrn-vqa	This project presents a neural network model designed to answer visual questions by combining question and image features in a residual learning framework.	39
ethanyanjiali/minchatgpt	This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2.	214