opponent

Opponent modeler

A system for learning adaptive strategies against different opponents in reinforcement learning using deep neural networks

Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"

GitHub

71 stars
7 watching
18 forks
Language: Lua
last commit: over 8 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
01-ai/yi A series of large language models trained from scratch to excel in multiple NLP tasks 7,699
yiyangzhou/lure Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability. 134
keplr-io/hera Enables real-time training and evaluation of machine learning models with remote dashboard visualization. 487
itijyou/ademxapp This is an open-source implementation of a deep learning model for semantic image segmentation and image classification. 340
eryixie/planerecnet An implementation of a deep learning model for instance segmentation and monocular depth estimation. 79
yuyang-huang/keras-inception-resnet-v2 Represents an implementation of the Inception-ResNet v2 deep learning model in Keras. 180
locuslab/e2e-model-learning Develops an approach to learning probabilistic models in stochastic optimization problems 200
lonepatient/nezha_chinese_pytorch An implementation of a Chinese language model using PyTorch and transformer architecture. 262
lxtgh/omg-seg Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model. 1,300
jiasenlu/cdssm An implementation of a specific deep learning model in PyTorch for natural language processing tasks. 40
5vision/darqn An implementation of a deep reinforcement learning model for continuous control tasks 115
h2oai/h2o-llm-eval An evaluation framework for large language models with Elo rating system and A/B testing capabilities 50
kaixhin/atari A deep reinforcement learning framework for Atari games using persistent advantage learning and dueling double DQN architecture. 264
jnhwkim/nips-mrn-vqa This project presents a neural network model designed to answer visual questions by combining question and image features in a residual learning framework. 39
ethanyanjiali/minchatgpt This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2. 213