opponent
Opponent modeler
A system for learning adaptive strategies against different opponents in reinforcement learning using deep neural networks
Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"
71 stars
7 watching
18 forks
Language: Lua
last commit: over 8 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
01-ai/yi | A series of large language models trained from scratch to excel in multiple NLP tasks | 7,699 |
yiyangzhou/lure | Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability. | 134 |
keplr-io/hera | Enables real-time training and evaluation of machine learning models with remote dashboard visualization. | 487 |
itijyou/ademxapp | This is an open-source implementation of a deep learning model for semantic image segmentation and image classification. | 340 |
eryixie/planerecnet | An implementation of a deep learning model for instance segmentation and monocular depth estimation. | 79 |
yuyang-huang/keras-inception-resnet-v2 | Represents an implementation of the Inception-ResNet v2 deep learning model in Keras. | 180 |
locuslab/e2e-model-learning | Develops an approach to learning probabilistic models in stochastic optimization problems | 200 |
lonepatient/nezha_chinese_pytorch | An implementation of a Chinese language model using PyTorch and transformer architecture. | 262 |
lxtgh/omg-seg | Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model. | 1,300 |
jiasenlu/cdssm | An implementation of a specific deep learning model in PyTorch for natural language processing tasks. | 40 |
5vision/darqn | An implementation of a deep reinforcement learning model for continuous control tasks | 115 |
h2oai/h2o-llm-eval | An evaluation framework for large language models with Elo rating system and A/B testing capabilities | 50 |
kaixhin/atari | A deep reinforcement learning framework for Atari games using persistent advantage learning and dueling double DQN architecture. | 264 |
jnhwkim/nips-mrn-vqa | This project presents a neural network model designed to answer visual questions by combining question and image features in a residual learning framework. | 39 |
ethanyanjiali/minchatgpt | This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2. | 213 |