opponent
Opponent modeler
This implementation learns adaptive strategies against different opponents in reinforcement learning using a Deep Q-Network framework.
Implementation for ICML 16 paper "Deep reinforcement learning with opponent modeling"
71 stars
7 watching
18 forks
Language: Lua
last commit: over 8 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
| A series of large language models trained from scratch to excel in multiple NLP tasks | 7,743 |
| Analyzing and mitigating object hallucination in large vision-language models to improve their accuracy and reliability. | 136 |
| Enables real-time training and evaluation of machine learning models with remote dashboard visualization. | 487 |
| This is an open-source implementation of a deep learning model for semantic image segmentation and image classification. | 340 |
| An implementation of a deep learning model for instance segmentation and monocular depth estimation. | 79 |
| Represents an implementation of the Inception-ResNet v2 deep learning model in Keras. | 180 |
| Develops an approach to learning probabilistic models in stochastic optimization problems | 201 |
| An implementation of a Chinese language model using PyTorch and transformer architecture. | 262 |
| Develops an end-to-end model for multiple visual perception and reasoning tasks using a single encoder, decoder, and large language model. | 1,336 |
| An implementation of a specific deep learning model in PyTorch for natural language processing tasks. | 41 |
| An implementation of a deep reinforcement learning model for continuous control tasks | 116 |
| An evaluation framework for large language models with Elo rating system and A/B testing capabilities | 50 |
| A deep reinforcement learning framework for Atari games using persistent advantage learning and dueling double DQN architecture. | 265 |
| This project presents a neural network model designed to answer visual questions by combining question and image features in a residual learning framework. | 39 |
| This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2. | 214 |