Collaborative-Incentives
Collaboration incentive code
Experimental code mirroring a research study on collaboration and incentives in machine learning
6 stars
2 watching
0 forks
Language: Jupyter Notebook
last commit: over 3 years ago Related projects:
Repository | Description | Stars |
---|---|---|
rle-foundation/rlexplore | Provides a unified toolkit for constructing, computing, and optimizing intrinsic reward modules in reinforcement learning | 366 |
instadeepai/jumanji | Provides a suite of scalable reinforcement learning environments for research and development. | 622 |
catalyst-team/catalyst-rl | A PyTorch framework for accelerating reinforcement learning research and development by providing a modular, reusable, and customizable training loop | 46 |
rlworkgroup/garage | A toolkit for developing and evaluating reinforcement learning algorithms in a reproducible manner | 1,880 |
ethanyanjiali/minchatgpt | This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2. | 213 |
google-research/deep_ope | A set of pre-trained reinforcement learning policies and benchmarking data for offline model selection in reinforcement learning. | 85 |
catalyst-team/alchemy | Provides tools and infrastructure to log and visualize experiments in deep learning research | 50 |
stable-baselines-team/rl-colab-notebooks | A collection of Jupyter Notebooks demonstrating various reinforcement learning techniques with the Stable Baselines3 library | 208 |
yandex/rep | A toolset for building and running reproducible machine learning experiments in Python | 689 |
trekhleb/machine-learning-experiments | An interactive platform for exploring and comparing various machine learning algorithms and techniques using visualizations and example code. | 1,654 |
ron1818/phd_code | A collection of source code and supporting materials for a PhD study on ensemble learning methods and machine learning algorithms. | 45 |
google-research/rlds | A toolkit for storing and manipulating episodic data in reinforcement learning and related tasks. | 293 |
haoyuzhao123/soteriafl | Numerical experiments for private federated learning with communication compression algorithms | 7 |
rlworkgroup/dowel | A tool for logging and tracking machine learning research progress in Python | 32 |
google-research/relay-policy-learning | Environments and data for training reinforcement learning agents in a kitchen simulator | 107 |