Collaborative-Incentives
Collaboration incentive code
Experimental code mirroring a research study on collaboration and incentives in machine learning
6 stars
2 watching
0 forks
Language: Jupyter Notebook
last commit: over 3 years ago Related projects:
Repository | Description | Stars |
---|---|---|
rle-foundation/rlexplore | Provides a unified toolkit for constructing, computing, and optimizing intrinsic reward modules in reinforcement learning | 373 |
instadeepai/jumanji | A suite of scalable reinforcement learning environments | 657 |
catalyst-team/catalyst-rl | A PyTorch framework for accelerating reinforcement learning research and development by providing a modular, reusable, and customizable training loop | 46 |
rlworkgroup/garage | A toolkit for developing and evaluating reinforcement learning algorithms in a reproducible manner | 1,893 |
ethanyanjiali/minchatgpt | This project demonstrates the effectiveness of reinforcement learning from human feedback (RLHF) in improving small language models like GPT-2. | 214 |
google-research/deep_ope | Provides benchmarking policies and datasets for offline reinforcement learning | 85 |
catalyst-team/alchemy | Provides tools and infrastructure to log and visualize experiments in deep learning research | 50 |
stable-baselines-team/rl-colab-notebooks | A collection of Jupyter Notebooks demonstrating various reinforcement learning techniques with the Stable Baselines3 library | 209 |
yandex/rep | A toolset for building and running reproducible machine learning experiments in Python | 689 |
trekhleb/machine-learning-experiments | An interactive platform for exploring and comparing various machine learning algorithms and techniques using visualizations and example code. | 1,667 |
ron1818/phd_code | A collection of source code and supporting materials for a PhD study on ensemble learning methods and machine learning algorithms. | 45 |
google-research/rlds | A toolkit for storing and manipulating episodic data in reinforcement learning and related tasks. | 302 |
haoyuzhao123/soteriafl | Numerical experiments for private federated learning with communication compression algorithms | 7 |
rlworkgroup/dowel | A tool for logging and tracking machine learning research progress in Python | 32 |
google-research/relay-policy-learning | Environments and data for training reinforcement learning agents in a kitchen simulator | 108 |