zr-obp

Off-policy eval

A framework for off-policy evaluation and learning in multi-armed bandit algorithms

Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation

GitHub

648 stars

88 watching

89 forks

Language: Python

last commit: about 2 years ago

Linked from 1 awesome list

contextual-banditsdatasetsmulti-armed-banditsoff-policy-evaluationresearch

Backlinks from these awesome lists:

hanjuku-kaso/awesome-offline-rl

Related projects:

Repository	Description	Stars
clvoloshin/cobs	A toolkit for evaluating and analyzing off-policy policy estimation methods in reinforcement learning	61
onlytailei/carla_cil_pytorch	Implementation of a conditional imitation learning policy in PyTorch for autonomous driving using the Carla dataset.	65
matrix-org/rust-opa-wasm	A Rust SDK to evaluate Open Policy Agent policies in WebAssembly format.	50
zzhanghub/eval-co-sod	An evaluation tool for co-saliency detection tasks	97
google-research/dice_rl	This library provides tools and algorithms for estimating the distribution correction in off-policy reinforcement learning problems	99
sony/pyieoe	Develops an interpretable evaluation procedure for off-policy evaluation (OPE) methods to quantify their sensitivity to hyper-parameter choices and/or evaluation policy choices.	31
a2d24/python-opa-wasm	A Python SDK for executing and managing Open Policy Agent policies in WebAssembly format	10
psecio/propauth	Evaluates policies against user credentials and properties to determine access permissions.	59
oeg-upm/lubm4obda	Evaluates Ontology-Based Data Access systems with inference and meta knowledge benchmarking	4
resibots/blackdrops	An open-source policy search algorithm for robotics that uses Gaussian processes to model robot dynamics and accounts for uncertainty.	64
netflix-skunkworks/policyuniverse	A Python package for parsing and processing AWS IAM policies and statements.	427
denisyarats/exorl	Provides exploratory data and algorithms for offline reinforcement learning in various control domains	105
demisto/cops	Standardized framework for creating and sharing incident response processes in a shared language	151
lartpang/pysodevaltoolkit	A comprehensive Python toolbox for evaluating salient object detection and camouflaged object detection tasks	168
hubfire/muti-branch-ddpg-carla	An implementation of a reinforcement learning algorithm using multi-branch architecture and Deep Deterministic Policy Gradients (DDPG) to control autonomous vehicles in simulation environments.	81