pyIEOE

Evaluation tool

Develops an interpretable evaluation procedure for off-policy evaluation (OPE) methods to quantify their sensitivity to hyper-parameter choices and/or evaluation policy choices.

GitHub

31 stars
2 watching
4 forks
Language: Python
last commit: about 3 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
lartpang/pysodevaltoolkit A comprehensive Python toolbox for evaluating salient object detection and camouflaged object detection tasks 167
prometheus-eval/prometheus-eval An open-source framework that enables language model evaluation using Prometheus and GPT4 796
microsoft/pylance-release A Python language server extension providing code analysis and features like auto-imports and type checking 1,719
sidneycadot/oeis Tools for analyzing and processing sequence data from the Online Encyclopedia of Integer Sequences. 46
pymeasure/pymeasure A Python library for scientific measurement and experiment automation with graphical live plotting capabilities. 630
allenai/olmo-eval An evaluation framework for large language models. 310
jorgenschaefer/elpy An Emacs package to provide a comprehensive Python development environment. 1,900
ymyke/pypme Calculates Public Market Equivalent values and rates for investment analysis 10
open-eo/openeo-python-client A Python client library for interacting with the openEO API to access remote sensing data from various sources. 155
clvoloshin/cobs A toolkit for evaluating and analyzing off-policy policy estimation methods in reinforcement learning 61
tisimst/pydoe Designs experimental procedures in Python to optimize performance 271
metno/pyaerocom Tools for evaluating climate and air quality models using Earth observation data. 26
python/pyperformance An authoritative source of real-world benchmarks for Python implementations. 869
grid-parity-exchange/egret A Python-based package for solving optimization problems in power systems 133
martinkersner/py-img-seg-eval A Python package providing metrics and tools for evaluating image segmentation models 282