private-benchmarking
Benchmarking
A platform for private benchmarking of machine learning models with different trust levels.
A platform that enables users to perform private benchmarking of machine learning models. The platform facilitates the evaluation of models based on different trust levels between the model owners and the dataset owners.
7 stars
3 watching
1 forks
Language: Python
last commit: 3 months ago benchmarkingconfidential-computingcontaminationezpcinferencelarge-language-modelsllms-benchmarkingmpcplatformprivateprivate-benchmarkingsecuretrusted-execution-environment
Related projects:
Repository | Description | Stars |
---|---|---|
mlcommons/inference | Measures the performance of deep learning models in various deployment scenarios. | 1,256 |
catboost/benchmarks | Comparative benchmarks of various machine learning algorithms | 169 |
nikolaydubina/go-ml-benchmarks | A benchmarking project comparing performance of different machine learning inference frameworks and models on Go platform | 30 |
szilard/benchm-ml | A benchmark for evaluating machine learning algorithms' performance on large datasets | 1,874 |
automl/hpobench | A collection of benchmark problems for hyperparameter optimization | 140 |
python/pyperformance | An authoritative source of real-world benchmarks for Python implementations. | 877 |
bencheeorg/benchee | A tool for benchmarking Elixir code and comparing performance statistics | 1,422 |
qcri/llmebench | A benchmarking framework for large language models | 81 |
ecoapm/benchmarkmocknet | Performs performance benchmarking of various .NET mocking libraries | 22 |
openml/automlbenchmark | A framework for evaluating and comparing machine learning pipelines and neural architectures. | 413 |
mazhar-ansari-ardeh/benchmarkfcns | Provides benchmarking functions for mathematical optimization algorithms | 67 |
bencherdev/bencher | Tools and frameworks for continuous performance benchmarking of software systems | 586 |
arlineq/arline_benchmarks | A platform for benchmarking quantum circuit mapping and compression algorithms against various hardware types and target circuit classes. | 31 |
huggingface/optimum-benchmark | A tool for comparing and optimizing the performance of various machine learning frameworks and models on different hardware platforms. | 274 |
damo-nlp-sg/m3exam | A benchmark for evaluating large language models in multiple languages and formats | 93 |