optimum-benchmark
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
233 stars
5 watching
41 forks
Language: Python
last commit: 10 days ago
Linked from 1 awesome list
benchmarkneural-compressoronnxruntimeopenvinopytorchtensorrt-llmtext-generation-inference