optimum-benchmark

🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.

GitHub

233 stars
5 watching
41 forks
Language: Python
last commit: 10 days ago
Linked from 1 awesome list

benchmarkneural-compressoronnxruntimeopenvinopytorchtensorrt-llmtext-generation-inference

Backlinks from these awesome lists: