Name | Version | Summary | date |
---|---|---|---|
autoawq | 0.2.8 | AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. | 2025-01-20 11:03:42 |
autoawq-kernels | 0.0.9 | AutoAWQ Kernels implements the AWQ kernels. | 2024-11-16 15:42:59 |
optimum-benchmark | 0.4.0 | Optimum-Benchmark is a unified multi-backend utility for benchmarking Transformers, Timm, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes. | 2024-07-31 08:35:53 |
hour | day | week | total |
---|---|---|---|
80 | 2124 | 10202 | 312902 |