Name | Version | Summary | Date |
quantizers | 1.0.1 | None | 2024-11-20 02:17:02 |
neural-compressor | 3.1.1 | Repository of Intel® Neural Compressor | 2024-11-01 08:57:34 |
neural-compressor-pt | 3.1.1 | Repository of Intel® Neural Compressor | 2024-11-01 08:50:46 |
neural-compressor-tf | 3.1.1 | Repository of Intel® Neural Compressor | 2024-11-01 08:46:37 |
mobius-faster-whisper | 1.1.1 | Mobius Version of Faster Whisper transcription with CTranslate2 | 2024-10-24 14:02:53 |
ctranslate2 | 4.5.0 | Fast inference engine for Transformer models | 2024-10-22 13:32:16 |
topai-faster-whisper | 1.0.4.post4 | Faster Whisper transcription with CTranslate2 | 2024-10-17 09:28:10 |
auto-round | 0.3.1 | Repository of AutoRound: Advanced Weight-Only Quantization Algorithm for LLMs | 2024-10-17 08:35:19 |
pngquant-cli | 3.0.3 | Precompiled binaries for pngquant, the lossy PNG compressor based on libimagequant. | 2024-10-04 08:58:36 |
bitsandbytes | 0.44.1 | k-bit optimizers and matrix multiplication routines. | 2024-09-30 16:20:50 |
mindspore-gs | 0.5.0 | A MindSpore model optimization algorithm set.. | 2024-08-15 04:17:53 |
neural-compressor-3x-tf | 3.0 | Repository of Intel® Neural Compressor | 2024-08-11 13:26:43 |
neural-compressor-3x-pt | 3.0 | Repository of Intel® Neural Compressor | 2024-08-11 13:24:08 |
onnx-neural-compressor | 1.0 | Repository of Neural Compressor ORT | 2024-07-31 16:36:14 |
neural-solution | 2.6.1 | Repository of Intel® Neural Compressor | 2024-07-02 03:30:03 |
faster-whisper | 1.0.3 | Faster Whisper transcription with CTranslate2 | 2024-07-01 10:06:25 |
qattn | 0.1.1 | Efficient GPU Kernels in Triton for Quantized Vision Transformers | 2024-06-21 17:27:15 |
neural-insights | 2.6 | Repository of Intel® Neural Compressor | 2024-06-14 14:50:33 |
intel-extension-for-transformers | 1.4.2 | Repository of Intel® Intel Extension for Transformers | 2024-05-24 09:22:06 |
neural-compressor-3x-ort | 2.5.1 | Repository of Intel® Neural Compressor | 2024-04-03 14:11:31 |