Name | Version | Summary | date |
ai-edge-quantizer-nightly |
0.4.0.dev20250916 |
A quantizer for advanced developers to quantize converted AI Edge models. |
2025-09-16 00:12:40 |
gptqmodel |
4.2.0 |
Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang. |
2025-09-12 09:35:46 |
autopack-grn |
0.1.3.2 |
CLI to quantize and release Hugging Face models in multiple formats |
2025-09-11 10:42:43 |
hilbert-quantization |
1.3.0 |
Ultra-fast similarity search with Hilbert curve quantization and MPEG-AI compression |
2025-09-03 09:52:48 |
hgq2 |
0.1.1 |
High Granularity Quantization 2 |
2025-09-01 00:37:21 |
vector-quantize-pytorch |
1.23.2 |
Vector Quantization - Pytorch |
2025-08-29 18:45:28 |
unfake |
1.0.4 |
High-performance tool for improving AI-generated pixel art |
2025-08-23 17:43:23 |
metis-agent |
0.19.0 |
Advanced AI agent framework with composable assets (personas, instructions, workflows, skills), Claude Code-style CLI, multi-agent orchestration, 36+ tools, and enterprise security |
2025-08-23 17:26:29 |
torch-floating-point |
0.0.11 |
A PyTorch library for custom floating point quantization with autograd support |
2025-08-23 11:59:02 |
llmcompressor |
0.7.1 |
A library for compressing large language models utilizing the latest techniques and research in the field for both training aware and post training techniques. The library is designed to be flexible and easy to use on top of PyTorch and HuggingFace Transformers, allowing for quick experimentation. |
2025-08-21 21:36:37 |
ai-edge-quantizer |
0.3.0 |
A quantizer for advanced developers to quantize converted AI Edge models. |
2025-08-21 20:40:00 |
openvino-easy |
1.0.0 |
Framework-agnostic Python wrapper for OpenVINO 2025 |
2025-08-21 04:53:21 |
optimum-benchmark |
0.6.0 |
Optimum-Benchmark is a unified multi-backend utility for benchmarking Transformers, Timm, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes. |
2025-08-19 22:40:14 |
bitsandbytes |
0.47.0 |
k-bit optimizers and matrix multiplication routines. |
2025-08-11 18:51:20 |
optimum-intel |
1.25.0 |
Optimum Library is an extension of the Hugging Face Transformers library, providing a framework to integrate third-party libraries from Hardware Partners and interface with their specific functionality. |
2025-08-04 16:41:28 |
tensorflores |
0.1.11 |
TensorFlores is a Python-based framework for optimizing machine learning deployment in resource-constrained environments, with support for TinyML, EdgeAI, and quantization. |
2025-08-04 00:29:03 |
optimum |
1.27.0 |
Optimum Library is an extension of the Hugging Face Transformers library, providing a framework to integrate third-party libraries from Hardware Partners and interface with their specific functionality. |
2025-07-30 16:40:44 |
fedcore |
0.0.5.3 |
Federated learning core library |
2025-07-09 21:11:50 |
llmcompressor-nightly |
0.4.1.20250314 |
A library for compressing large language models utilizing the latest techniques and research in the field for both training aware and post training techniques. The library is designed to be flexible and easy to use on top of PyTorch and HuggingFace Transformers, allowing for quick experimentation. |
2025-03-14 03:23:13 |
kvquant |
0.0.1 |
More for Keys, Less for Values: Adaptive KV Cache Quantization 🐍🚀🎉🦕 |
2025-02-27 20:12:37 |