Name | Version | Summary | Date |
anemoi-inference | 0.7.2 | A package to run inference from data-driven weather forecast models. | 2025-08-22 09:17:29 |
b10-tcache | 0.3.5 | Distributed PyTorch compilation cache for Baseten - Environment-aware, lock-free compilation cache management | 2025-08-21 17:30:04 |
openvino-easy | 1.0.0 | Framework-agnostic Python wrapper for OpenVINO 2025 | 2025-08-21 04:53:21 |
finsim | 1.2.0 | Financial simulation and inference | 2025-08-21 04:35:52 |
kglab | 0.7.0 | a simple abstraction layer in Python for building knowledge graphs | 2025-08-20 02:53:18 |
optimum-benchmark | 0.6.0 | Optimum-Benchmark is a unified multi-backend utility for benchmarking Transformers, Timm, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes. | 2025-08-19 22:40:14 |
matrice-inference | 0.1.1 | Common utilities for Matrice.ai services | 2025-08-19 11:43:09 |
celux | 0.7.3 | Lightspeed video decoding directly into tensors! | 2025-08-17 18:19:38 |
xgrammar | 0.1.23 | Efficient, Flexible and Portable Structured Generation | 2025-08-15 07:31:42 |
llmbuilder | 0.4.6 | A comprehensive toolkit for building, training, and deploying language models | 2025-08-14 20:16:12 |
aiconfigurator | 0.1.0 | aiconfigurator: automatic disaggregated serving offline configuration | 2025-08-12 19:10:42 |
mblt-model-zoo | 0.3.2 | A collection of pre-quantized AI models for Mobilint NPUs. | 2025-08-12 07:19:45 |
ai-dynamo-runtime | 0.4.0 | Dynamo Inference Framework Runtime | 2025-08-11 23:23:51 |
ai-dynamo | 0.4.0 | Distributed Inference Framework | 2025-08-11 23:22:20 |
baseten-tcache | 0.0.1 | Distributed PyTorch compilation cache for Baseten - Environment-aware, lock-free compilation cache management | 2025-08-11 18:53:18 |
llmq | 0.0.4 | High-Performance vLLM Job Queue Package | 2025-08-11 08:21:32 |
trtruntime | 0.1.0 | A lightweight TensorRT inference runtime for Python, inspired by onnxruntime | 2025-08-10 20:12:58 |
verbatim-llm | 0.2.0 | Library to mitigate verbatim or near-verbatim memorization in LLMs | 2025-08-10 00:38:18 |
torch-tensorrt | 2.8.0 | Torch-TensorRT is a package which allows users to automatically compile PyTorch and TorchScript modules to TensorRT while remaining in PyTorch | 2025-08-09 06:02:00 |
b10-xgrammar | 0.1.22 | Efficient, Flexible and Portable Structured Generation | 2025-08-08 23:28:16 |