Name | Version | Summary | Date |
mteb | 1.38.35 | Massive Text Embedding Benchmark | 2025-07-16 20:16:26 |
DLcomm | 0.2.0 | Distributed GPU Communication Benchmarking Framework for Deep Learning | 2025-07-16 15:24:15 |
nodespecs | 0.1.7 | The specs summarize utilities for computer instance | 2025-07-14 18:39:15 |
mrna-bench | 1.2.2 | Benchmarking suite for mRNA property prediction. | 2025-07-13 19:53:54 |
pytest-codspeed | 4.0.0 | Pytest plugin to create CodSpeed benchmarks | 2025-07-10 08:37:53 |
swesmith | 0.0.5 | The official SWE-smith package - A toolkit for generating software engineering training data at scale. | 2025-07-09 19:41:33 |
swebench | 3.0.15 | The official SWE-bench package - a benchmark for evaluating LMs on software engineering | 2025-03-02 23:50:15 |
tpch-runner | 1.0.1 | A tool for running TPC-H benchmarks and analyzing results. | 2025-02-22 22:07:06 |
fusion-bench | 0.2.10 | A Comprehensive Benchmark of Deep Model Fusion | 2025-02-13 01:13:47 |
rdt | 1.14.0 | Reversible Data Transforms | 2025-02-12 02:14:26 |
sdgym | 0.10.0 | Benchmark tabular synthetic data generators using a variety of datasets | 2025-02-07 02:37:02 |
airflow-parse-bench | 1.0.1 | Easily measure and compare your Airflow DAGs' parse time. | 2025-01-26 03:39:23 |
opencompass | 0.4.0 | A comprehensive toolkit for large model evaluation | 2025-01-22 06:42:16 |
folktexts | 0.0.27 | Use LLMs to get classification risk scores on tabular tasks. | 2025-01-17 16:27:47 |
pydftracer | 1.0.8 | I/O profiler for deep learning python apps. Specifically for dlio_benchmark. | 2024-12-17 03:37:11 |
qpbenchmark | 2.4.0 | Benchmark for quadratic programming solvers available in Python. | 2024-12-16 09:24:00 |
ms-opencompass | 0.1.5 | A lightweight toolkit for evaluating LLMs based on OpenCompass. | 2024-12-16 08:05:22 |
mlrb-agent-tasks | 0.0.23 | A task package for ML Research Bench | 2024-12-10 16:21:44 |
EpiLog | 1.1.2 | Simple No-Frills Logging Manager | 2024-12-06 21:32:56 |
Younger | 0.0.1a2 | A Younger Project for Artificial Intelligence: Datasets, Benchmarks, and Applications. | 2024-11-25 08:01:45 |