| Name | Version | Summary | date |
| timingstack |
0.1.5 |
A powerful timing and performance measurement library for Python applications |
2025-10-20 08:53:42 |
| nvidia-crfm-helm |
25.8 |
NVIDIA: Benchmark for language models - Fork of Stanford CRFM HELM |
2025-09-04 10:50:17 |
| molecule-benchmarks |
0.1.13 |
A comprehensive benchmark suite for evaluating generative models for molecules |
2025-08-31 11:29:19 |
| beir |
2.1.0 |
A Heterogeneous Benchmark for Information Retrieval |
2025-02-25 22:13:36 |
| fmbench |
2.1.2 |
Benchmark performance of **any Foundation Model (FM)** deployed on **any AWS Generative AI service**, be it **Amazon SageMaker**, **Amazon Bedrock**, **Amazon EKS**, or **Amazon EC2**. The FMs could be deployed on these platforms either directly through `FMbench`, or, if they are already deployed then also they could be benchmarked through the **Bring your own endpoint** mode supported by `FMBench`. |
2025-02-12 18:28:49 |
| catbench |
0.1.20 |
CatBench: Benchmark of Graph Neural Networks for Adsorption Energy Predictions in Heterogeneous Catalysis |
2025-01-15 07:07:34 |
| scib |
1.1.7 |
Evaluating single-cell data integration methods |
2025-01-13 18:53:25 |
| nnbench |
0.4.0 |
A small framework for benchmarking machine learning models. |
2024-12-03 14:53:48 |
| BenchExec |
3.27 |
A Framework for Reliable Benchmarking and Resource Measurement. |
2024-11-23 09:21:27 |
| cytobench |
0.1.27 |
Benchmarking library for generative algorithms |
2024-09-20 14:48:06 |
| perun |
0.8.7 |
Measure the energy used by your MPI+Python applications. |
2024-08-16 14:28:45 |
| rliable |
1.2.0 |
rliable: Reliable evaluation on reinforcement learning and machine learning benchmarks. |
2024-08-12 20:50:58 |
| enfobench |
0.7.2 |
Energy forecast benchmarking toolkit. |
2024-06-25 10:13:10 |
| COOM |
1.0.0 |
COOM: Benchmarking Continual Reinforcement Learning on Doom |
2024-01-27 19:26:59 |
| RnaBench |
0.1.2 |
RNA benchmarking tools and utilities. |
2023-10-31 14:29:05 |
| ILAMB |
2.7 |
The International Land Model Benchmarking Package |
2023-06-28 11:56:11 |
| diefpy |
1.2.1 |
Python package for computing diefficiency metrics dief@t and dief@k. |
2023-06-22 08:18:02 |
| cellulose-sdk |
0.0.3 |
Cellulose Python SDK |
2023-06-12 04:13:14 |
| lexer-sdk |
0.0.2 |
Lexer Python SDK |
2023-02-06 23:41:42 |
| hydrogym |
0.1.2.1 |
A Reinforcement Learning Benchmarking Environment for Fluid Dynamics |
2022-12-26 02:30:35 |