Name | Version | Summary | Date |
--- | --- | --- | --- |
molecule-benchmarks | 0.1.12 | A comprehensive benchmark suite for evaluating generative models for molecules | 2025-07-08 15:49:38 |
beir | 2.1.0 | A Heterogeneous Benchmark for Information Retrieval | 2025-02-25 22:13:36 |
fmbench | 2.1.2 | Benchmark performance of **any Foundation Model (FM)** deployed on **any AWS Generative AI service**, be it **Amazon SageMaker**, **Amazon Bedrock**, **Amazon EKS**, or **Amazon EC2**. The FMs can be deployed on these platforms directly through `FMBench`, or, if they are already deployed, they can be benchmarked through the **Bring your own endpoint** mode supported by `FMBench`. | 2025-02-12 18:28:49 |
catbench | 0.1.20 | CatBench: Benchmark of Graph Neural Networks for Adsorption Energy Predictions in Heterogeneous Catalysis | 2025-01-15 07:07:34 |
scib | 1.1.7 | Evaluating single-cell data integration methods | 2025-01-13 18:53:25 |
nnbench | 0.4.0 | A small framework for benchmarking machine learning models. | 2024-12-03 14:53:48 |
BenchExec | 3.27 | A Framework for Reliable Benchmarking and Resource Measurement. | 2024-11-23 09:21:27 |
crfm-helm | 0.5.4 | Benchmark for language models | 2024-10-10 03:07:51 |
cytobench | 0.1.27 | Benchmarking library for generative algorithms | 2024-09-20 14:48:06 |
perun | 0.8.7 | Measure the energy used by your MPI+Python applications. | 2024-08-16 14:28:45 |
rliable | 1.2.0 | rliable: Reliable evaluation on reinforcement learning and machine learning benchmarks. | 2024-08-12 20:50:58 |
enfobench | 0.7.2 | Energy forecast benchmarking toolkit. | 2024-06-25 10:13:10 |
LevDoom | 1.0.1 | LevDoom: A Generalization Benchmark for Deep Reinforcement Learning | 2024-02-02 14:52:14 |
knows | 1.0.0 | Property graph benchmark that creates graphs with specified node and edge numbers, supporting multiple output formats and visualization | 2024-01-28 22:22:41 |
COOM | 1.0.0 | COOM: Benchmarking Continual Reinforcement Learning on Doom | 2024-01-27 19:26:59 |
RnaBench | 0.1.2 | RNA benchmarking tools and utilities. | 2023-10-31 14:29:05 |
ILAMB | 2.7 | The International Land Model Benchmarking Package | 2023-06-28 11:56:11 |
diefpy | 1.2.1 | Python package for computing diefficiency metrics dief@t and dief@k. | 2023-06-22 08:18:02 |
cellulose-sdk | 0.0.3 | Cellulose Python SDK | 2023-06-12 04:13:14 |
lexer-sdk | 0.0.2 | Lexer Python SDK | 2023-02-06 23:41:42 |