Name | Version | Summary | Date |
tno.sdg.tabular.eval.utility-metrics | 0.4.1 | Utility metrics for tabular data | 2024-12-10 13:24:13 |
pyclust-evl | 0.1.0 | A Python library for clustering operations. Evaluation and meta-feature generation. | 2024-12-09 11:51:55 |
AutoRAG | 0.3.12 | Automatically Evaluate RAG pipelines with your own data. Find optimal structure for new RAG product. | 2024-12-09 06:09:23 |
evo | 1.30.4 | Python package for the evaluation of odometry and SLAM | 2024-12-06 16:31:07 |
trajectopy-core | 4.0.0 | Trajectory Evaluation in Python | 2024-12-06 15:42:10 |
unbabel-comet | 2.2.4 | High-quality Machine Translation Evaluation | 2024-12-05 13:09:20 |
redlite | 0.3.8 | LLM testing on steroids | 2024-12-04 20:47:35 |
opencompass | 0.3.7 | A comprehensive toolkit for large model evaluation | 2024-12-04 05:41:45 |
treeval | 0.0.1 | . | 2024-12-02 10:09:06 |
coconut-develop | 3.1.2.post0.dev6 | Simple, elegant, Pythonic functional programming. | 2024-12-01 22:30:19 |
enos | 8.0.0 | Experimental eNvironment for OpenStack | 2024-11-28 16:23:10 |
lsada | 1.9.1 | A flexible evaluation framework for content using LLMs | 2024-11-25 13:02:58 |
pymia | 0.3.4 | A Python package for data handling and evaluation in deep learning-based medical image analysis. | 2024-11-22 10:27:57 |
ConfigSpace | 1.2.1 | Creation and manipulation of parameter configuration spaces for automated algorithm configuration and hyperparameter tuning. | 2024-11-21 10:18:39 |
syntherela | 0.0.3 | SyntheRela - Synthetic Relational Data Generation Benchmark | 2024-11-21 06:20:37 |
llama-index-packs-rag-evaluator | 0.3.0 | llama-index packs rag_evaluator integration | 2024-11-18 01:31:47 |
eval-suite | 2.1.4 | User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc. | 2024-11-12 07:16:03 |
guardbench | 1.0.0 | GuardBench: A Large-Scale Benchmark for Guardrail Models | 2024-11-12 02:44:56 |
evalify | 1.0.0 | Evaluate your face or voice verification models literally in seconds. | 2024-11-08 23:55:00 |
llmjudge | 0.1.0 | A package for evaluating language model outputs. | 2024-11-04 03:12:29 |