Name | Version | Summary | Date |
GAICo | 0.2.0 | GenAI Results Comparator, GAICo, is a Python library to help compare, analyze and visualize outputs from Large Language Models (LLMs), often against a reference text. In doing so, one can use a range of extensible metrics from the literature. | 2025-07-15 02:17:28 |
evaluate | 0.4.5 | HuggingFace community-driven open-source library of evaluation | 2025-07-10 13:26:46 |
ragbits-guardrails | 1.1.0 | Guardrails module for Ragbits components | 2025-07-09 15:46:13 |
ragbits-evaluate | 1.1.0 | Evaluation module for Ragbits components | 2025-07-09 15:46:12 |
enoslib | 10.2.0 | A library to build (distributed) systems experiments | 2025-07-08 22:21:18 |
zeroeval | 0.2.9 | ZeroEval SDK | 2025-07-08 18:36:25 |
subset2evaluate | 1.0.5 | Find informative examples to efficiently (human-)evaluate NLG models. | 2025-02-19 16:13:55 |
daindex | 0.6.4 | Deterioration Allocation Index Framework | 2025-02-09 11:15:26 |
mlrl-testbed | 0.11.2 | Provides utilities for the training and evaluation of machine learning algorithms | 2025-01-22 21:47:40 |
uval | 0.2.1 | This python package is meant to provide a high level interface to facilitate the evaluation of object detection and segmentation algorithms that operate on 3D volumetric data. | 2025-01-21 18:48:28 |
llm-evaluation-in-reasoning | 1.4.2 | A project for evaluating reasoning capabilities in large language models (LLMs). | 2025-01-17 07:13:34 |
lighteval | 0.7.0 | A lightweight and configurable evaluation package | 2025-01-03 15:44:54 |
chainforge | 0.3.2.8 | A Visual Programming Environment for Prompt Engineering | 2024-12-29 16:33:06 |
indoxJudge | 0.1.0 | Indox Judge | 2024-12-19 14:09:13 |
costra | 1.1 | None | 2024-12-13 12:08:58 |
eyantra-autoeval | 0.1.49 | A python module to aid auto evaluation | 2024-12-06 13:29:17 |
reco-eval-tool | 1.1.5 | Reco evaluation tool | 2024-12-06 06:32:43 |
hulu-evaluate | 0.0.2 | Client library to train and evaluate models on the HuLu benchmark. | 2024-12-04 13:14:56 |
xretrieval | 0.2.0 | Retrieve and Evaluate with X(any) models | 2024-12-04 07:13:48 |
llama-index-packs-llama-dataset-metadata | 0.3.0 | llama-index packs llama_dataset_metadata integration | 2024-11-17 22:43:11 |