PyDigger - unearthing stuff about Python

Found 73 out of 310,052. Showing 20 on page 1. Total pages: 4.

Name	Version	Summary	date
ragbits-guardrails	1.2.2	Guardrails module for Ragbits components	2025-08-09 18:12:34
ragbits-evaluate	1.2.2	Evaluation module for Ragbits components	2025-08-09 18:12:33
langsmith	0.4.11	Client library to connect to the LangSmith LLM Tracing and Evaluation Platform.	2025-08-05 00:12:41
llama-index-packs-llama-dataset-metadata	0.4.0	llama-index packs llama_dataset_metadata integration	2025-07-30 20:51:35
daindex	0.8.2	Deterioration Allocation Index Framework	2025-07-25 23:23:24
multimedeval	1.0.0	A Python tool to evaluate the performance of VLM on the medical domain.	2025-07-23 14:44:40
GAICo	0.2.0	GenAI Results Comparator, GAICo, is a Python library to help compare, analyze and visualize outputs from Large Language Models (LLMs), often against a reference text. In doing so, one can use a range of extensible metrics from the literature.	2025-07-15 02:17:28
evaluate	0.4.5	HuggingFace community-driven open-source library of evaluation	2025-07-10 13:26:46
enoslib	10.2.0	A library to build (distributed) systems experiments	2025-07-08 22:21:18
subset2evaluate	1.0.5	Find informative examples to efficiently (human-)evaluate NLG models.	2025-02-19 16:13:55
uval	0.2.1	This python package is meant to provide a high level interface to facilitate the evaluation of object detection and segmentation algorithms that operate on 3D volumetric data.	2025-01-21 18:48:28
llm-evaluation-in-reasoning	1.4.2	A project for evaluating reasoning capabilities in large language models (LLMs).	2025-01-17 07:13:34
lighteval	0.7.0	A lightweight and configurable evaluation package	2025-01-03 15:44:54
chainforge	0.3.2.8	A Visual Programming Environment for Prompt Engineering	2024-12-29 16:33:06
indoxJudge	0.1.0	Indox Judge	2024-12-19 14:09:13
costra	1.1	None	2024-12-13 12:08:58
eyantra-autoeval	0.1.49	A python module to aid auto evaluation	2024-12-06 13:29:17
reco-eval-tool	1.1.5	Reco evaluation tool	2024-12-06 06:32:43
hulu-evaluate	0.0.2	Client library to train and evaluate models on the HuLu benchmark.	2024-12-04 13:14:56
xretrieval	0.2.0	Retrieve and Evaluate with X(any) models	2024-12-04 07:13:48

Found 73 out of 310,052. Showing 20 on page 1. Total pages: 4.

first prev next last