Name | Version | Summary | Date |
dyff-schema | 0.24.1 | Data models for the Dyff AI auditing platform. | 2025-02-10 02:09:30 |
langsmith | 0.3.8 | Client library to connect to the LangSmith LLM Tracing and Evaluation Platform. | 2025-02-09 23:35:06 |
python-lilypad | 0.0.15 | An open-source prompt engineering framework. | 2025-02-08 00:07:45 |
judges | 0.0.6 | A small library of research-backed LLM judges | 2025-02-07 21:05:29 |
pyevalai | 0.0.7 | Automated python exercise evaluations with AI. | 2025-02-06 01:22:32 |
dyff | 0.31.0 | Meta-package to install the local SDK for the Dyff AI auditing platform. | 2025-02-05 03:51:22 |
nuggetizer | 0.0.5 | A package for Nuggetizer - a tool for information nugget creation and assignment to LLM-generated answers. | 2025-02-04 23:25:12 |
dyff-audit | 0.10.5 | Audit tools for the Dyff AI auditing platform. | 2025-02-04 05:42:40 |
evo | 1.30.6 | Python package for the evaluation of odometry and SLAM | 2025-02-02 16:01:02 |
corec | 1.0.6 | A Context-Aware Recommendation Framework for Python | 2025-02-01 20:17:56 |
frechet-music-distance | 1.0.0 | A library for computing Frechet Music Distance. | 2025-01-31 17:39:54 |
tieval | 0.1.8 | A framework for evaluation and development of temporal-aware models. | 2025-01-29 10:01:54 |
agenta | 0.32.0 | The SDK for agenta is an open-source LLMOps platform. | 2025-01-27 15:58:02 |
AutoRAG | 0.3.13 | Automatically Evaluate RAG pipelines with your own data. Find optimal structure for new RAG product. | 2025-01-25 05:28:38 |
dyff-client | 0.15.2 | Python client for the Dyff AI auditing platform. | 2025-01-24 03:56:28 |
evalscope | 0.10.1 | EvalScope: Lightweight LLMs Evaluation Framework | 2025-01-23 05:45:05 |
opencompass | 0.4.0 | A comprehensive toolkit for large model evaluation | 2025-01-22 06:42:16 |
phasellm | 0.0.25 | Wrappers for common large language models (LLMs) with support for evaluation. | 2025-01-21 06:08:40 |
vellum-uptrain-fork | 0.7.2 | Vellum UpTrain Fork | 2025-01-16 22:17:04 |
tokenization-scorer | 1.1.8 | Package for evaluating text tokenizations. | 2025-01-13 10:36:40 |