Name | Version | Summary | date |
indoxJudge |
0.1.0 |
Indox Judge |
2024-12-19 14:09:13 |
chainforge |
0.3.2.4 |
A Visual Programming Environment for Prompt Engineering |
2024-12-16 21:26:19 |
costra |
1.1 |
None |
2024-12-13 12:08:58 |
ragbits-guardrails |
0.5.1 |
Guardrails module for Ragbits components |
2024-12-09 13:49:49 |
ragbits-evaluate |
0.5.1 |
Evaluation module for Ragbits components |
2024-12-09 13:49:48 |
eyantra-autoeval |
0.1.49 |
A python module to aid auto evaluation |
2024-12-06 13:29:17 |
reco-eval-tool |
1.1.5 |
Reco evaluation tool |
2024-12-06 06:32:43 |
daindex |
0.5.2 |
Deterioration Allocation Index Framework |
2024-12-04 18:45:01 |
hulu-evaluate |
0.0.2 |
Client library to train and evaluate models on the HuLu benchmark. |
2024-12-04 13:14:56 |
xretrieval |
0.2.0 |
Retrieve and Evaluate with X(any) models |
2024-12-04 07:13:48 |
enoslib |
10.0.1 |
A library to build (distributed) systems experiments |
2024-11-28 18:38:08 |
llama-index-packs-llama-dataset-metadata |
0.3.0 |
llama-index packs llama_dataset_metadata integration |
2024-11-17 22:43:11 |
ntqr |
0.4.2.3 |
Tools for the logic of evaluation using unlabeled data |
2024-10-31 13:59:07 |
lighteval |
0.6.2 |
A lightweight and configurable evaluation package |
2024-10-23 14:11:49 |
DAindex |
0.1.0 |
Deterioration Allocation Index Framework |
2024-09-12 17:06:46 |
evaluate |
0.4.3 |
HuggingFace community-driven open-source library of evaluation |
2024-09-11 10:15:32 |
simple-smatch |
0.1.3.1 |
Simple Smatch |
2024-08-10 21:39:47 |
nutcracker-py |
0.0.2a2 |
streamline LLM evaluation |
2024-08-03 10:09:01 |
semevalplatform |
0.0.11.post1 |
Semantic Evaluation Platform |
2024-07-15 09:12:37 |
ragtime |
0.0.43 |
Ragtime 🎹 is an LLMOps framework to automatically evaluate Retrieval Augmented Generation (RAG) systems and compare different RAGs / LLMs |
2024-06-10 15:20:30 |