Name | Version | Summary | date |
haupt |
2.9.3 |
Lineage metadata API, artifacts streams, sandbox, ML-API, and spaces for Polyaxon. |
2025-08-15 15:28:33 |
polyaxon |
2.9.3 |
Command Line Interface (CLI) and client to interact with Polyaxon API. |
2025-08-15 15:28:25 |
swe-ai-agent |
2.1.7 |
Headless Agentic IDE with reasoning mode and in built Browser |
2025-08-15 06:19:01 |
rlgym-tools |
2.3.10 |
Extra tools for RLGym. |
2025-08-14 14:28:09 |
pybandits |
4.0.13 |
Python Multi-Armed Bandit Library |
2025-08-14 08:27:16 |
openrubricrl |
0.1.0 |
Open-source pipeline that converts human-written rubrics into LLM-based reward functions for RL and RLHF training |
2025-08-13 23:34:29 |
mab-lite-dhruv |
0.1.2 |
Minimal multi-armed bandit helpers: pure exploration & pure exploitation. |
2025-08-11 03:20:48 |
metasim-core |
0.1.0 |
Meta-Sim: A unified simulation framework for robotics |
2025-08-10 09:41:06 |
verifiers |
0.1.2.post0 |
Verifiers for reinforcement learning with LLMs |
2025-08-09 00:38:28 |
abundant-sdk |
0.2.4 |
Python SDK for the Abundant Environment API - simulation environments for RL agent training |
2025-08-07 18:52:18 |
reinforced-lib |
1.2.3 |
Reinforcement learning library |
2025-07-30 10:45:52 |
autospice |
0.1.5 |
SPICE: Sparse and Interpretable Cognitive Equations |
2025-07-23 16:25:37 |
rlgym-learn-algos |
0.2.5 |
Algorithm implementations for rlgym-learn |
2025-07-23 03:46:59 |
toulouse |
1.1.1 |
High-performance card library for ML and RL applications. |
2025-07-21 16:04:10 |
gama-gymnasium |
0.1.1 |
A Gymnasium environment for reinforcement learning with GAMA agent-based simulations |
2025-07-17 04:09:06 |
arc-advisor |
0.1.0 |
The learning co-pilot for AI agents. Implements the Executor-Advisor pattern for building self-improving agentic systems. |
2025-07-15 02:17:42 |
congruent |
0.0.8 |
A CLI for interacting with LLMs, with tracking, validation, integrations, and UI. |
2025-02-12 10:50:00 |
pokerkit |
0.6.1 |
An open-source Python library for poker game simulations, hand evaluations, and statistical analysis |
2025-01-21 03:34:18 |
rlportfolio |
0.2.1 |
Reinforcement learning framework for portfolio optimization tasks. |
2025-01-16 04:39:00 |
adam-robotics |
0.3.3 |
Automatic Differentiation for rigid-body-dynamics AlgorithMs |
2025-01-08 17:28:15 |