| Name | Version | Summary | date |
| fragaria |
0.1.2 |
Advanced Chain of Thought (CoT) Reasoning API with Reinforcement Learning (RL) |
2025-08-28 21:24:46 |
| awraf-toolkit |
0.1.0 |
Adaptive Weight-Based Resource Allocation Framework for 6G IoT with THz Links and Intelligent Surfaces |
2025-08-25 17:22:01 |
| floxs |
0.1.1 |
Multi-agent RL flock and swarm environments implemented in JAX |
2025-08-21 16:06:04 |
| haupt |
2.9.3 |
Lineage metadata API, artifacts streams, sandbox, ML-API, and spaces for Polyaxon. |
2025-08-15 15:28:33 |
| polyaxon |
2.9.3 |
Command Line Interface (CLI) and client to interact with Polyaxon API. |
2025-08-15 15:28:25 |
| swe-ai-agent |
2.1.7 |
Headless Agentic IDE with reasoning mode and in built Browser |
2025-08-15 06:19:01 |
| pybandits |
4.0.13 |
Python Multi-Armed Bandit Library |
2025-08-14 08:27:16 |
| openrubricrl |
0.1.0 |
Open-source pipeline that converts human-written rubrics into LLM-based reward functions for RL and RLHF training |
2025-08-13 23:34:29 |
| mab-lite-dhruv |
0.1.2 |
Minimal multi-armed bandit helpers: pure exploration & pure exploitation. |
2025-08-11 03:20:48 |
| metasim-core |
0.1.0 |
Meta-Sim: A unified simulation framework for robotics |
2025-08-10 09:41:06 |
| abundant-sdk |
0.2.4 |
Python SDK for the Abundant Environment API - simulation environments for RL agent training |
2025-08-07 18:52:18 |
| reinforced-lib |
1.2.3 |
Reinforcement learning library |
2025-07-30 10:45:52 |
| autospice |
0.1.5 |
SPICE: Sparse and Interpretable Cognitive Equations |
2025-07-23 16:25:37 |
| rlgym-learn-algos |
0.2.5 |
Algorithm implementations for rlgym-learn |
2025-07-23 03:46:59 |
| toulouse |
1.1.1 |
High-performance card library for ML and RL applications. |
2025-07-21 16:04:10 |
| gama-gymnasium |
0.1.1 |
A Gymnasium environment for reinforcement learning with GAMA agent-based simulations |
2025-07-17 04:09:06 |
| arc-advisor |
0.1.0 |
The learning co-pilot for AI agents. Implements the Executor-Advisor pattern for building self-improving agentic systems. |
2025-07-15 02:17:42 |
| congruent |
0.0.8 |
A CLI for interacting with LLMs, with tracking, validation, integrations, and UI. |
2025-02-12 10:50:00 |
| pokerkit |
0.6.1 |
An open-source Python library for poker game simulations, hand evaluations, and statistical analysis |
2025-01-21 03:34:18 |
| rlportfolio |
0.2.1 |
Reinforcement learning framework for portfolio optimization tasks. |
2025-01-16 04:39:00 |