# ModelHub SDK

ModelHub SDK is a tool for orchestrating and managing machine learning workflows, experiments, datasets, and deployments on Kubernetes. It integrates with MLflow and supports custom pipelines, dataset management, model logging, and serving through KServe.

![Python Version](https://img.shields.io/badge/Python-3.9+-blue?style=for-the-badge&logo=python)
![PyPI Version](https://img.shields.io/pypi/v/autonomize-model-sdk?style=for-the-badge&logo=pypi)
![Code Formatter](https://img.shields.io/badge/code%20style-black-000000.svg?style=for-the-badge)
![Code Linter](https://img.shields.io/badge/linting-pylint-green.svg?style=for-the-badge)
![Code Checker](https://img.shields.io/badge/mypy-checked-blue?style=for-the-badge)
![Code Coverage](https://img.shields.io/badge/coverage-96%25-a4a523?style=for-the-badge&logo=codecov)

## Table of Contents

1. [Installation](#installation)
2. [Environment Setup](#environment-setup)
3. [Quickstart](#quickstart)
4. [Experiments and Runs](#experiments-and-runs)
    - [Logging Parameters and Metrics](#logging-parameters-and-metrics)
    - [Artifact Management](#artifact-management)
5. [Pipeline Management](#pipeline-management)
    - [Pipeline Definition](#pipeline-definition)
    - [Running a Pipeline](#running-a-pipeline)
6. [Dataset Management](#dataset-management)
    - [Loading Datasets](#loading-datasets)
7. [Model Deployment through KServe](#model-deployment-through-kserve)
8. [Examples](#examples)

---

## Installation

To install the ModelHub SDK, simply run:

```bash
pip install autonomize-model-sdk
```

## Environment Setup
Ensure you have the following environment variables set in your system:

```bash
export MODELHUB_BASE_URL=https://api-modelhub.example.com
export MODELHUB_CLIENT_ID=your_client_id
export MODELHUB_CLIENT_SECRET=your_client_secret
export MLFLOW_EXPERIMENT_ID=your_experiment_id
```

Alternatively, create a `.env` file in your project directory and define the same variables there.
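
A minimal `.env` mirroring the variables above (loaded with whatever dotenv tooling your project uses, e.g. `python-dotenv`) might look like:

```bash
MODELHUB_BASE_URL=https://api-modelhub.example.com
MODELHUB_CLIENT_ID=your_client_id
MODELHUB_CLIENT_SECRET=your_client_secret
MLFLOW_EXPERIMENT_ID=your_experiment_id
```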

## Quickstart
The ModelHub SDK lets you log experiments, manage pipelines, and work with datasets.

Here’s a quick example of how to initialize the client and log a run:

```python
import os
from modelhub.clients import MLflowClient

# Initialize the ModelHub client
client = MLflowClient(base_url=os.getenv("MODELHUB_BASE_URL"))
experiment_id = os.getenv("MLFLOW_EXPERIMENT_ID")

client.set_experiment(experiment_id=experiment_id)

# Start an MLflow run
with client.start_run(run_name="my_experiment_run"):
    client.mlflow.log_param("param1", "value1")
    client.mlflow.log_metric("accuracy", 0.85)
    client.mlflow.log_artifact("model.pkl")
```

## Experiments and Runs
ModelHub SDK provides an easy way to interact with MLflow for managing experiments and runs.

### Logging Parameters and Metrics
To log parameters, metrics, and artifacts:

```python
with client.start_run(run_name="my_run"):
    # Log parameters
    client.mlflow.log_param("learning_rate", 0.01)

    # Log metrics
    client.mlflow.log_metric("accuracy", 0.92)
    client.mlflow.log_metric("precision", 0.88)

    # Log artifacts
    client.mlflow.log_artifact("/path/to/model.pkl")
```
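
For metrics that evolve over training, such as per-epoch loss, `log_metric` also accepts MLflow's `step` argument:

```python
# Log a metric curve across training epochs (loss values here are illustrative)
with client.start_run(run_name="training_curve"):
    for epoch, loss in enumerate([0.9, 0.7, 0.55, 0.48]):
        client.mlflow.log_metric("loss", loss, step=epoch)
```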

### Artifact Management
You can log or download artifacts with ease:

```python
# Log artifact
client.mlflow.log_artifact("/path/to/file.csv")

# Download artifact
client.mlflow.artifacts.download_artifacts(run_id="run_id_here", artifact_path="artifact.csv", dst_path="/tmp")
```

## Pipeline Management
ModelHub SDK lets you define, manage, and run multi-stage pipelines that automate your machine learning workflow. Pipelines are defined in YAML and submitted through the SDK.

### Pipeline Definition
Here’s a sample `pipeline.yaml` file:

```yaml
name: "ModelHub Pipeline Example"
description: "Pipeline with preprocess, training, and evaluation stages"
experiment_id: "9"
dataset_name: "dataset_name"
image_tag: "base-llm:1.0.1"
stages:
  - name: preprocess
    type: custom
    params:
      data_path: "data"
      output_path: "output"
    script: stages/preprocess.py
    requirements: requirements.txt
    resources:
      cpu: "1"
      memory: "1Gi"

  - name: train
    type: custom
    params:
      data_path: "output/train_preprocessed.csv"
      model_path: "output/model"
    script: stages/train.py
    requirements: requirements.txt
    resources:
      cpu: "1"
      memory: "1Gi"

  - name: evaluate
    type: custom
    params:
      model_path: "output/model"
      eval_output_path: "output/eval"
    script: stages/evaluate.py
    requirements: requirements.txt
    resources:
      cpu: "1"
      memory: "1Gi"
```
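
The exact mechanism the SDK uses to hand `params` to a stage script isn't shown here; as a minimal sketch, assuming parameters arrive as command-line flags, `stages/preprocess.py` could look like:

```python
# stages/preprocess.py -- hypothetical sketch; assumes the pipeline runner passes
# `params` to the script as command-line flags (check the SDK docs for the real contract)
import argparse
import os

import pandas as pd


def main():
    parser = argparse.ArgumentParser(description="Preprocess raw data for training")
    parser.add_argument("--data_path", required=True)
    parser.add_argument("--output_path", required=True)
    args = parser.parse_args()

    os.makedirs(args.output_path, exist_ok=True)

    # "train.csv" is an assumed input filename; the output name matches the
    # train stage's data_path in the pipeline YAML above
    df = pd.read_csv(os.path.join(args.data_path, "train.csv"))
    df = df.dropna()
    df.to_csv(os.path.join(args.output_path, "train_preprocessed.csv"), index=False)


if __name__ == "__main__":
    main()
```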

### Running a Pipeline
To submit and run a pipeline, use the `PipelineManager` from the SDK:

```python
import os

from modelhub.clients import PipelineManager

pipeline_manager = PipelineManager(base_url=os.getenv("MODELHUB_BASE_URL"))

# Start the pipeline
pipeline = pipeline_manager.start_pipeline("pipeline.yaml")
print("Pipeline started:", pipeline)
```

## Dataset Management
ModelHub SDK makes it easy to load and manage datasets, whether the data lives in external storage or in datasets managed through the ModelHub frontend.

### Loading Datasets
To load datasets using the SDK:

```python
from modelhub import load_dataset

# Load a dataset by name
dataset = load_dataset("my_dataset")

# Load a dataset from a specific directory
dataset = load_dataset("my_dataset", directory="data_folder/")
```


## Model Deployment through KServe
Deploy models via KServe after logging them with MLflow:

### Create a model wrapper
Use the MLflow `PythonModel` interface to define your model's prediction logic.

```python
import mlflow.pyfunc
import joblib

class PDFModelWrapper(mlflow.pyfunc.PythonModel):
    def load_context(self, context):
        # Load the serialized model from the artifacts packaged with the MLflow
        # model, so the wrapper also works inside the serving container
        self.model = joblib.load(context.artifacts["model"])

    def predict(self, context, model_input):
        # Perform inference
        return self.model.predict(model_input)

# Log the model, bundling the serialized model file as an artifact
mlflow.pyfunc.log_model(
    artifact_path="xgboost_model",
    python_model=PDFModelWrapper(),
    artifacts={"model": "/path/to/xgboost_model.pkl"},
)
```
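
Before deploying, you can sanity-check the logged model by loading it back through the pyfunc interface (the run ID and feature columns below are placeholders, and your MLflow tracking URI is assumed to be configured):

```python
import mlflow.pyfunc
import pandas as pd

# "<run_id>" is a placeholder for the MLflow run that logged the model
model = mlflow.pyfunc.load_model("runs:/<run_id>/xgboost_model")

# Column names are illustrative; use your model's actual feature schema
sample = pd.DataFrame({"feature_a": [0.1], "feature_b": [0.2]})
print(model.predict(sample))
```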

### Deploy with KServe
After logging the model, deploy it with KServe to expose a REST endpoint for inference:

```yaml
apiVersion: "serving.kserve.io/v1beta1"
kind: "InferenceService"
metadata:
  name: "re"
  namespace: "modelhub"
  labels:
    azure.workload.identity/use: "true"
spec:
  predictor:
    model:
      modelFormat:
        name: mlflow
      protocolVersion: v2
      storageUri: "https://autonomizestorageaccount.blob.core.windows.net/mlflow/27/e5edc75c09d9470dadc42bd301ee8a8f/artifacts/reinfer_model"
      resources:
        limits:
          cpu: "3"
          memory: "16Gi"
          # nvidia.com/gpu: "1"
        requests:
          cpu: "3"
          memory: "16Gi"
    serviceAccountName: "genesis-platform-sa"
    tolerations:
      - key: "sku"
        operator: "Equal"
        value: "gpu"
        effect: "NoSchedule"
```
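
Once the `InferenceService` is ready, KServe exposes a v2-protocol REST endpoint at `/v2/models/<name>/infer`. The host below is a placeholder, and the input name, shape, and datatype depend on your model's signature:

```python
import requests

# Placeholder host; KServe assigns each InferenceService its own URL
url = "http://re.modelhub.example.com/v2/models/re/infer"

# KServe v2 inference payload (adjust name/shape/datatype to your model)
payload = {
    "inputs": [
        {"name": "input-0", "shape": [1, 2], "datatype": "FP32", "data": [0.1, 0.2]}
    ]
}

response = requests.post(url, json=payload, timeout=30)
print(response.json())
```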

## Examples
Here are additional examples to help you get started.

### Logging Training and Evaluation Runs

```python
import joblib
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from xgboost import XGBClassifier

# Example data; replace with your own features and labels
X, y = make_classification(n_samples=500, n_features=8, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

model = XGBClassifier()

with client.start_run(run_name="Training Run"):
    model.fit(X_train, y_train)
    accuracy = model.score(X_test, y_test)

    # Log parameters and metrics
    client.mlflow.log_param("model", "XGBoost")
    client.mlflow.log_metric("accuracy", accuracy)

    # Save and log the model
    joblib.dump(model, "xgboost_model.pkl")
    client.mlflow.log_artifact("xgboost_model.pkl")
```

### Managing Datasets
```python
import pandas as pd

from modelhub import load_dataset

# Load dataset
dataset = load_dataset("my_custom_dataset", version=2)

# Convert to pandas
df = pd.DataFrame(dataset["train"])

# Perform operations on the dataset
print(df.head())
```

### Using Blob Storage for Datasets

```python
# Path to the blob storage configuration file (YAML)
blob_storage_config = "blob_storage_config.yaml"

# Load dataset from blob storage
dataset = load_dataset("my_dataset", blob_storage_config=blob_storage_config)
```

### Submitting a Pipeline
```python
import os

from modelhub.clients import PipelineManager

# Submit and start the pipeline
pipeline_manager = PipelineManager(base_url=os.getenv("MODELHUB_BASE_URL"))
pipeline = pipeline_manager.start_pipeline("pipeline.yaml")

print(f"Pipeline started with ID: {pipeline['id']}")
```

## Feedback & Contributions
Feel free to raise issues, submit PRs, or suggest features for the ModelHub SDK on our GitHub repository.

For feedback or support, please reach out to the ModelHub team directly.