apicrusher

Name	apicrusher JSON
Version	2.2.1 JSON
	download
home_page	https://apicrusher.com
Summary	Cut AI API costs by 63-99% with intelligent routing across all AI providers
upload_time	2025-09-08 19:15:59
maintainer	None
docs_url	None
author	APICrusher
requires_python	>=3.7
license	MIT
keywords	openai anthropic google ai api optimization cost-reduction llm gpt claude gemini gpt-5 gpt-4 claude-4 api-optimization ai-cost-reduction universal-ai groq mistral cohere deepseek llama
VCS
bugtrack_url
requirements	No requirements were recorded.
Travis-CI	No Travis.
coveralls test coverage	No coveralls.

            # APICrusher - Cut AI API Costs by 63-99%

Stop bleeding money on AI APIs. APICrusher automatically routes requests to the cheapest capable model and caches responses, cutting costs by 63-99% with just 2 lines of code.

## 🚀 Quick Start

```bash
# Python 3.7+ required
# Use a virtual environment (recommended for all Python packages)
python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install with your provider(s)
pip install apicrusher[standard]  # OpenAI + Redis caching (recommended)
# OR
pip install apicrusher[all]       # All providers (OpenAI, Anthropic, Google, etc.)
```

## 📋 Installation Options

```bash
# Choose based on your needs:
pip install apicrusher[standard]   # OpenAI + Redis (most users)
pip install apicrusher[openai]     # Just OpenAI support
pip install apicrusher[anthropic]  # Just Anthropic support
pip install apicrusher[google]     # Just Google support
pip install apicrusher[all]        # Everything - all providers
pip install apicrusher              # Minimal - add providers later
```

### Virtual Environment Required
Modern Python systems require virtual environments for pip packages:
```bash
# If you see "externally-managed-environment" error:
python3 -m venv venv
source venv/bin/activate
pip install apicrusher[standard]
```

### API Keys Required
APICrusher optimizes your existing AI API calls. You need:
1. **Your AI provider API key** (OpenAI, Anthropic, etc.) - Keep using your existing keys
2. **An APICrusher optimization key** from [apicrusher.com](https://apicrusher.com) - Enables cost optimization

### How It Works
APICrusher is a smart proxy layer. You keep your existing API keys. We analyze each request and route it to the optimal model. Your API keys never leave your server.

## 💻 Basic Usage

```python
# Before (expensive)
from openai import OpenAI
client = OpenAI(api_key="sk-...")  # Your OpenAI key

# After (63-99% cheaper)
from apicrusher import OpenAI
client = OpenAI(
    openai_api_key="sk-...",        # Your existing OpenAI key
    apicrusher_key="apc_live_..."   # Add optimization key
)

# Your code stays exactly the same
response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello!"}]
)
```

## 💰 How Much Can You Save?

### Single Provider vs Cross-Provider
- **Single Provider** (e.g., just OpenAI): 70-85% savings
- **Cross-Provider** (e.g., OpenAI + Anthropic): Up to 99% savings

| Your Current Usage | Monthly Cost | With APICrusher | You Save |
|-------------------|--------------|-----------------|----------|
| GPT-4 for everything | $1,000 | $180 | $820 (82%) |
| Mixed GPT-4/3.5 | $500 | $95 | $405 (81%) |
| Heavy API usage | $5,000 | $750 | $4,250 (85%) |
| Long conversations | $2,000 | $340 | $1,660 (83%) |

## 🎯 Core Features

### Cross-Provider Optimization (NEW in v2.0)

Get 99% savings by routing between providers:

```python
from apicrusher import OpenAI

client = OpenAI(
    openai_api_key="sk-...",             # Your OpenAI key
    anthropic_api_key="sk-ant-...",      # Optional: Add for 99% savings
    google_api_key="...",                # Optional: Add Google key
    apicrusher_key="apc_live_..."    
)

# Simple GPT-4 queries now route to Claude Haiku automatically (99% cheaper)
# Complex queries stay on GPT-4 to preserve quality
```

### Universal Provider Support
Works with ALL major AI providers and models:
- **OpenAI**: GPT-5, GPT-4, GPT-4o, O1, O3 (all current models)
- **Anthropic**: Claude Opus 4.1, Claude Sonnet 4, Claude 3.5
- **Google**: Gemini 2.0, Gemini 1.5 Pro/Flash
- **Meta**: Llama 3.3, Llama 3.2, Code Llama
- **Others**: Groq, Cohere, Mistral, DeepSeek, and more

### Intelligent Model Routing
- Simple queries → Cheap models (gpt-4o-mini)
- Complex queries → Premium models (GPT-5, Claude Opus 4.1)
- Automatic quality preservation

### Smart Caching
- Deduplicates identical requests
- Redis + in-memory fallback
- 33% average cache hit rate

### 🆕 Context Compression (NEW!)
**Stop paying to reprocess the same conversation 50 times:**

```python
# Enable context compression for long conversations
response = client.chat.completions.create(
    model="gpt-4",
    messages=conversation_history,  # 50 messages = 15,000 tokens normally
    compress_context=True  # Reduces to ~3,000 tokens automatically
)

# Features:
# - Summarizes older messages while preserving key decisions
# - Removes duplicate context automatically  
# - Compresses code blocks by 40-60%
# - Sends only deltas for continuing conversations
# - Preserves last 3 messages in full for accuracy
```

**Context Compression Savings Example:**
- Normal 20-message conversation: 150,000 tokens ($2.25)
- With compression: 35,000 tokens ($0.52)
- **Savings: 77% on long conversations**

## 🏢 Enterprise Security & Controls (v2.0)

### Security Features
- **IP Allowlisting**: Restrict API keys to specific IP ranges
- **Audit Logging**: Complete usage trail for compliance (SOC2 Type II)
- **API Key Rotation**: Rotate keys without service interruption
- **Email Verification**: Secure access control for all users
- **Role-Based Access**: Admin and user permissions for teams

### Business Controls
- **Usage Quotas**: Configurable limits per tier
  - Trial: 1,000 calls/day
  - Professional: 10,000 calls/day
  - Enterprise: Unlimited with alerts
- **80% Alerts**: Email warnings before quota exceeded
- **Overage Protection**: Prevent unexpected bills
- **Team Management**: Multiple users under one billing account
- **Monthly Reports**: Automated ROI reports for finance teams

### Reliability & Monitoring
- **Webhook Retry Queue**: Never miss critical payment events
- **Health Monitoring**: Real-time system status at `/metrics`
- **Automatic Failover**: Cross-provider redundancy
- **99.9% Uptime SLA**: For enterprise customers
- **Dedicated Support**: Priority response for business accounts

### Compliance
- **SOC2 Type II**: Security audit compliant
- **GDPR Ready**: Data processing agreements available
- **HIPAA Compatible**: With enterprise agreement
- **Self-Hosted Option**: Deploy in your own VPC for maximum control

## 📊 Analytics & Reporting

```python
# Get detailed savings report
client.print_savings_summary()

# Output:
# 💸 Total Saved: $127.43
# 📞 Total Calls: 1,432
# 💾 Cache Hit Rate: 34.2%
# ⚡ Optimization Rate: 91.3%
```

### Executive Dashboard
- Real-time cost savings visualization
- Model routing analytics
- Usage patterns and trends
- Export to CSV/Excel for finance teams
- Monthly ROI reports via email

## 🔧 Advanced Usage

### Multi-Provider Setup
```python
from apicrusher import OpenAI

# Install with: pip install apicrusher[all]
client = OpenAI(
    openai_api_key="sk-...",
    anthropic_api_key="sk-ant-...",
    google_api_key="...",
    groq_api_key="...",
    apicrusher_key="apc_..."
)

# Automatically routes to cheapest provider
response = client.chat.completions.create(
    model="gpt-4",  # Will use cheapest capable model
    messages=[{"role": "user", "content": "Format this date: 2024-01-01"}]
)
```

### Context Compression Options
```python
# Fine-tune compression behavior
response = client.chat.completions.create(
    model="gpt-4",
    messages=long_conversation,
    compress_context=True,
    compression_threshold=10,  # Start compressing after 10 messages
    preserve_recent=5  # Keep last 5 messages uncompressed
)
```

### Manual Optimization Control
```python
# Force specific model
response = client.chat.completions.create(
    model="gpt-4o-mini",  # Use this exact model
    messages=messages,
    skip_optimization=True  # Bypass routing logic
)
```

### Enterprise Configuration
```python
# Configure enterprise features
client = OpenAI(
    openai_api_key="sk-...",
    apicrusher_key="apc_enterprise_...",
    config={
        "ip_allowlist": ["192.168.1.0/24"],
        "audit_logging": True,
        "usage_quota": 50000,  # Daily limit
        "alert_threshold": 0.8,  # Alert at 80% usage
        "team_id": "eng-team-01"
    }
)
```

## 📊 Real-World Results

Based on actual customer usage:

- **E-commerce company**: Reduced costs from $8,400/mo to $1,260/mo (85% savings)
- **SaaS startup**: Cut API bills from $3,200/mo to $480/mo (85% savings)  
- **AI coding assistant**: Dropped from $12,000/mo to $2,400/mo (80% savings)
- **Customer support platform**: Saved $47,000/year with context compression
- **Data analytics firm**: 99% reduction using cross-provider routing

## 🛡️ Security & Privacy

- **Your API keys stay local** - Never sent to our servers
- **No prompt logging** - Your data remains private
- **Open source core** - Audit the optimization logic
- **SOC2 compliant** - Enterprise-ready security
- **IP allowlisting** - Restrict access to your network
- **Audit trails** - Complete usage history for compliance

## 🚀 Getting Started

1. **Install**: `pip install apicrusher[standard]`
2. **Get your key**: Sign up at [apicrusher.com](https://apicrusher.com)
3. **Add 2 lines**: Replace your import and add your key
4. **Save money**: Watch your costs drop by 63-99%

## 💰 Pricing

- **Free Trial**: 7 days, no credit card required
- **Professional**: $99/month (pays for itself in hours)
- **Enterprise**: Custom pricing for high-volume usage

Most customers save 10-50x the subscription cost in the first month.

## 🤝 Support

- Email: hello@apicrusher.com
- Documentation: [apicrusher.com/docs](https://apicrusher.com/docs)
- Enterprise Support: Priority response with SLA

## License

MIT License - Use it however you want.

---

## 📝 Changelog

### [2.2.1] - 2025-01-15
#### 📚 Documentation & PyPI Updates

**Fixed**
- **PyPI Documentation**: Removed private GitHub URLs from package metadata
- **Documentation Link**: Fixed broken `/documentation` link (now `/docs`)
- **Changelog Integration**: Added complete version history to PyPI README

**Improved**
- **Package Metadata**: Professional project URLs for better PyPI presentation
- **Installation Instructions**: Clearer guidance for virtual environments
- **Version History**: Full changelog now visible on PyPI without GitHub access

### [2.2.0] - 2025-01-10
#### 🚀 2025 Model Support & Pricing Updates

**Added**
- **GPT-5 Models**: Full support with aggressive 2025 pricing ($1.25/$10 per M tokens)
- **Claude Opus 4.1 & Sonnet 4**: Latest Anthropic models ($15/$75 and $3/$15)
- **Gemini 2.5 Flash-Lite**: Ultra-cheap Google model ($0.10/$0.40)
- **xAI (Grok) Integration**: Complete Grok model family support
- **Mistral Medium 3**: Competitive pricing at $0.40/$2

**Updated**
- **All Model Pricing**: Updated to January 2025 rates across all providers
- **Routing Logic**: Optimized for new model tiers (nano, mini, standard, pro, ultra)
- **Fallback Models**: Now use cheapest 2025 options (GPT-5-nano, Flash-Lite)
- **Cost Calculations**: Accurate for all 100+ supported models

**Improved**
- **Provider Detection**: Better handling of new model naming schemes
- **Cross-Provider Routing**: Enhanced for maximum 99% savings
- **Documentation**: Updated with current model availability

### [2.0.1] - 2025-01-13
#### 🔧 Critical Fixes & Dependency Management

**Fixed**
- **Model Availability Checker**: Fixed OpenAI SDK v1.0+ compatibility issues
- **Deprecation Warnings**: Resolved confusing "model deprecated" messages for working models
- **Dependency Installation**: OpenAI SDK now installs automatically (no more `pip install apicrusher openai`)

**Added**
- **Smart Dependencies**: Flexible installation options:
  - `pip install apicrusher[standard]` - OpenAI + Redis (recommended)
  - `pip install apicrusher[all]` - All providers
  - `pip install apicrusher[openai]` - Just OpenAI
  - `pip install apicrusher[anthropic]` - Just Anthropic
  - `pip install apicrusher[google]` - Just Google
- **Comprehensive Model Support**: Updated for all 2025 models:
  - GPT-5 family (released August 2025)
  - Claude Opus 4.1 (released August 2025)
  - Gemini 2.0 Pro
  - Llama 3.3
  - DeepSeek R1/V3
  - 100+ models across 15 providers

**Improved**
- **Installation Experience**: Clear installation paths for different use cases
- **Model Fallback Logic**: Smarter routing when models are unavailable
- **Documentation**: Updated README with correct installation instructions
- **Error Messages**: Clearer guidance when providers aren't configured

### [2.0.0] - 2025-01-12
#### 🎯 Major Release - Enterprise Security & Business Features

**Added - Critical Security Features**
- **Usage Limits & Quotas**: Per-tier API call limits (Trial: 1k/day, Pro: 10k/day, Enterprise: 100k/day)
- **Customer-Specific Rate Limiting**: Redis-backed per-customer rate limits with graceful fallback
- **Email Verification System**: Optional 6-digit code verification for new signups
- **API Key Rotation**: Secure key rotation with audit trail for compliance
- **Enterprise IP Allowlisting**: Restrict API access to specific IPs for high-security customers
- **Audit Logging**: Complete compliance trail for all key actions (login, rotation, subscription changes)
- **Failed Attempt Monitoring**: Track and alert on potential brute force attacks
- **Enhanced Key Security**: 32-character keys with SHA256 checksum validation

**Added - Business Logic**
- **Payment Webhook Handling**: Complete Stripe webhook processing with retry queue
- **Automatic Payment Recovery**: 3-day grace period for failed payments before suspension
- **Usage Alerts**: Email notifications at 80% and 95% of monthly limits
- **Model Availability Checking**: Track deprecated models with automatic fallback
- **Webhook Retry Queue**: Exponential backoff for failed webhook processing
- **Monthly Usage Reports**: Automated ROI reports with export functionality
- **Trial Ending Reminders**: Automated emails 2-3 days before trial ends

**Added - Dashboard Features**
- **Executive Analytics Dashboard**: Real-time savings visualization with ROI metrics
- **CSV/Excel Export**: One-click export of usage data for finance teams
- **Usage Percentage Display**: Visual indicators of quota consumption
- **Billing Portal Integration**: Self-service subscription management via Stripe

**Added - Infrastructure**
- **PostgreSQL Support**: Production-ready database with migration support
- **Background Scheduler**: Automated daily/monthly tasks without external cron
- **Prometheus Metrics**: `/metrics` endpoint for monitoring (MRR, usage, security)
- **Health Check Endpoint**: Database connectivity and system status monitoring
- **Sentry Integration**: Error tracking and performance monitoring in production
- **Admin Stats Endpoint**: Quick MRR and usage metrics for business monitoring

**Improved**
- **Better Error Handling**: Graceful degradation when Redis/Stripe unavailable
- **Database Migration Support**: Automatic column addition for existing deployments
- **Security Monitoring**: Real-time detection of suspicious access patterns
- **Cross-Provider Optimization**: Beta feature for 99% savings by routing between providers

**Fixed**
- **Memory Leak**: Fixed in-memory storage losing data on deploy
- **Timing Attacks**: Added deliberate delays in key validation
- **SQL Injection**: All queries now properly parameterized
- **Missing Columns**: Auto-adds required columns to existing databases

**Security**
- **FORCE HTTPS**: Automatic redirect in production
- **Secure Cookies**: HttpOnly, Secure, SameSite flags on all sessions
- **Rate Limiting**: Different limits for different endpoints
- **Input Validation**: Comprehensive validation on all user inputs
- **Sensitive Data Filtering**: Automatic redaction in logs and error reports

### [1.3.2] - 2025-01-07
**Fixed**
- Indentation bug in SDK core that prevented proper imports
- PyPI package now properly installable

### [1.3.0] - 2025-01-06
**Added**
- Cross-provider optimization for 99% savings
- Support for 30+ models across 15 providers
- Future-proof architecture for upcoming models

### [1.2.0] - 2025-01-05
**Added**
- Context compression feature reducing token usage by 40-77% on long conversations
- Automatic summarization of older messages
- Delta-only updates for continuing conversations
- Code block compression removing comments and whitespace
- `compress_context` parameter for automatic optimization

**Improved**
- Better handling of long conversation contexts
- Reduced API costs for chat applications by $5+ per conversation

### [1.1.0] - 2025-01-04
**Added**
- Universal provider support (OpenAI, Anthropic, Google, Groq, Cohere, Meta, Mistral)
- Intelligent complexity detection for routing decisions
- Redis caching with in-memory fallback

### [1.0.1] - 2025-01-03
**Fixed**
- Minor bug fixes in routing logic
- Documentation improvements

### [1.0.0] - 2025-01-02
**Initial Release**
- Basic OpenAI optimization
- Model routing (GPT-4 → GPT-4o-mini)
- Simple caching system
- 63-85% cost savings proven

---

**Stop bleeding money on AI APIs. Start saving with APICrusher today.**

[Get Your Key →](https://apicrusher.com)

Raw data

            {
    "_id": null,
    "home_page": "https://apicrusher.com",
    "name": "apicrusher",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.7",
    "maintainer_email": null,
    "keywords": "openai, anthropic, google, ai, api, optimization, cost-reduction, llm, gpt, claude, gemini, gpt-5, gpt-4, claude-4, api-optimization, ai-cost-reduction, universal-ai, groq, mistral, cohere, deepseek, llama",
    "author": "APICrusher",
    "author_email": "hello@apicrusher.com",
    "download_url": "https://files.pythonhosted.org/packages/67/00/270ae9f688a512d6c02bdd6ec3185e4147b26fb534fde2878d8bae0a90bd/apicrusher-2.2.1.tar.gz",
    "platform": "any",
    "description": "# APICrusher - Cut AI API Costs by 63-99%\n\nStop bleeding money on AI APIs. APICrusher automatically routes requests to the cheapest capable model and caches responses, cutting costs by 63-99% with just 2 lines of code.\n\n## \ud83d\ude80 Quick Start\n\n```bash\n# Python 3.7+ required\n# Use a virtual environment (recommended for all Python packages)\npython3 -m venv venv\nsource venv/bin/activate  # On Windows: venv\\Scripts\\activate\n\n# Install with your provider(s)\npip install apicrusher[standard]  # OpenAI + Redis caching (recommended)\n# OR\npip install apicrusher[all]       # All providers (OpenAI, Anthropic, Google, etc.)\n```\n\n## \ud83d\udccb Installation Options\n\n```bash\n# Choose based on your needs:\npip install apicrusher[standard]   # OpenAI + Redis (most users)\npip install apicrusher[openai]     # Just OpenAI support\npip install apicrusher[anthropic]  # Just Anthropic support\npip install apicrusher[google]     # Just Google support\npip install apicrusher[all]        # Everything - all providers\npip install apicrusher              # Minimal - add providers later\n```\n\n### Virtual Environment Required\nModern Python systems require virtual environments for pip packages:\n```bash\n# If you see \"externally-managed-environment\" error:\npython3 -m venv venv\nsource venv/bin/activate\npip install apicrusher[standard]\n```\n\n### API Keys Required\nAPICrusher optimizes your existing AI API calls. You need:\n1. **Your AI provider API key** (OpenAI, Anthropic, etc.) - Keep using your existing keys\n2. **An APICrusher optimization key** from [apicrusher.com](https://apicrusher.com) - Enables cost optimization\n\n### How It Works\nAPICrusher is a smart proxy layer. You keep your existing API keys. We analyze each request and route it to the optimal model. Your API keys never leave your server.\n\n## \ud83d\udcbb Basic Usage\n\n```python\n# Before (expensive)\nfrom openai import OpenAI\nclient = OpenAI(api_key=\"sk-...\")  # Your OpenAI key\n\n# After (63-99% cheaper)\nfrom apicrusher import OpenAI\nclient = OpenAI(\n    openai_api_key=\"sk-...\",        # Your existing OpenAI key\n    apicrusher_key=\"apc_live_...\"   # Add optimization key\n)\n\n# Your code stays exactly the same\nresponse = client.chat.completions.create(\n    model=\"gpt-4\",\n    messages=[{\"role\": \"user\", \"content\": \"Hello!\"}]\n)\n```\n\n## \ud83d\udcb0 How Much Can You Save?\n\n### Single Provider vs Cross-Provider\n- **Single Provider** (e.g., just OpenAI): 70-85% savings\n- **Cross-Provider** (e.g., OpenAI + Anthropic): Up to 99% savings\n\n| Your Current Usage | Monthly Cost | With APICrusher | You Save |\n|-------------------|--------------|-----------------|----------|\n| GPT-4 for everything | $1,000 | $180 | $820 (82%) |\n| Mixed GPT-4/3.5 | $500 | $95 | $405 (81%) |\n| Heavy API usage | $5,000 | $750 | $4,250 (85%) |\n| Long conversations | $2,000 | $340 | $1,660 (83%) |\n\n## \ud83c\udfaf Core Features\n\n### Cross-Provider Optimization (NEW in v2.0)\n\nGet 99% savings by routing between providers:\n\n```python\nfrom apicrusher import OpenAI\n\nclient = OpenAI(\n    openai_api_key=\"sk-...\",             # Your OpenAI key\n    anthropic_api_key=\"sk-ant-...\",      # Optional: Add for 99% savings\n    google_api_key=\"...\",                # Optional: Add Google key\n    apicrusher_key=\"apc_live_...\"    \n)\n\n# Simple GPT-4 queries now route to Claude Haiku automatically (99% cheaper)\n# Complex queries stay on GPT-4 to preserve quality\n```\n\n### Universal Provider Support\nWorks with ALL major AI providers and models:\n- **OpenAI**: GPT-5, GPT-4, GPT-4o, O1, O3 (all current models)\n- **Anthropic**: Claude Opus 4.1, Claude Sonnet 4, Claude 3.5\n- **Google**: Gemini 2.0, Gemini 1.5 Pro/Flash\n- **Meta**: Llama 3.3, Llama 3.2, Code Llama\n- **Others**: Groq, Cohere, Mistral, DeepSeek, and more\n\n### Intelligent Model Routing\n- Simple queries \u2192 Cheap models (gpt-4o-mini)\n- Complex queries \u2192 Premium models (GPT-5, Claude Opus 4.1)\n- Automatic quality preservation\n\n### Smart Caching\n- Deduplicates identical requests\n- Redis + in-memory fallback\n- 33% average cache hit rate\n\n### \ud83c\udd95 Context Compression (NEW!)\n**Stop paying to reprocess the same conversation 50 times:**\n\n```python\n# Enable context compression for long conversations\nresponse = client.chat.completions.create(\n    model=\"gpt-4\",\n    messages=conversation_history,  # 50 messages = 15,000 tokens normally\n    compress_context=True  # Reduces to ~3,000 tokens automatically\n)\n\n# Features:\n# - Summarizes older messages while preserving key decisions\n# - Removes duplicate context automatically  \n# - Compresses code blocks by 40-60%\n# - Sends only deltas for continuing conversations\n# - Preserves last 3 messages in full for accuracy\n```\n\n**Context Compression Savings Example:**\n- Normal 20-message conversation: 150,000 tokens ($2.25)\n- With compression: 35,000 tokens ($0.52)\n- **Savings: 77% on long conversations**\n\n## \ud83c\udfe2 Enterprise Security & Controls (v2.0)\n\n### Security Features\n- **IP Allowlisting**: Restrict API keys to specific IP ranges\n- **Audit Logging**: Complete usage trail for compliance (SOC2 Type II)\n- **API Key Rotation**: Rotate keys without service interruption\n- **Email Verification**: Secure access control for all users\n- **Role-Based Access**: Admin and user permissions for teams\n\n### Business Controls\n- **Usage Quotas**: Configurable limits per tier\n  - Trial: 1,000 calls/day\n  - Professional: 10,000 calls/day\n  - Enterprise: Unlimited with alerts\n- **80% Alerts**: Email warnings before quota exceeded\n- **Overage Protection**: Prevent unexpected bills\n- **Team Management**: Multiple users under one billing account\n- **Monthly Reports**: Automated ROI reports for finance teams\n\n### Reliability & Monitoring\n- **Webhook Retry Queue**: Never miss critical payment events\n- **Health Monitoring**: Real-time system status at `/metrics`\n- **Automatic Failover**: Cross-provider redundancy\n- **99.9% Uptime SLA**: For enterprise customers\n- **Dedicated Support**: Priority response for business accounts\n\n### Compliance\n- **SOC2 Type II**: Security audit compliant\n- **GDPR Ready**: Data processing agreements available\n- **HIPAA Compatible**: With enterprise agreement\n- **Self-Hosted Option**: Deploy in your own VPC for maximum control\n\n## \ud83d\udcca Analytics & Reporting\n\n```python\n# Get detailed savings report\nclient.print_savings_summary()\n\n# Output:\n# \ud83d\udcb8 Total Saved: $127.43\n# \ud83d\udcde Total Calls: 1,432\n# \ud83d\udcbe Cache Hit Rate: 34.2%\n# \u26a1 Optimization Rate: 91.3%\n```\n\n### Executive Dashboard\n- Real-time cost savings visualization\n- Model routing analytics\n- Usage patterns and trends\n- Export to CSV/Excel for finance teams\n- Monthly ROI reports via email\n\n## \ud83d\udd27 Advanced Usage\n\n### Multi-Provider Setup\n```python\nfrom apicrusher import OpenAI\n\n# Install with: pip install apicrusher[all]\nclient = OpenAI(\n    openai_api_key=\"sk-...\",\n    anthropic_api_key=\"sk-ant-...\",\n    google_api_key=\"...\",\n    groq_api_key=\"...\",\n    apicrusher_key=\"apc_...\"\n)\n\n# Automatically routes to cheapest provider\nresponse = client.chat.completions.create(\n    model=\"gpt-4\",  # Will use cheapest capable model\n    messages=[{\"role\": \"user\", \"content\": \"Format this date: 2024-01-01\"}]\n)\n```\n\n### Context Compression Options\n```python\n# Fine-tune compression behavior\nresponse = client.chat.completions.create(\n    model=\"gpt-4\",\n    messages=long_conversation,\n    compress_context=True,\n    compression_threshold=10,  # Start compressing after 10 messages\n    preserve_recent=5  # Keep last 5 messages uncompressed\n)\n```\n\n### Manual Optimization Control\n```python\n# Force specific model\nresponse = client.chat.completions.create(\n    model=\"gpt-4o-mini\",  # Use this exact model\n    messages=messages,\n    skip_optimization=True  # Bypass routing logic\n)\n```\n\n### Enterprise Configuration\n```python\n# Configure enterprise features\nclient = OpenAI(\n    openai_api_key=\"sk-...\",\n    apicrusher_key=\"apc_enterprise_...\",\n    config={\n        \"ip_allowlist\": [\"192.168.1.0/24\"],\n        \"audit_logging\": True,\n        \"usage_quota\": 50000,  # Daily limit\n        \"alert_threshold\": 0.8,  # Alert at 80% usage\n        \"team_id\": \"eng-team-01\"\n    }\n)\n```\n\n## \ud83d\udcca Real-World Results\n\nBased on actual customer usage:\n\n- **E-commerce company**: Reduced costs from $8,400/mo to $1,260/mo (85% savings)\n- **SaaS startup**: Cut API bills from $3,200/mo to $480/mo (85% savings)  \n- **AI coding assistant**: Dropped from $12,000/mo to $2,400/mo (80% savings)\n- **Customer support platform**: Saved $47,000/year with context compression\n- **Data analytics firm**: 99% reduction using cross-provider routing\n\n## \ud83d\udee1\ufe0f Security & Privacy\n\n- **Your API keys stay local** - Never sent to our servers\n- **No prompt logging** - Your data remains private\n- **Open source core** - Audit the optimization logic\n- **SOC2 compliant** - Enterprise-ready security\n- **IP allowlisting** - Restrict access to your network\n- **Audit trails** - Complete usage history for compliance\n\n## \ud83d\ude80 Getting Started\n\n1. **Install**: `pip install apicrusher[standard]`\n2. **Get your key**: Sign up at [apicrusher.com](https://apicrusher.com)\n3. **Add 2 lines**: Replace your import and add your key\n4. **Save money**: Watch your costs drop by 63-99%\n\n## \ud83d\udcb0 Pricing\n\n- **Free Trial**: 7 days, no credit card required\n- **Professional**: $99/month (pays for itself in hours)\n- **Enterprise**: Custom pricing for high-volume usage\n\nMost customers save 10-50x the subscription cost in the first month.\n\n## \ud83e\udd1d Support\n\n- Email: hello@apicrusher.com\n- Documentation: [apicrusher.com/docs](https://apicrusher.com/docs)\n- Enterprise Support: Priority response with SLA\n\n## License\n\nMIT License - Use it however you want.\n\n---\n\n## \ud83d\udcdd Changelog\n\n### [2.2.1] - 2025-01-15\n#### \ud83d\udcda Documentation & PyPI Updates\n\n**Fixed**\n- **PyPI Documentation**: Removed private GitHub URLs from package metadata\n- **Documentation Link**: Fixed broken `/documentation` link (now `/docs`)\n- **Changelog Integration**: Added complete version history to PyPI README\n\n**Improved**\n- **Package Metadata**: Professional project URLs for better PyPI presentation\n- **Installation Instructions**: Clearer guidance for virtual environments\n- **Version History**: Full changelog now visible on PyPI without GitHub access\n\n### [2.2.0] - 2025-01-10\n#### \ud83d\ude80 2025 Model Support & Pricing Updates\n\n**Added**\n- **GPT-5 Models**: Full support with aggressive 2025 pricing ($1.25/$10 per M tokens)\n- **Claude Opus 4.1 & Sonnet 4**: Latest Anthropic models ($15/$75 and $3/$15)\n- **Gemini 2.5 Flash-Lite**: Ultra-cheap Google model ($0.10/$0.40)\n- **xAI (Grok) Integration**: Complete Grok model family support\n- **Mistral Medium 3**: Competitive pricing at $0.40/$2\n\n**Updated**\n- **All Model Pricing**: Updated to January 2025 rates across all providers\n- **Routing Logic**: Optimized for new model tiers (nano, mini, standard, pro, ultra)\n- **Fallback Models**: Now use cheapest 2025 options (GPT-5-nano, Flash-Lite)\n- **Cost Calculations**: Accurate for all 100+ supported models\n\n**Improved**\n- **Provider Detection**: Better handling of new model naming schemes\n- **Cross-Provider Routing**: Enhanced for maximum 99% savings\n- **Documentation**: Updated with current model availability\n\n### [2.0.1] - 2025-01-13\n#### \ud83d\udd27 Critical Fixes & Dependency Management\n\n**Fixed**\n- **Model Availability Checker**: Fixed OpenAI SDK v1.0+ compatibility issues\n- **Deprecation Warnings**: Resolved confusing \"model deprecated\" messages for working models\n- **Dependency Installation**: OpenAI SDK now installs automatically (no more `pip install apicrusher openai`)\n\n**Added**\n- **Smart Dependencies**: Flexible installation options:\n  - `pip install apicrusher[standard]` - OpenAI + Redis (recommended)\n  - `pip install apicrusher[all]` - All providers\n  - `pip install apicrusher[openai]` - Just OpenAI\n  - `pip install apicrusher[anthropic]` - Just Anthropic\n  - `pip install apicrusher[google]` - Just Google\n- **Comprehensive Model Support**: Updated for all 2025 models:\n  - GPT-5 family (released August 2025)\n  - Claude Opus 4.1 (released August 2025)\n  - Gemini 2.0 Pro\n  - Llama 3.3\n  - DeepSeek R1/V3\n  - 100+ models across 15 providers\n\n**Improved**\n- **Installation Experience**: Clear installation paths for different use cases\n- **Model Fallback Logic**: Smarter routing when models are unavailable\n- **Documentation**: Updated README with correct installation instructions\n- **Error Messages**: Clearer guidance when providers aren't configured\n\n### [2.0.0] - 2025-01-12\n#### \ud83c\udfaf Major Release - Enterprise Security & Business Features\n\n**Added - Critical Security Features**\n- **Usage Limits & Quotas**: Per-tier API call limits (Trial: 1k/day, Pro: 10k/day, Enterprise: 100k/day)\n- **Customer-Specific Rate Limiting**: Redis-backed per-customer rate limits with graceful fallback\n- **Email Verification System**: Optional 6-digit code verification for new signups\n- **API Key Rotation**: Secure key rotation with audit trail for compliance\n- **Enterprise IP Allowlisting**: Restrict API access to specific IPs for high-security customers\n- **Audit Logging**: Complete compliance trail for all key actions (login, rotation, subscription changes)\n- **Failed Attempt Monitoring**: Track and alert on potential brute force attacks\n- **Enhanced Key Security**: 32-character keys with SHA256 checksum validation\n\n**Added - Business Logic**\n- **Payment Webhook Handling**: Complete Stripe webhook processing with retry queue\n- **Automatic Payment Recovery**: 3-day grace period for failed payments before suspension\n- **Usage Alerts**: Email notifications at 80% and 95% of monthly limits\n- **Model Availability Checking**: Track deprecated models with automatic fallback\n- **Webhook Retry Queue**: Exponential backoff for failed webhook processing\n- **Monthly Usage Reports**: Automated ROI reports with export functionality\n- **Trial Ending Reminders**: Automated emails 2-3 days before trial ends\n\n**Added - Dashboard Features**\n- **Executive Analytics Dashboard**: Real-time savings visualization with ROI metrics\n- **CSV/Excel Export**: One-click export of usage data for finance teams\n- **Usage Percentage Display**: Visual indicators of quota consumption\n- **Billing Portal Integration**: Self-service subscription management via Stripe\n\n**Added - Infrastructure**\n- **PostgreSQL Support**: Production-ready database with migration support\n- **Background Scheduler**: Automated daily/monthly tasks without external cron\n- **Prometheus Metrics**: `/metrics` endpoint for monitoring (MRR, usage, security)\n- **Health Check Endpoint**: Database connectivity and system status monitoring\n- **Sentry Integration**: Error tracking and performance monitoring in production\n- **Admin Stats Endpoint**: Quick MRR and usage metrics for business monitoring\n\n**Improved**\n- **Better Error Handling**: Graceful degradation when Redis/Stripe unavailable\n- **Database Migration Support**: Automatic column addition for existing deployments\n- **Security Monitoring**: Real-time detection of suspicious access patterns\n- **Cross-Provider Optimization**: Beta feature for 99% savings by routing between providers\n\n**Fixed**\n- **Memory Leak**: Fixed in-memory storage losing data on deploy\n- **Timing Attacks**: Added deliberate delays in key validation\n- **SQL Injection**: All queries now properly parameterized\n- **Missing Columns**: Auto-adds required columns to existing databases\n\n**Security**\n- **FORCE HTTPS**: Automatic redirect in production\n- **Secure Cookies**: HttpOnly, Secure, SameSite flags on all sessions\n- **Rate Limiting**: Different limits for different endpoints\n- **Input Validation**: Comprehensive validation on all user inputs\n- **Sensitive Data Filtering**: Automatic redaction in logs and error reports\n\n### [1.3.2] - 2025-01-07\n**Fixed**\n- Indentation bug in SDK core that prevented proper imports\n- PyPI package now properly installable\n\n### [1.3.0] - 2025-01-06\n**Added**\n- Cross-provider optimization for 99% savings\n- Support for 30+ models across 15 providers\n- Future-proof architecture for upcoming models\n\n### [1.2.0] - 2025-01-05\n**Added**\n- Context compression feature reducing token usage by 40-77% on long conversations\n- Automatic summarization of older messages\n- Delta-only updates for continuing conversations\n- Code block compression removing comments and whitespace\n- `compress_context` parameter for automatic optimization\n\n**Improved**\n- Better handling of long conversation contexts\n- Reduced API costs for chat applications by $5+ per conversation\n\n### [1.1.0] - 2025-01-04\n**Added**\n- Universal provider support (OpenAI, Anthropic, Google, Groq, Cohere, Meta, Mistral)\n- Intelligent complexity detection for routing decisions\n- Redis caching with in-memory fallback\n\n### [1.0.1] - 2025-01-03\n**Fixed**\n- Minor bug fixes in routing logic\n- Documentation improvements\n\n### [1.0.0] - 2025-01-02\n**Initial Release**\n- Basic OpenAI optimization\n- Model routing (GPT-4 \u2192 GPT-4o-mini)\n- Simple caching system\n- 63-85% cost savings proven\n\n---\n\n**Stop bleeding money on AI APIs. Start saving with APICrusher today.**\n\n[Get Your Key \u2192](https://apicrusher.com)\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Cut AI API costs by 63-99% with intelligent routing across all AI providers",
    "version": "2.2.1",
    "project_urls": {
        "Changelog": "https://pypi.org/project/apicrusher/",
        "Documentation": "https://apicrusher.com/documentation",
        "Homepage": "https://apicrusher.com",
        "Website": "https://apicrusher.com"
    },
    "split_keywords": [
        "openai",
        " anthropic",
        " google",
        " ai",
        " api",
        " optimization",
        " cost-reduction",
        " llm",
        " gpt",
        " claude",
        " gemini",
        " gpt-5",
        " gpt-4",
        " claude-4",
        " api-optimization",
        " ai-cost-reduction",
        " universal-ai",
        " groq",
        " mistral",
        " cohere",
        " deepseek",
        " llama"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "9c826e19be232c03ef49d5b5e45ee79b0dbd9830bfdfe6b377678a7efa6983ec",
                "md5": "6eac43913c51b0a306febcbaa3ec6eae",
                "sha256": "4155a4e6eb95a8bff380682078d9ad5fe1c6076a2044252a991ea5ca2d0acf3e"
            },
            "downloads": -1,
            "filename": "apicrusher-2.2.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "6eac43913c51b0a306febcbaa3ec6eae",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.7",
            "size": 25313,
            "upload_time": "2025-09-08T19:15:58",
            "upload_time_iso_8601": "2025-09-08T19:15:58.273026Z",
            "url": "https://files.pythonhosted.org/packages/9c/82/6e19be232c03ef49d5b5e45ee79b0dbd9830bfdfe6b377678a7efa6983ec/apicrusher-2.2.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "6700270ae9f688a512d6c02bdd6ec3185e4147b26fb534fde2878d8bae0a90bd",
                "md5": "be51275bf018b281353e0d9c488a0f58",
                "sha256": "456facf4396aabedf31705a4855bf091cdd9a489358816ebd3dcec7d3241ee80"
            },
            "downloads": -1,
            "filename": "apicrusher-2.2.1.tar.gz",
            "has_sig": false,
            "md5_digest": "be51275bf018b281353e0d9c488a0f58",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.7",
            "size": 32116,
            "upload_time": "2025-09-08T19:15:59",
            "upload_time_iso_8601": "2025-09-08T19:15:59.678072Z",
            "url": "https://files.pythonhosted.org/packages/67/00/270ae9f688a512d6c02bdd6ec3185e4147b26fb534fde2878d8bae0a90bd/apicrusher-2.2.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-09-08 19:15:59",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "apicrusher"
}

APICrusher