AI Infrastructure Platform

Hanzo Cloud
AI at Scale

Unified gateway to 100+ AI providers. Usage analytics, team management, and API key provisioning — all in one platform.

api.cloud.hanzo.ai
curl https://api.cloud.hanzo.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-hanzo-..." \
  -d '{
    "model": "claude-sonnet-4-5-20250929",
    "messages": [{"role": "user", "content": "Hello"}]
  }'
100+
AI Providers
<50ms
Proxy Latency
99.9%
Uptime SLA
5000
Req/s Capacity

Everything you need for AI infrastructure

From a single API key to a full enterprise deployment. Hanzo Cloud scales with your team.

LLM Gateway

Unified proxy for 100+ AI providers. OpenAI, Anthropic, Together, Deepseek, and more through a single API.

Usage Analytics

Real-time token usage, cost tracking, and performance metrics. Per-model, per-team breakdowns with ClickHouse-powered dashboards.

Team Management

Organizations, teams, and role-based access control. SSO via Hanzo IAM with granular permissions per API key.

API Key Management

Create, rotate, and scope API keys. Set rate limits, model access, and budget caps per key.

Security & Compliance

End-to-end encryption, audit logs, and SOC 2 compliance. PII redaction and content filtering built in.

Rate Limiting & Caching

Intelligent response caching with Redis. Global and per-key rate limits. Automatic retry with exponential backoff.

Built for production

Enterprise-grade infrastructure running on Kubernetes with automatic scaling, failover, and observability.

Request Flow
api.cloud.hanzo.ai
Cloudflare Edge + TLS
API Gateway
Auth, rate limiting, caching
LLM Router
Provider selection, load balancing, fallback
AI Provider
OpenAI, Anthropic, Together, Deepseek...
Analytics Pipeline
Langfuse → ClickHouse → Dashboards

Ready to get started?

Sign in to the Hanzo Cloud Console to manage your AI infrastructure. Create API keys, monitor usage, and manage your team.