One API for LLMs.

The one platform that devs, CFOs and security teams love.

no card · $5 free tier

works with OpenAIAnthropicDeepSeekGLMxAI

01 — At scale

Learn from thousands of agents that your users run.

Fan out to thousands of sessions at once. Click any agent and watch its session — the chat, the model, the compute, the cost.

02 — The mix

Stop running everything on the most expensive model.

Only ~a quarter of your work needs the best model. One routing policy fixes the mix.

BEFORE · all frontier

10,000 sessions$4,000 / mo

AFTER · right-sized

same 10,000 sessions$1,540 / mo · −62%

Browse routing policies → What it saves you

03 — Anywhere

Local or cloud.

Hosted model sessions work instantly. When the session needs your repo, terminal, tools, or CLI agents, attach the relay and keep the same session boundary.

Models

Claude Opus 4.8

GPT-5

Grok 4

Gemma 4

Runtimes

Claude Code

Codex CLI

OpenAI SDK

xAI

Compute

Vercel

E2B

Cloudflare

Modal

your machine

no card · $5 free tier