One API key, every frontier model. Same context windows, same SDKs, up to 80% below official pricing. New models are added within days of release.
Anthropic's most capable model — deepest reasoning, best-in-class coding and agentic performance, with effort control for cost tuning.
Anthropic's newest 2026 model. Frontier coding and agentic capability with a 1M-token context variant for huge codebases and document sets.
The workhorse. Near-Opus quality on most coding tasks at a fifth of the price — the default choice for production workloads.
Small, fast, and shockingly capable — classification, extraction, routing, and high-volume tasks at the lowest Claude price point.
The 1M-token context tier — load entire repositories, hundreds of documents, or week-long agent transcripts into a single request.
OpenAI's flagship reasoning model — strong writing, math, and multimodal performance through the same OpenAI-compatible endpoint.
The widely deployed GPT generation — balanced capability and cost for chat, content, and general-purpose API workloads.
Google's frontier multimodal model — native 1M context, strong video/image understanding, and top-tier benchmark scores.
Google's speed-tier model — 1M context at bargain rates, ideal for high-volume multimodal and retrieval workloads.
One of 2026's strongest open-weight frontier models — competitive coding and agentic scores at a fraction of closed-model prices.