AI Models — Claude, GPT & Gemini API Pricing

Claude Opus 4.8

Anthropic's most capable model — deepest reasoning, best-in-class coding and agentic performance, with effort control for cost tuning.

$15/M in · $75/M out
$3/M in · $15/M out · 80% OFF

Context: 200K tokens

Newest · Anthropic

Claude Fable 5

Anthropic's newest 2026 model. Frontier coding and agentic capability with a 1M-token context variant for huge codebases and document sets.

$15/M in · $75/M out
$3/M in · $15/M out · 80% OFF

Context: 200K · 1M variant

Best value · Anthropic

Claude Sonnet 4.6

The workhorse. Near-Opus quality on most coding tasks at a fifth of the price — the default choice for production workloads.

$3/M in · $15/M out
$0.9/M in · $4.5/M out · 70% OFF

Context: 200K · 1M beta

Fastest · Anthropic

Claude Haiku 4.5

Small, fast, and shockingly capable — classification, extraction, routing, and high-volume tasks at the lowest Claude price point.

$1/M in · $5/M out
$0.4/M in · $2/M out · 60% OFF

Context: 200K tokens

Long context · Anthropic

Claude 1M Context

The 1M-token context tier — load entire repositories, hundreds of documents, or week-long agent transcripts into a single request.

$6/M in · $22.5/M out
$1.8/M in · $6.75/M out · 70% OFF

Context: 1,000,000 tokens

Flagship · OpenAI

GPT-5.5

OpenAI's flagship reasoning model — strong writing, math, and multimodal performance through the same OpenAI-compatible endpoint.

$10/M in · $40/M out
$3/M in · $12/M out · 70% OFF

Context: 256K tokens

Popular · OpenAI

GPT-5

The widely deployed GPT generation — balanced capability and cost for chat, content, and general-purpose API workloads.

$5/M in · $20/M out
$1.5/M in · $6/M out · 70% OFF

Context: 256K tokens

Multimodal · Google

Gemini 3 Pro

Google's frontier multimodal model — native 1M context, strong video/image understanding, and top-tier benchmark scores.

$2.5/M in · $15/M out
$1/M in · $6/M out · 60% OFF

Context: 1M tokens

Fast · Google

Gemini 3 Flash

Google's speed-tier model — 1M context at bargain rates, ideal for high-volume multimodal and retrieval workloads.

$0.5/M in · $3/M out
$0.2/M in · $1.2/M out · 60% OFF

Context: 1M tokens

New · MiniMax

MiniMax M3

One of 2026's strongest open-weight frontier models — competitive coding and agentic scores at a fraction of closed-model prices.

$1.2/M in · $6/M out
$0.5/M in · $2.5/M out · 58% OFF

Context: 1M tokens
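
All prices above are quoted per million tokens, with separate input and output rates. As a rough illustration of how those rates translate into a per-request cost, here is a minimal Python sketch. The function names, the dictionary layout, and the sample token counts are assumptions made for this example and are not part of any provider's API. The rates are the discounted figures from three of the cards above.

```python
# Discounted rates in USD per million tokens (input, output),
# copied from the listings above. The dictionary itself is illustrative.
PRICES_PER_MILLION = {
    "Claude Sonnet 4.6": (0.9, 4.5),
    "Claude Haiku 4.5": (0.4, 2.0),
    "Gemini 3 Flash": (0.2, 1.2),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts."""
    in_rate, out_rate = PRICES_PER_MILLION[model]
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000


def discount_percent(list_price: float, discounted_price: float) -> int:
    """Return the discount as a whole-number percentage of the list price."""
    return round((1 - discounted_price / list_price) * 100)


if __name__ == "__main__":
    # Hypothetical request: 10,000 input tokens and 2,000 output tokens.
    for model in PRICES_PER_MILLION:
        print(f"{model}: ${request_cost(model, 10_000, 2_000):.4f}")
```

Because output tokens are billed at several times the input rate on every model listed, generation-heavy workloads tend to be dominated by the output price, so it is worth estimating both sides separately.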