Pricing & plans

Flat rate. Every model.
No token math.

Every dollar you pay turns into $3 of model usage at provider rates — and roughly $6 when you pair DevPass with SoulForge.

Lite

For occasional AI-assisted coding

$29/mo
Casual hobby work
What you actually get3× value
You pay$29/mo
You use$87 at provider rates
$29 paid$87 used
With SoulForge → prompt caching cuts ~50% of tokens, stretching your $87 to ~$174 of effective use.
  • All 200+ models — Claude, GPT-5, Gemini, GLM, Qwen, …
  • Works with Claude Code, OpenCode, SoulForge & every OpenAI-compatible tool
  • Real-time usage dashboard with per-request cost
  • Switch tiers any time — prorated
Get Lite
Most popular

Pro

For daily development workflows

$79/mo
Most developers ship from here
What you actually get3× value
You pay$79/mo
You use$237 at provider rates
$79 paid$237 used
With SoulForge → prompt caching cuts ~50% of tokens, stretching your $237 to ~$474 of effective use.
  • Everything in Lite
  • Headroom for full-day agent runs in Claude Code & OpenCode
  • Priority routing on flagship models
  • Email support with 1-business-day reply
Get Pro

Max

For power users and heavy sessions

$179/mo
All-day agent runs
What you actually get3× value
You pay$179/mo
You use$537 at provider rates
$179 paid$537 used
With SoulForge → prompt caching cuts ~50% of tokens, stretching your $537 to ~$1074 of effective use.
  • Everything in Pro
  • Comfortable for non-stop SoulForge & Claude Code usage
  • Priority support, faster turnaround
  • Best $/usage ratio across the lineup
Get Max

Usage is metered at each provider's published per-token rate. Every request shows its dollar value in your dashboard in real time.

Pair with SoulForge

Cut ~50% of tokens.
Double the value of your DevPass.

SoulForge is a coding agent built around aggressive prompt caching and context reuse. Point it at LLM Gateway and it sends roughly half the tokens of an equivalent Claude Code session — same model, same task, smaller bill.

  • 1
    Prompt caching by default — system prompt, tools, and project context are cached on every provider that supports it.
  • 2
    Context-aware compaction — SoulForge prunes stale turns instead of replaying the whole conversation.
  • 3
    Same DevPass key — no separate subscription. Run soulforge, type /keys, paste your key.
Get SoulForge
Tokens used · same task
Without SoulForge1,000K

Standard agent loop, no aggressive caching

SOULFORGEWith SoulForge−50%
~500K

Prompt-cache hits on every reusable prefix

Effective DevPass value
Plan baseline
~6×
With SoulForge

Actual savings vary by workload. The 50% figure is typical for multi-turn agent sessions where the system prompt and codebase context stay stable.

Compare plans

What's in each plan

Every tier ships with the full model catalog. The only thing that changes is how much usage you have to play with each month.

Feature
Lite
$29/mo · $290/yr
ProPOPULAR
$79/mo · $790/yr
Max
$179/mo · $1790/yr
You pay$29/mo$79/mo$179/mo
Monthly model usage at provider rates$87$237$537
Effective with SoulForge (~50% token cut)~$174~$474~$1074
Models included200+200+200+
Latest flagships (Opus 4.7, GPT-5.5, Gemini 3.1 Pro)IncludedIncludedIncluded
Open-weight Chinese coders (GLM-4.7, Qwen3, Kimi K2.6)IncludedIncludedIncluded
Works with Claude Code, OpenCode, SoulForgeIncludedIncludedIncluded
Any OpenAI/Anthropic-compatible toolIncludedIncludedIncluded
Real-time usage dashboardIncludedIncludedIncluded
Per-request cost & latency analyticsIncludedIncludedIncluded
Switch tiers anytime (prorated)IncludedIncludedIncluded
Annual billing (save 2 months)IncludedIncludedIncluded
Email supportIncludedIncludedIncluded
Priority supportNot includedIncludedIncluded
Headroom for all-day agent runsNot includedNot includedIncluded

Usage is metered at each provider's published per-token rate (input, output, and cached tokens). Every request shows its dollar value in your dashboard in real time. SoulForge savings vary by workload — 50% is typical for multi-turn agent sessions where the system prompt and codebase context stay stable.

Not sure which plan fits?

Estimate your monthly cost with the token calculator — pick a model, paste a sample prompt, and see what your usage actually looks like.

Open calculator

FAQ

Common questions

Everything you need to know about usage limits, plans, and getting started with DevPass.

Can't find an answer? Contact us

Every plan includes a fixed monthly usage allowance, calculated at standard provider rates:

  • Lite — $29/mo: $87 in monthly model usage
  • Pro — $79/mo: $237 in monthly model usage
  • Max — $179/mo: $537 in monthly model usage

That's roughly 3× the subscription price — effectively model access at a deep discount compared to paying providers directly.

Still deciding?

Start on Pro — most developers ship from there. Switch tiers any time, prorated.