How to use Claude Code without paying $200/month
Five proven ways to keep using Claude Opus and Sonnet for coding without paying Anthropic's $200/month Max subscription. Real setups with iFlow, Qwen, Kiro free tiers, and kRouter combos.
The Claude Code Max plan is $200 a month. That is real money. For most developers, it is also more than you need -- Claude itself is great, but Anthropic's pricing assumes you are agentic-coding eight hours a day on Opus 4.7.
Here are five ways to get the same Claude models, or close enough, without the Max bill.
TL;DR
If you only want the answer: install kRouter, connect Kiro AI (free Claude 4.5), add GLM-5.1 Coding Plan as a $3-8/month overflow, and point Claude Code at localhost:20128. You will spend under $10/month instead of $200, and your IDE will not notice the difference.
The rest of this post explains why each piece works.
1. Kiro AI -- Free Claude 4.5 via OAuth
Kiro gives you free Claude Sonnet 4.5 (and Haiku 4.5) through their browser-based OAuth flow. It is genuinely free. The catch: you sign in with AWS Builder ID, Google, or GitHub, and you are bound by Kiro's terms of service for personal coding use.
Once connected through kRouter:
# Your IDE config
OPENAI_BASE_URL=http://localhost:20128/v1
OPENAI_API_KEY=sk-krouter-local
# Model:
kr/claude-sonnet-4.5Claude Code, Cursor, Cline, any tool -- they all see a normal OpenAI-compatible endpoint and Kiro serves Claude on the other side.
When it works well: the 80% of your day that is normal coding. Refactors, test generation, code review.
When it does not: Kiro occasionally rate-limits aggressively under sustained load. That is where the fallback chain matters.
2. iFlow -- 8 free models, no sign-up
iFlow is the most underrated free provider in 2026. It serves 8 models without requiring an API key or even an account:
- Claude Sonnet 4.5 (via proxy)
- GPT-4.1
- Gemini 2.5 Pro
- DeepSeek V3.2
- Qwen 3 235B
- Llama 4 Maverick
- Mistral Large
- Command R+
The quality varies by model and time of day (iFlow load-balances across community infrastructure), but for overflow and fallback purposes it is excellent. Add it as a zero-config free tier in kRouter:
Provider: iFlow
Auth: none required
Models: 8 (auto-detected)3. Qwen -- 3 free models with generous quota
Alibaba's Qwen offers three models on a free tier with daily quota resets:
- Qwen 3 235B (frontier reasoning)
- Qwen 3 30B (fast, good for autocomplete)
- Qwen Coder 2.5 32B (code-specialized)
The free tier gives you roughly 1M tokens/day across models. For a developer coding 4-5 hours/day, that is enough for most workloads. Sign up at dashscope.aliyuncs.com for an API key, add it to kRouter, and you have a third free provider in your chain.
4. GLM-5.1 Coding Plan -- $0.60 per million tokens
When the free tiers get tight, you want a cheap pay-per-token tier behind them. GLM-5.1 at $0.60/M input / $1.20/M output is the current price-to-quality leader for coding. It is roughly Claude Sonnet quality on most code-edit tasks, sometimes better on Chinese-language reasoning.
In a kRouter combo:
1. kr/claude-sonnet-4.5 # free, primary
2. iflow/<auto> # free, 8 models
3. glm/glm-5.1 # $3-8/month overflow
4. qwen/qwen-3-235b # free, daily quotaThe dashboard shows you exactly how often it falls through to GLM. For most developers it is ~5-10% of requests when you stack multiple free providers. At those volumes, GLM costs $3-5 total per month.
5. MiniMax M2.7 -- $0.20 per million tokens
For long-context tasks (large file rewrites, multi-file refactors), MiniMax M2.7 at $0.20/M is the cheapest 1M-context model that does not embarrass itself. Add it as a context-aware tier specifically for "this file is huge" cases.
The full kRouter setup
Here is how to wire all of this together:
# Install kRouter
npm install -g @sifxprime/krouter
krouter -t
# Open the dashboard
http://localhost:20128/dashboard
# Add providers:
# 1. Kiro -> OAuth -> AWS Builder ID (free Claude Sonnet 4.5 + Haiku 4.5)
# 2. iFlow -> no auth (free 8 models)
# 3. Qwen -> API key from dashscope.aliyuncs.com (free 3 models)
# 4. GLM -> API key from open.bigmodel.cn ($0.60/M overflow)
# 5. MiniMax -> API key (optional, $0.20/M long-context)
# Build combo:
# kr/claude-sonnet-4.5 -> iflow/<auto> -> qwen/qwen-3-235b -> glm/glm-5.1
# In Claude Code:
OPENAI_BASE_URL=http://localhost:20128/v1
OPENAI_API_KEY=sk-krouter-localkRouter's 3-tier auto-fallback handles the rest. When Kiro is available, it serves Claude for free. When Kiro rate-limits, iFlow or Qwen catch the request. When all free tiers are exhausted (rare with this many providers), GLM picks up at $0.60/M. Your IDE never sees an error.
A real monthly setup
Here is what a real developer's stack looks like, with kRouter:
| Provider | Cost | Used for |
|---|---|---|
| Kiro AI | $0 | 55% of requests |
| iFlow | $0 | 15% of requests |
| Qwen | $0 | 10% of requests |
| GLM-5.1 | ~$4 | 15% (overflow) |
| MiniMax M2.7 | ~$1 | 5% (long context) |
| Total | ~$5/mo |
Compare to the $200 Max plan, or the $20-40/month a typical pay-as-you-go Anthropic key would cost. You save 95%+ and never hit a quota wall.
What you give up
To be honest: you do not get Opus 4.7 on every request. The free and cheap tiers route to Sonnet 4.5 / GLM-5.1 / Qwen -- all excellent for coding, none of them Opus. If you specifically need Opus reasoning for a hard problem, add a direct Anthropic API key as a final tier. kRouter will only fall through to it when nothing else works.
Install
npm install -g @sifxprime/krouter
krouter -t
# Open http://localhost:20128/dashboard
# Connect Kiro, iFlow, Qwen, then GLM
# Point your IDE at http://localhost:20128/v1The full setup guide is at /install. Provider details at /providers. Provider comparison at /compare.
You are about to save several hundred dollars a year. Send the difference to your favorite open-source project.
Klaw is the Kodelyth AI agent. He writes drafts, runs the benchmarks, and tracks every cost number in this post live through kRouter. Humans review before publish.
Install kRouter