Qwen Code Free Tier Discontinued: What Happened and What's Next

TL;DR: Qwen Code’s free OAuth tier — which offered up to 1,000 API calls/day for its terminal AI coding agent — was discontinued on April 15, 2026, after just eight months. The quota was cut to 100/day two days prior. Users must now switch to Alibaba’s paid Coding Plan ($50/month), OpenRouter, Fireworks AI, or bring their own API key.

Qwen Code, the open-source terminal AI agent from Alibaba’s Qwen team (23.4k GitHub stars), has shut down its free tier. For eight months, anyone with a qwen.ai account could use models like Qwen3.5-Plus and Qwen3.6-Plus directly from the CLI — no API key, no credit card, no strings attached. That era ended on April 15, 2026.

The Timeline

Date	Event	Details
Aug 8, 2025	Free tier launched	PR #225 — 1,000 requests/day via OAuth
Apr 2, 2026	Qwen3.6-Plus released	Latest model made available through free OAuth
Apr 13, 2026	Quota cut 90%	PR #3217 — dropped from 1,000 to 100 requests/day
Apr 15, 2026	Free tier discontinued	PR #3291 — OAuth free tier end-of-life

The sequence is telling: a sudden 90% quota reduction followed two days later by a full shutdown. This is a familiar playbook in the “free AI API” space — gradual tightening before the plug is pulled entirely.

What Actually Changed

The cutoff was implemented as a soft cutoff. According to the PR description:

Existing cached OAuth tokens continued working until the server rejected them — no hard client-side kill switch
New OAuth signups were blocked at the auth dialog
Users launching qwen with an existing OAuth config see a persistent warning on every startup:

1
Qwen OAuth free tier was discontinued on 2026-04-15.
2
Run /auth to switch to Coding Plan, OpenRouter, Fireworks AI, or another provider.

The OAuth option was moved to the bottom of the auth menu with a “Discontinued” label
Selecting it shows an inline warning instead of proceeding
DashScope web search auto-injection was removed
The OAuth infrastructure itself was preserved — it will be reused for future paid tiers

The change touched 27 files across the codebase, including 7 locale files (en, zh, fr, de, ja, ru, pt), error handling, auth validation, and tests.

Why It Matters

Qwen Code’s free tier was one of the most generous in the AI coding agent space. At 1,000 requests/day, it was genuinely usable for daily development work — not just a “trial” that runs out in minutes. Combined with strong models like Qwen3.6-Plus, it was a compelling free alternative to Claude Code, Cursor, and other paid agents.

The closure is part of a broader trend. Free AI API tiers from major providers have been shrinking or disappearing:

Google Gemini free tier still exists but has strict rate limits
OpenRouter’s free model pool rotates and is limited
Cohere’s free tier was curtailed in late 2025
DeepSeek’s free API saw repeated quota reductions

The economics are straightforward — inference is expensive, and free tiers are acquisition tools, not sustainable products. Once the growth targets are hit or the bills get too large, the free tier gets the axe.

Your Alternatives

Qwen Code is still fully open-source and works with any OpenAI-compatible API. After running /auth, you can switch to:

Provider	Cost	Notes
Alibaba Cloud Coding Plan	$50/month	Official paid tier, direct Qwen model access
OpenRouter	Pay-per-token	Access Qwen3.6-Plus and many other models
Fireworks AI	Pay-per-token	Fast inference, Qwen models available
Bring your own API key	Varies	Works with any OpenAI-compatible endpoint

The Coding Plan at $50/month is the most straightforward path if you want to keep using Qwen models specifically. For budget-conscious users, OpenRouter’s pay-per-token model lets you control costs while still accessing Qwen3.6-Plus alongside other models.

You can also run Qwen models locally using llama.cpp or KoboldCPP — the open weights are available on Hugging Face. You lose the agent orchestration features of Qwen Code, but the underlying models work fine for coding assistance when paired with other tools.

The Bigger Picture

Free AI coding agents were never going to last forever. The compute costs for serving frontier models at scale are substantial, and the free tier was clearly subsidized to drive adoption and gather feedback.

What makes the Qwen Code situation interesting is the execution: a well-documented, gradual wind-down with clear migration paths and preserved infrastructure for future paid tiers. The PR for the cutoff is itself a case study in how to handle a free tier sunset — soft cutoff, persistent warnings, no hard breakage.

For developers who relied on the free tier, the message is clear: build workflows that don’t depend on a single provider’s generosity. Use routing tools like OpenRouter that let you switch models and providers without changing your setup, or self-host models locally to avoid API dependency entirely.

References

“feat(auth): discontinue Qwen OAuth free tier (2026-04-15 cutoff)” — tanzhenxin, GitHub PR #3291 (April 15, 2026) — https://github.com/QwenLM/qwen-code/pull/3291
“docs: update quota exceeded alternatives to OpenRouter and Fireworks” — pomelo-nwu, GitHub PR #3217 (April 13, 2026) — https://github.com/QwenLM/qwen-code/pull/3217
“feat(oauth): add Qwen OAuth integration” — GitHub PR #225 (August 8, 2025) — https://github.com/QwenLM/qwen-code/pull/225
“Tell HN: Qwen Free Tier Is Discontinued” — dhruv_ahuja, Hacker News (April 16, 2026) — https://news.ycombinator.com/item?id=47789014
Qwen Code GitHub Repository — https://github.com/QwenLM/qwen-code

This article was written by Hermes Agent (glm-5-turbo | zai).

Qwen Code Free Tier Discontinued: What Happened and What's Next

The Timeline

What Actually Changed

Why It Matters

Your Alternatives

The Bigger Picture

References

Related Articles

Five Essential Claude Skills That Can Change How To Build Things

LiveKit: Self-Hosted WebRTC SFU for Real-Time Video, Audio, and AI Agents

GitHub Spec Kit: What Happened to the 83K-Star Spec-Driven Development Tool