SemiAnalysis just published a sharp thread of primary research on AI subscription economics. Their team bought one of each Anthropic and OpenAI subscription plan, ran long-horizon coding tasks until the weekly limits were exhausted, and back-calculated what each subscription actually delivers at API-equivalent pricing. The headline numbers — up to $14,000/month of usage on a $200 ChatGPT Pro plan and $8,000/month on Claude Max 20x — are 4× to 7× more generous than the widely-circulated ~$2,000/month rule of thumb. They then modeled subscription gross margin under the assumption that the underlying API carries a 75% margin. The result is the heat map circulating across timelines this week: ChatGPT Pro 20x runs -1,650% margin at full utilization, Claude Max 20x runs -900%, and every tier crosses into negative territory once a single power user pushes past roughly 10–20% utilization. Their conclusion: openly nerfing subscriptions is off the table because public backlash is asymmetric to the savings, so the labs will instead withhold new features and models from subscription plans — with the explicit prediction that Mythos may end up being API-only. We agree with all of this. The experimental approach is clean primary research, the heat map is the right framing of the data, and the policy prediction — vintage withholding rather than usage caps — is the correct structural answer to the subsidy problem. This piece picks up where SemiAnalysis stopped. The same charts are already being read by a second wave of commentary as “AI subscriptions are a bubble” — confusing accounting margin with strategic position. The deeper structural read goes the other direction: the negative margin is not a leak, the subscription is doing three pieces of strategic work simultaneously, and the timeline on which this trade pays off is the timeline of token deflation, not the timeline of quarterly earnings. Three things the bubble framing misses, and SemiAnalysis touches but does not fully unpack:
This is not a bubble. It is the harness training auction — the most important strategic transaction in AI right now, and the one nobody on the surface reading is pricing correctly. The Three-Asset SubsidyThe subscription is not one product. It is three assets bundled at a single price. Asset 1: Harness training signalThe frontier moat has shifted. Model weights are increasingly fungible — Sonnet, GPT, Gemini, and Grok cluster within a few points on benchmarks that mattered last year. What separates them now is the harness — the agentic scaffold around the model: tool use, planning, memory, error recovery, multi-step orchestration, browser control, code execution. This is what Harness Theory predicts: when the model commoditizes, the moat migrates to the scaffolding. And the scaffolding cannot be trained on benchmarks. It can only be trained on real long-horizon workloads — the kind a $200/month power user runs for 14 hours a day, four weeks a month. Asset 2: Power user lock-inNot the casual user. The casual user pays $20 and uses 2% of capacity — they are the cross-subsidy. The asset being bought is the high-utilization user: the developer who has wired their entire workflow around Claude Code or Codex CLI, who has internalized the harness, who pays $200/month and would scream if it disappeared. These users are non-substitutable in three directions: workflow ($200/mo plus a year of muscle memory), API equivalence (their usage maps to $14K/month, which they will not pay), and competitor switching (the harness is sticky in a way the model is not). Asset 3: Token deflation call optionThis is the asset nobody is pricing. Token prices have fa |