Edition 2026-05-17 · read as Investor
AnthropicRents220KGPUsFromxAI,KillsClaudeArbitrage
- Sources
- 36
- Words
- 1,832
- Read
- 9min
Topics Agentic AI AI Capital LLM Inference
◆ The signal
Anthropic leased two hundred and twenty thousand GPUs from xAI's Colossus 1, which is to say from its sworn enemy, in the same week it passed OpenAI on enterprise spend (34.4% to 32.3%, per Ramp) and quietly converted every subscription into dollar-matched API credits. That last move closes the 70-90% arbitrage that funded most Claude-wrapper startups. Two readings: real demand forcing awkward bedfellows, or a tidy pre-IPO margin sweep. Probably both. Either way, every Claude-dependent name in the book has worse unit economics this Monday than last Friday.
◆ INTELLIGENCE MAP
01 Enterprise AI Leadership Flipped — Revenue Quality Didn't Follow
act nowRamp data shows Anthropic at 34.4% vs OpenAI's 32.3% of B2B spend — the first documented lead change. But ServiceNow blew its full-year Claude budget by May because Anthropic ships zero SLAs, zero granular telemetry. Enterprise share is real; enterprise-grade infrastructure is not.
- Anthropic B2B share
- OpenAI B2B share
- OpenAI YoY growth
- Anthropic YoY growth
- Anthropic34.4
- OpenAI32.3
02 Coding Agent Subsidy War Kills Wrapper Economics Overnight
act nowAnthropic converted subscriptions to dollar-matched API credits, eliminating the 70-90% cost arbitrage third-party harnesses exploited. OpenAI countered with 2 months free Codex for enterprise switchers. Claude Code shipped autonomous /goal mode. Standalone coding-agent startups (Cognition, Factory) just lost their pricing moat and their product differentiation in the same week.
- Arbitrage killed
- Codex free period
- Anthropic Oct IPO target
- Credit match date
- Wrapper COGS (before)10
- Wrapper COGS (after)85
03 Agent Economy Reaches 59% Majority — Category Is Investable Now
monitorVercel production data across 200K+ teams shows agentic workloads at 59% of token volume. Anthropic captures 61% of spend (premium reasoning), Google takes 38% of volume (commodity throughput). SAP committed €100M to autonomous enterprise fund. a16z published system-of-intelligence thesis with live Stitch investment. Agent infra is the majority use case, not a speculative thesis.
- Agentic workloads
- Anthropic spend share
- Google volume share
- SAP agent fund
04 AI Security Crosses First KEV Threshold — Category Has Budget Codes
monitorLiteLLM (AI gateway) hit CISA's Known Exploited Vulnerabilities catalog — the first AI-infra component actively exploited in the wild. Separately, Mythos became the first model to clear both UK AISI attack ranges, with NSA (not CISA) winning access. DepthFirst claims 10x cost advantage over Mythos. AI security is no longer a thesis; it has KEV entries, Congressional briefings, and a $40B deepfake TAM anchor.
- AISI ranges cleared
- LiteLLM status
- DepthFirst cost edge
- Cyber task doubling
- LiteLLM hits KEVFirst AI-infra on exploited list
- Mythos clears AISIFirst model to pass both ranges
- NSA gets accessIC procurement, not civilian
- DepthFirst 10xSpecialized harness economics proven
05 Neocloud Unit Economics Revealed — Structurally Cash-Negative
backgroundNebius printed 684% YoY revenue growth but disclosed $2.47B capex against $2.26B operating cash — proving neoclouds are structurally cash-negative even at hypergrowth. The Colossus 1 lease (Anthropic renting from xAI) validates GPU scarcity but also confirms that neocloud multiples should compress from 8-10x to 2-3x forward revenue. The alpha has migrated to Nvidia adjacencies and hyperscaler-contracted energy.
- Nebius Q1 growth
- Capex vs cash
- Cisco AI orders
- Fervo IPO pop
◆ DEEP DIVES
01 Enterprise AI Revenue Is Real — Enterprise AI Infrastructure Is Consumer-Grade
The Flip Is Real. The Foundation Beneath It Is Not.
Ramp's April card-spend data puts Anthropic at 34.4% of US B2B spend versus OpenAI's 32.3%, the first documented lead change in any credible enterprise AI dataset. Over the trailing year Anthropic quadrupled business adoption while OpenAI grew three-tenths of a percent. The 0.3% deserves its own sentence. It is not a slowdown in a flat market. It is a stall while the category grew roughly four times around it.
Anthropic also admitted it planned for 10x growth and got 80x, then leased the entire Colossus 1 cluster (220K+ NVIDIA GPUs) from Elon Musk's xAI, the same Musk who publicly called Anthropic 'misanthropic and evil.' When a competitor rents compute from a sworn enemy, the supply situation is not a glut. It is a shortage that bends strategy, and what Anthropic is not doing with that capital is building the next cluster itself.
The Revenue Quality Problem Nobody Is Discussing
The share number obscures a structural fragility. ServiceNow, one of the most sophisticated enterprise software buyers on earth, blew through its full-year Anthropic budget by May 2026. Not because Claude underdelivered. Because Anthropic provides:
- Zero granular per-user, per-tool usage telemetry
- Zero enterprise SLAs worth the name
- No enterprise dashboard that would embarrass a mid-tier SaaS vendor from 2014
National Life Group's CIO put it without ornament: Anthropic is 'great for consumer usage but not great for companies.'
Enterprise AI ARR is not SaaS ARR. Switching costs are near zero, and the budget overruns are invisible until the invoice arrives.
The Deployment Gap Creates the Alpha
Every major player is now running the Palantir playbook. Google Cloud is hiring hundreds of forward-deployed engineers. OpenAI stood up DeployCo with Bain Capital and acquired a consulting firm for its 150-FDE starting roster. Salesforce and ServiceNow are staffing the same function. The industry has quietly conceded that deployment is the bottleneck, not model capability.
Which means the $30B ARR figure is real but reversible. The quality of that revenue, measured by lock-in, telemetry, and contractual switching costs, is consumer-grade at best. There is a version where this gets cleaned up before a Q4 IPO, which is plausible and boring if it happens. There is a version where the gap persists long enough for an AI observability category to fill it, which is the more interesting one. This is probably wrong, but the second version is the base case. The bull case is not crazy. Read the next ServiceNow invoice before deciding which one you are underwriting.
Action items
- Demand SLA and usage-telemetry roadmap from every portco selling into enterprise AI buyers by end of month
- Open sourcing sprint on AI observability / FinOps-for-AI at Seed-Series A within 30 days
- Apply 20-40% 'reversibility discount' to any LLM-layer ARR multiple where SLAs and telemetry are absent
- Re-underwrite xAI/Grok exposure as infrastructure + X-distribution play, not frontier lab
Sources:The Pragmatic Engineer · Laura Bratton · TLDR AI · StrictlyVC · Morning Brew · Bloomberg Technology
02 The Coding Agent Pricing War Just Reset Every Wrapper's Unit Economics
What Happened
On May 12-13, Anthropic and OpenAI repriced the coding-agent market within hours of each other, which is the sort of coincidence that isn't. Anthropic converted every Claude subscription into a dollar-matched API credit pool, so a two hundred dollar plan now buys two hundred dollars of programmatic tokens, and the seventy to ninety percent arbitrage third-party harnesses had been running quietly disappears. OpenAI answered the same afternoon with two months of free Codex for enterprise switchers inside a thirty-day window.
Context: Ramp's April print showed Anthropic leading OpenAI for the first time, 34.4 against 32.3 percent, a new CFO is in the seat, and the IPO is probably October. The honest read is narrow. Margin recovery dressed as developer defense and timed to pre-IPO diligence.
Who Gets Hurt
Any coding tool running COGS against subscription tokens has lost twenty to forty percent of effective runway since Friday. The arbitrage was unsubtle: buy a two hundred dollar a month subscription, push thousands of API-equivalent calls through the UI, pay seventy to ninety percent below list. That trade is closed.
Meanwhile Claude Code shipped /goal and Auto Mode, which removes the human-in-the-loop steps entirely. The product surface that standalone autonomous agents (Cognition's Devin, Factory, Cosine) were selling against is now a free CLI primitive. The rounds those companies raised were priced on features that just became table stakes.
Player Pre-Change Position Post-Change Position Third-party harnesses (Cline, OpenCode) 70-90% cost advantage via subscription abuse Full API-rate pricing; COGS jumps 3-9x Autonomous agents (Devin, Factory) Product differentiation on autonomy Feature parity with free Claude Code /goal Multi-model routing (Cursor) Provider-agnostic moat Strengthened — security/bounded execution is real moat The Squeeze From Above and Below
This is probably wrong in the timing, but the cleanest framing is a two-sided squeeze. From above, Anthropic meters tokens and OpenAI subsidizes switches. From below, Notion's External Agents API now hosts Claude, Codex, Cursor, Decagon, Warp, and Devin inside the same workspace, which commoditizes the harness layer. Anyone thin-wrapping a frontier model now eats cost inflation and distribution compression at once.
The coding-agent thesis is now a duopoly subsidy fight with a commoditized harness layer beneath. Every portco priced on a Claude subscription arbitrage is worth less today than it was last Friday.
Three positions survive: proprietary workflow data, open-source distribution on Cline's model, or bounded enterprise execution on Cursor's security moat. The counter-thesis is that one of the harnesses quietly accumulates a real workflow datamoat before the subsidy ends, which is possible and not obvious. Anything that doesn't clear one of the three is a pass.
Action items
- Request updated gross margin models from every Claude-dependent coding tool portco by end of week — model the arbitrage as permanently gone
- Re-underwrite any autonomous coding agent deals in pipeline against Claude Code's /goal + Auto Mode baseline
- Accelerate Anthropic pre-IPO secondary positioning before September
- Open diligence on training-efficiency infra (Nous TST 2-3x speedup, Datology 17x less VLM compute) at current pre-consensus valuations
Sources:AINews · Daily Dose of DS · ben's bites · TLDR AI · The Pragmatic Engineer
03 Agent Economy at 59% — The Infrastructure Layer Is Investable Before SAP and ServiceNow Absorb It
The Data Point That Changes the Thesis
Vercel published what is, to my knowledge, the first production-grade AI usage index across 200K+ teams, and the headline is that agentic workloads carry 59% of all token volume. This is not a lab benchmark or a self-reported ARR slide. It is downstream production data, which is rarer than it sounds. The chat-completion era is now a minority revenue line.
The interesting cut is underneath. Anthropic captures 61% of spend, mostly via Opus on long, tool-using agent calls, while Google takes 38% of volume via Flash on cheap, high-throughput commodity work. Two different businesses are now visible inside the thing we have been calling 'foundation models,' and they probably should not trade on the same multiple.
Incumbents Are Moving First This Time
The cloud transition was defined by startups, with the incumbents catching up late and overpaying. The pattern this round looks different. The ERPs and workflow platforms are moving first, which is worth flagging because it changes the resource math for anyone underwriting a greenfield platform play:
- SAP committed €100M to an Autonomous Enterprise partner fund wired into NVIDIA and Microsoft
- ServiceNow shipped Action Fabric, decoupling logic from UI and exposing workflows as headless APIs for agents to consume
- a16z published its system-of-intelligence thesis with a live Stitch investment, arguing the moat is shifting from data to orchestration
The Lemkin case makes it concrete. Seats fell from 10+ humans to 2 humans plus 1 API seat, while spend rose 83% ($12K→$22K) with 20+ agents running on top. Seat count collapsed. The bill went up.
Value in GTM software is migrating from data gravity to orchestration gravity, and the next 12-18 months is the last clean window to own the intelligence layer before incumbents or consensus close it.
Where the Alpha Sits
Three layers, three different capital dynamics, and only one of them is obviously defensible.
Layer Stage Entry Window Agent orchestration (standalone) Fragmented; LangChain, CrewAI Compression risk — M&A exits preferred Agent infra picks-and-shovels (MCP, identity, governance, observability) Greenfield — Seed/A pricing rational Highest alpha; 2-3 quarters before platform absorption Vertical workflow agents (narrow, structured) Category formation Prime entry window — too narrow for incumbents to prioritize Abridge at $5.3B is the vertical template, or rather the most legible version of it: 250 health systems, 80M+ conversations a year, wedging from documentation into prior auth and clinical decision support. The moat is proprietary interaction data per use, not model quality. Health systems compressed release cycles from quarterly to monthly for them, which is faster than regulated buyers usually move.
This is probably wrong, but the timing question splits cleanly. a16z's thesis implies a 12-18 month window. ServiceNow's Action Fabric and SAP's fund suggest incumbents compress that to 6-9 months at the platform layer. The vertical layer holds longer, because incumbents will not prioritize narrow markets until someone embarrasses them in one.
Action items
- Source 3-5 agent infrastructure deals in MCP tooling, agent identity, and agent observability this quarter
- Request Vercel AI Gateway production index as a recurring diligence data source — use spend/volume split as benchmark for every model-layer pitch
- Update vertical AI underwriting framework: require per-use data flywheel, ≥2 monetization vectors, and explicit non-compete posture vs dominant system-of-record
- Run portfolio revalidation on every seat-based SaaS position — model seat value at +30% and -30% scenarios using Lemkin's template
Sources:ben's bites · TLDR AI · a16z · TLDR IT · TLDR · Laura Bratton
04 AI Security Exits Thesis Stage — LiteLLM on KEV and Mythos at NSA Create Two Distinct Funding Opportunities
AI security gets its first KEV entry
LiteLLM, the LLM-routing control plane, landed on CISA's Known Exploited Vulnerabilities catalog this week. That is the first time AI infrastructure has been federally flagged as actively exploited, which matters less for what it says about LiteLLM than for what it does to procurement budgeting. Separately, Anthropic's Mythos became the first model to clear both UK AISI simulated attack ranges, and the House Homeland Security Committee routed access through NSA, not CISA. That last detail is the tell. Offensive and IC-led procurement, not civilian defense.
Stack on Ollama's GGUF model-loader bug at CVSS 9.1, OpenClaw shipping six critical CVEs in a single cycle, and a PraisonAI auth-bypass weaponized within four hours of disclosure. The AI-infra stack is roughly where enterprise SaaS sat in 2014.
Two markets under one label
Sources describe a bifurcation in AI security economics that most investors are still pricing as one category:
Segment Economics Signal Procurement Path Entry Timing AI-native vulnerability discovery DepthFirst: 10x cost advantage over Mythos ($1K vs $10K for same FFmpeg bugs) Enterprise / DevSecOps Series A window open now AI-powered offensive/IC tools Mythos NSA access; $40B deepfake TAM by 2027 IC/DoD — TS/SCI required Dual-use Series A/B AI infrastructure defense LiteLLM on KEV; PraisonAI 4-hour exploit Enterprise CISO budgets Category-defining — pre-consensus EDR incumbents All 5 tested share identical building blocks; AI extracts detection logic in days Legacy renewal cycles Multiples should compress OpenAI's Daybreak launch with 8 incumbent 'partners' (Cloudflare, Cisco, CrowdStrike, PANW, Oracle, Zscaler, Akamai, Fortinet) is the setup pattern of a platform preparing to disintermediate its partners. That move has played out in four prior waves of enterprise software. This is probably wrong, but the partners usually sign anyway, because the alternative is sitting outside the platform while a competitor sits inside it.
Cyber-AI is pricing as an AI feature and transacting as a defense-tech category. The Series B window closes on whichever of those framings the market agrees to first.
The Mozilla-vs-curl test
One quality signal worth keeping. Mozilla found 271 bugs in Firefox using a custom agentic harness on Mythos Preview + Claude Opus 4.6. Mythos alone found 1 real CVE in curl, and the maintainer called it "primarily marketing." Harness design beats model selection, or rather, the more honest version of that claim: any AI AppSec founder who cannot explain why their harness investment produces 271 bugs instead of 1 is selling the model, not the product. Use it as a diligence filter.
Action items
- Run portfolio-wide exposure sweep for LiteLLM, Ollama, Traefik, and Argo CD versions — escalate any internet-exposed instances this week
- Build target list of 5-8 AI-native identity/deepfake defense companies at Series A/B and initiate conversations before the $40B TAM number enters consensus
- Request DepthFirst data room and validate the 10x cost claim against Mythos on 2-3 non-FFmpeg codebases before the round re-rates
- Add 'Daybreak feature absorption' to the competitive tracker for every security portfolio company — require 2-sentence answer on defensibility by next board meeting
Sources:CyberScoop · TLDR InfoSec · Clint Gibler · The Information AM · The Hacker News · SANS AtRisk
◆ QUICK HITS
Update: Cerebras closed day one at $311 vs $89 last private round (70% pop, ~$41.7B market cap) — Eclipse netted 17x, Tiger sitting on $1B paper gain; use as permanent comp for Modal ($4.5B), but note 70% first-day gaps historically retrace before lock-up expiry
Katie Roof
Nebius guided $3.0-3.4B 2026 revenue (~6x YoY) with 4+ customers bidding per GPU — but $2.47B capex exceeds $2.26B operating cash, proving neoclouds are structurally cash-negative; compress private multiples from 8-10x to 2-3x forward revenue
Martin Peers
Abridge raised $550M at $5.3B serving 250 health systems with 80M+ annual conversations — health systems compressed release cycles from quarterly to monthly; the ambient scribe category is closed, but payer-side prior auth and nursing workflows remain investable
Latent.Space
Benchmark raised a $225M SPV to defend Cerebras ownership into Tiger's round — early-stage purity is dead in AI; 93% of Benchmark's cost basis was late-stage adds that lowered the fund multiple but added $300M cash
Katie Roof
Google's Gemini Intelligence embeds autonomous task execution directly into Android OS (97%+ share markets like India, summer 2026 rollout) — stress-test every mobile AI agent portco against 'ships as OS feature' scenario before WWDC
Simplifying AI
CLARITY Act passage probability collapsed from 80% to 55% on Polymarket after Armstrong called the draft 'unworkable' — structure binary hedge across any stablecoin/COIN exposure; vol is underpriced given the dislocation
TLDR Crypto
Only 15% of enterprises have data foundations ready for agentic AI at scale (Fivetran index) despite spending millions — data quality and lineage cited as #1 blocker by nearly half; this is the cleanest picks-and-shovels TAM signal since early Snowflake era
TLDR Data
DuckDB shipped Quack client-server protocol — directly targets single-node workloads that Spark/Glue currently over-serve; ask every sub-TB ETL portco what percentage of ARR is vulnerable to displacement
TLDR Data
◆ Bottom line
The take.
Anthropic overtook OpenAI in enterprise spend this week (34.4% vs 32.3%), then killed the 70-90% cost arbitrage that funded most Claude-wrapper startups while simultaneously leasing 220K GPUs from its sworn enemy because it grew 80x instead of 10x. The three trades to execute this week: re-underwrite every Claude-dependent portco's gross margins (they're worse), open AI observability sourcing (ServiceNow blew its annual budget by May because no one built the monitoring layer), and shift agent-infra capital from standalone orchestration into picks-and-shovels before SAP's €100M fund and Notion's dev platform absorb the category.
Frequently asked
- What does Anthropic's subscription-to-API-credit conversion actually do to Claude wrapper economics?
- It eliminates the 70-90% arbitrage that wrappers were running by pushing API-equivalent traffic through subscription UIs. A $200/month plan now buys exactly $200 of programmatic tokens at list rates, so any coding tool whose COGS was based on subscription abuse just saw effective runway shrink 20-40% and gross margins compress 3-9x overnight.
- Why would Anthropic lease 220K GPUs from xAI, a company whose founder publicly attacks it?
- Because Anthropic planned for 10x growth and got 80x, and Colossus 1 was the only cluster of that size available on the timeline required. The strategic read is that compute scarcity is bending strategy harder than rivalry — and that Anthropic is choosing to rent rather than build the next cluster itself, which is a signal about capital allocation ahead of the IPO.
- How should the 34.4% vs 32.3% Ramp number be weighted given enterprise switching costs?
- It should be weighted as real but reversible. Anthropic provides minimal usage telemetry, weak SLAs, and a dashboard well below 2014-era SaaS standards, and ServiceNow blew through its full-year Anthropic budget by May. Apply a 20-40% reversibility discount to LLM-layer ARR multiples where SLAs and telemetry are absent.
- What's the cleanest way to position around an October Anthropic IPO?
- Treat the May repricing as classic pre-book margin cleanup and move on secondary positioning before September book-building begins. The combination of a new CFO, the credit conversion closing the arbitrage, and OpenAI's defensive Codex giveaway all point to a deliberate margin sweep being staged for diligence.
- Where does the alpha sit if incumbents are moving first in the agent economy?
- In agent infrastructure picks-and-shovels — MCP tooling, agent identity, governance, and observability — where Seed/Series A pricing is still rational and SAP's €100M fund hasn't yet activated corp dev consensus. Vertical workflow agents with proprietary per-use interaction data, on the Abridge template, are the second viable lane; standalone orchestration is compression risk.
◆ Same day, different angle
Read this day as…
◆ Recent in investor
Keep reading.
- SpaceX is pricing June 12 at one-point-seven-five trillion, roughly a hundred times revenue, into the worst tape we have seen for a listing…
- SpaceX is quietly collecting $2.17B/month in AI compute rent from Anthropic and Google — a $26B annualized run-rate that isn't in secondary…
- Anthropic edged OpenAI in enterprise billing on Ramp last week, 34.4 percent to 32.3, in the same week ServiceNow admitted it had burned its…
- ServiceNow burned its full-year Anthropic budget by May, with no SLAs, no per-user telemetry, no enterprise dashboard.
- Anthropic's June 15 pricing change closed the seventy-to-ninety percent subscription arbitrage the third-party Claude tools were quietly run…