Edition 2026-05-15 · read as Investor
Anthropic'sxAIDealSignalsThreeInvestorRepricingEvents
- Sources
- 36
- Words
- 1,802
- Read
- 9min
Topics AI Capital Agentic AI LLM Inference
◆ The signal
Anthropic leased 220K GPUs from Elon Musk's xAI — the company its CEO called 'misanthropic and evil' — because 80x growth against a 10x plan broke its compute plan. In the same week, Anthropic killed the 70-90% subscription arbitrage powering Claude wrappers (effective June 15), and ServiceNow disclosed it blew its full-year Anthropic budget by May with zero SLA recourse. Three repricing events landing simultaneously: infrastructure scarcity is real enough to bend strategy, coding-agent portco margins changed last Friday, and 'enterprise AI ARR' in your book may not be SaaS-quality revenue. Audit all three before month-end.
◆ INTELLIGENCE MAP
01 Compute Scarcity Forces Nuclear Option: xAI Becomes Anthropic's Landlord
act nowAnthropic rented xAI's entire Colossus 1 cluster (220K GPUs incl. GB200s) — ~45% of xAI's capacity — confirming compute shortage severe enough to bend rival strategy. Nebius 684% Q1 growth with capex > operating cash proves neocloud economics are structurally cash-negative. xAI effectively exited the frontier race.
- Anthropic growth vs plan
- Nebius Q1 rev growth
- Nebius capex vs op cash
- GPU customers per unit
02 Cerebras 70% First-Day Pop Validates SPV Era and Sets Permanent Comps
monitorCerebras closed at $311 vs $89 private round (70% IPO pop), yielding Eclipse 17x, Tiger ~$1B paper gain. Benchmark broke its own model with a $225M SPV to defend ownership — early-stage purity is dead in AI. Every late-stage AI infra mark re-rates against this print this week.
- First-day close
- Last private price
- Fully diluted mkt cap
- Benchmark SPV size
03 Enterprise AI Revenue ≠ SaaS Revenue: The Quality Crisis Emerges
act nowServiceNow exhausted its full-year Anthropic budget by May — no SLAs, no per-user telemetry, no enterprise dashboard. Google, OpenAI/Bain, and Salesforce all hiring hundreds of FDEs confirms deployment is the bottleneck. AI observability/FinOps is forming as a Datadog-scale category with no winner yet.
- Budget blown by
- FDE orgs hiring
- Modal valuation
- AI readiness gap
- Enterprise SaaS (typical)95
- Enterprise AI (Anthropic)20
04 Agent Stack Hits Majority: 59% of Tokens and a June 15 Margin Event
monitorVercel production data shows 59% of token volume is agentic. Spend/volume split: Anthropic takes 61% of dollars, Google takes 38% of volume — two different businesses inside 'foundation models.' Anthropic's June 15 credit conversion kills the 70-90% wrapper arbitrage. SAP committed €100M to autonomous enterprise.
- Anthropic share of $
- Google share of volume
- SAP agent fund
- Wrapper margin loss
05 Vertical AI Data Moats: Abridge's $5.3B Prints the Category Template
backgroundAbridge raised $550M at $5.3B servicing 250 health systems with 80M+ annual conversations — a corpus no entrant can replicate. Health systems compressed release cycles from quarterly to monthly. The 'save time → save money → save lives' wedge-and-expand is now the vertical AI underwriting template.
- Health systems live
- Annual conversations
- 2025 capital raised
- Specialties covered
- Wedge: Save TimeClinical docs — won
- Expand: Save MoneyPrior auth — active
- Expand: Save LivesCDS — FDA unlocked Jan'26
◆ DEEP DIVES
01 Anthropic Rents from the Enemy: Compute Scarcity Just Bent Rival Strategy
The Strategic Concession
Anthropic has leased the entire Colossus 1 cluster, 220,000-plus NVIDIA GPUs including GB200s, from Elon Musk's xAI. That is roughly forty-five percent of xAI's compute, rented to the company Musk publicly called "misanthropic and evil." Renting compute to a declared enemy is what a shortage bending strategy looks like. It is not what equilibrium looks like.
Dario Amodei has now conceded that Anthropic planned for 10x growth and got 80x. The quiet Claude Code nerfs and mid-trial Pro revocations were not strategy. They were capacity triage. This is the demand signal every neocloud thesis has been pricing in, arriving with named counterparties attached.
xAI's Retreat Is the Under-Discussed Signal
Renting forty-five percent of your compute to a competitor is, whatever else one calls it, a concession in the frontier race. xAI has no meaningful B2B or B2C traction, trails DeepSeek and Qwen in developer surveys, and its parent just became a landlord. The honest comp set is now neoclouds plus X-distribution, not Anthropic or OpenAI.
Any fund or secondary offering xAI at frontier-lab comps is mis-marked. Grok has no revenue, lost developer mindshare, and its compute is generating more value as rental income than as training runs.
Neocloud Economics: Real Demand, Ugly Unit Economics
Nebius is the public benchmark, or rather the only public benchmark we have. 684% Q1 revenue growth on $399M, guiding to three to three-point-four billion for 2026, with four-plus customers bidding per GPU. The catch sits on the cash statement: $2.47B of capex against $2.26B in operating cash flow. Structurally cash-negative, growing extraordinarily.
This is probably wrong, but: private neocloud rounds at eight to ten times forward revenue should compress to two to three times once the market accepts that capex exceeds operating cash on a permanent basis. The demand is real. The unit economics are not venture-returnable at the current entry multiple unless the exit is a strategic acquisition.
The 'Compute Glut' Narrative Is Dead
Public-market bears citing overcapacity are trading against a private market in which frontier labs rent from sworn enemies. The gap between the two is the arbitrage. The trade is picking the right layer of the infra stack, not blanket exposure.
Action items
- Re-underwrite all xAI/Grok exposure as infrastructure + X-distribution plays, not frontier-model labs — submit revised marks to IC by Friday
- Increase allocation sizing on neocloud/GPU-leasing positions (CoreWeave secondary, Crusoe, Nebius, Lambda) with long-duration frontier-lab contracts
- Compress neocloud private multiples from 8-10x to 2-3x forward revenue in any active term sheet, citing Nebius capex > operating cash as public comp
Sources:Anthropic's 80x growth broke its infra · AI infra capex hits $100B/partnership · Anthropic at $900B + Nebius +684% · AI compute is sold out 4:1
02 Enterprise AI Revenue Is Not SaaS Revenue — And the Market Just Found Out
The ServiceNow Wake-Up Call
ServiceNow, which is as sophisticated an enterprise software buyer as exists on this planet, blew through its full-year Anthropic budget by May 2026. Not because Claude underdelivered. Because Anthropic provides no granular per-user or per-tool usage telemetry, no enterprise dashboards, and no SLAs worth the name. National Life Group's CIO put it without ornament: Anthropic is 'great for consumer usage but not great for companies.'
This is probably wrong, but: the market is being asked to value Anthropic at $900B+ on the premise that enterprise revenue justifies the number, and consumer-grade plumbing does not usually justify enterprise-grade multiples. The revenue is real. The quality of that revenue — or rather, its stickiness, its predictability, and whatever contractual protection sits underneath it — is a different question, and the answer is the one that matters.
The FDE Consensus: Deployment Is the Bottleneck
Four organizations independently concluded the margin lives in deployment services this quarter, which is the kind of coincidence that usually isn't:
- Google Cloud is hiring hundreds of forward-deployed engineers
- OpenAI/Bain stood up DeployCo and acquired a consulting firm for its 150-FDE roster
- Salesforce is staffing the same function
- ServiceNow built AI Control Tower to sell into the problem it discovered internally
When four firms independently conclude the margin is in deployment services rather than the model, the margin is probably in deployment services rather than the model.
The AI Observability Category Is Forming Now
ServiceNow's response is the instructive part: it built AI Control Tower and is now selling it to other customers panicking about their Anthropic bills. A CDIO watched the problem form inside her own P&L and decided to be the vendor rather than the victim, which is the Datadog playbook with the serial numbers barely filed off.
The category ingredients are all present: token-level cost attribution, per-user spend caps, SLA monitoring across model APIs, and budget governance. No independent winner exists yet. The 6-12 month sourcing window is open, and what the incumbents are not doing during that window is shipping the missing telemetry themselves.
What This Means for LLM-Layer Marks
Enterprise AI spend is reversible in ways traditional SaaS is not. Low switching costs, no contractual lock-in (because no SLAs), FOMO-driven procurement, and budget overruns discovered only at billing time produce a cliff-shaped risk profile rather than a curve. Apply a 20-40% 'reversibility discount' to any model-layer ARR multiple where SLAs and telemetry are absent. The thesis could be wrong if Anthropic ships enterprise controls in the next two quarters, or if procurement teams decide they don't care. Neither has happened yet.
Action items
- Demand SLA and usage-telemetry roadmap from every model-layer portco claiming enterprise ARR — submit findings to IC within 2 weeks
- Launch a sourcing sprint on AI observability/FinOps for AI/token-cost-attribution at Seed to Series A — target 5 meetings within 30 days
- Apply a 20-40% reversibility discount to any LLM-layer ARR multiple in your book where SLAs, telemetry, or contractual lock-in are absent
- Build Palantir-alumni sourcing pipeline for deployment-services founders — 3-5 conversations before end of quarter
Sources:Anthropic has an enterprise gap · Anthropic's 80x growth broke its infra · DuckDB goes client-server, 85% agentic-AI readiness gap · a16z has published another map
03 The Agent Economy Hit Majority — And a June 15 Margin Event Is 30 Days Out
59% Is the Base Rate, Not the Upside Case
Vercel published what it calls the first production-grade index of real AI usage across more than 200,000 teams, and the headline number is that agentic workloads now carry 59% of all token volume. That is not a leading indicator. It is the majority case in production, which makes the chat-completion era a minority revenue line and most of the slideware from eighteen months ago retrospectively wrong.
The more interesting cut is the spend/volume bifurcation. Anthropic captures sixty-one percent of dollars on premium reasoning via Opus. Google moves thirty-eight percent of volume on commodity throughput via Flash. Two different businesses are now legible inside what we have been calling 'foundation models,' which validates multi-model routing as the enterprise default and retires the single-model moat thesis.
The June 15 Margin Event
Anthropic converted every Claude subscription into a dollar-matched API credit pool. A two-hundred-dollar plan now buys exactly two hundred dollars of programmatic tokens, and the seventy-to-ninety percent arbitrage that Cline, OpenCode and dozens of similar harnesses were running against subscription pricing is gone. OpenAI answered in hours with two months of free Codex for enterprise switchers.
Combine that with Claude Code's /goal mode absorbing the autonomous coding category and Notion's External Agents API commoditizing the interface, and the result is a two-sided squeeze on anyone whose business model was thin-wrapping a frontier model.
Every Claude-dependent portco that priced COGS off subsidized subscription tokens lost twenty to forty percent of runway last Friday. The change is four days old. Most founders have not flagged it to their boards yet.
Consolidation Accelerating from Above
The platform layer is moving to own the agent surface, and the inventory looks like this:
Platform Move What It Absorbs SAP €100M Autonomous Enterprise fund + NVIDIA/MSFT Agent orchestration startups ServiceNow Action Fabric — headless workflows via MCP Standalone workflow automation Notion Developer Platform + Claude/Codex as teammates Agent-hosting pure plays Airtable $10M Hyperagent credit program Agent-native distribution The pattern is consistent. Incumbents are consolidating the autonomous-enterprise category before pure-play agent startups can define it. The cloud transition ran the other direction. This time ERPs and workflow platforms moved first, which means the obvious horizontal land grab is mostly closed.
Where the Alpha Still Lives
This is probably wrong in one or two of these wedges, but vertical beats horizontal in agent ops from here. The investable surface is agent governance (the gap SAP and Levie keep flagging), agent identity/auth (eighty-one percent bot-detection bypass is a market by itself), evaluation and observability (the Haiku-as-judge pattern is now canonical), and vertical-specific orchestration in regulated industries. The horizontal seat at this table is taken.
Action items
- Stress-test every Claude-dependent portco's gross margin assuming the 70-90% subscription arbitrage is permanently gone — request updated cohort margins by May 30
- Source 3-5 agent infrastructure deals in MCP governance, agent identity, and evaluation tooling at Seed/Series A pricing — before SAP's corp dev team prices you out
- Map pipeline against 'agent-hosting platform' displacement risk — flag any deal that gets absorbed if Notion/Airtable/Cursor ships the feature natively
- Request Vercel AI Gateway production index as recurring data source and use spend/volume split as diligence benchmark for every model-layer pitch
Sources:Vercel's first production AI Gateway index · Anthropic is squeezing its pre-IPO round · SAP wrote a check for one hundred million euros · a16z has published another map · Anthropic shipped Claude Code
04 Vertical AI Moats Are Built from Data, Not Models: The Abridge Template
The Category Winner Proof Point
Abridge raised $550M in 2025 alone (two hundred fifty million early in the year, three hundred million in June at a $5.3B mark), and now services 250 large US health systems with 80M+ projected patient-clinician conversations this year across 28 languages and 50 specialties. The interesting part, or rather the part worth underwriting against, is the data moat that compounds with every patient visit at a rate no foundation model or new entrant can backfill after the fact.
Health systems, which are historically the slowest enterprise buyers on earth, compressed Abridge's release cycles from quarterly or biannual to monthly. That single observation rewrites the velocity assumption in every healthcare SaaS DCF built in the last five years.
The Three-Act Underwriting Framework
Abridge's structure validates a template that probably extends to other regulated verticals, with the usual caveat that templates tend to work until they don't:
- Save Time (documentation) is the wedge that gets the initial sale to the CMIO.
- Save Money (prior auth, billing) is the expansion that captures the CFO budget, which is where the actual dollars live.
- Save Lives (clinical decision support) is long-dated optionality that the FDA's January 2026 CDS guidance may or may not unlock; treat it as a maybe.
Each act monetizes the same proprietary conversation asset against a different buyer persona in the same account. You sell once and expand against data you accumulated for free as a byproduct of the first product, which is the actual structural advantage.
The Non-Compete Posture Is the Key Insight
Abridge positioned itself as a 'clinical intelligence layer' rather than an EHR replacement, explicitly non-competing with Epic and Cerner, which is the single biggest existential risk for any healthcare AI company and the one Abridge declined to take. The EHR remains the system of record and Abridge sits on top of it, which is now table stakes for any vertical AI deal in regulated industries.
What I'd underwrite for in vertical AI deals: a per-use data flywheel competitors cannot access, at least two monetization vectors beyond the wedge, and an explicit non-compete posture with the dominant system-of-record.
Where the Adjacent Alpha Sits
The ambient documentation category is closed. Three adjacent chapters remain open, in roughly descending order of how much I'd pay to be early:
- Payer-side prior authorization, where Abridge attacks from the provider side and the payer counterparty is greenfield.
- Nursing and short-form clinical workflows, where a thirty-second visit is a different product entirely from a fifteen-minute physician encounter.
- Specialty verticals outside US large systems (veterinary, dental, behavioral health, international markets) that Abridge won't prioritize for 24+ months.
Action items
- Kill or de-prioritize any active deals on direct ambient-scribe competitors targeting US large health systems — redirect pipeline to adjacent workflow layers
- Update vertical AI underwriting framework to require per-use data flywheel, ≥2 monetization vectors, and explicit non-compete with dominant system-of-record
- Build thesis memo on payer-side prior-auth automation with target list of 8-12 startups — schedule meetings within 60 days
Sources:Abridge at $5.3B: the healthcare AI vertical just printed a category winner · Anthropic has an enterprise gap
◆ QUICK HITS
Update: Cerebras closed at $311 (70% first-day pop) vs $89 private round — Eclipse netted 17x, Tiger sitting on $1B paper gain; Modal at $4.5B is the cleanest private read-through for re-marking AI infra books this week
Cerebras printed a seventy percent first-day pop
a16z published GTM thesis: value migrating from system-of-record to system-of-intelligence layer; Lemkin's proof point — 80% fewer seats, 83% higher spend ($12K→$22K), 20+ agents running — is the unit-economics template for consumption-based AI pricing
a16z has published another map of where value accrues in GTM software
DepthFirst claims 10x cost advantage over Anthropic's Mythos on vulnerability discovery (12 bugs in FFmpeg for ~$1K vs ~$10K at Mythos) — autonomous vuln discovery is commoditizing before incumbents finish drawing moats
AI-native security is splitting into two businesses
Anthropic's Mythos became first model to clear both UK AISI attack ranges; NSA (not CISA) reportedly winning access — signals offensive/IC-led procurement, not civilian defensive distribution; budget code forming
Anthropic's Mythos cleared AISI this week
Google Gemini Intelligence embeds autonomous agents directly into Android OS for summer 2026 — any mobile AI assistant startup whose GTM assumes non-native distribution needs a contingency plan before WWDC
Gemini becomes the OS: your agent-layer portfolio just got disintermediated
Fervo Energy IPO popped 33% to $10B+ on AI datacenter power demand — but only 115MW of 658MW pipeline has signed hyperscaler offtake; require contracted capacity as Series B gating criterion for energy-for-AI deals
AI infra capex hits $100B/partnership
DuckDB shipped Quack client-server protocol breaking out of embedded-only — direct threat to Spark/Glue-heavy ETL vendors on sub-TB workloads; ask portco CEOs what % of ARR comes from sub-TB customers
DuckDB goes client-server, 85% agentic-AI readiness gap
Thinking Machines Lab justified $2B raise with 0.40s turn-taking latency vs OpenAI's 1.18s — voice/interaction AI is separating into a distinct investable category from text LLMs with different benchmarks and unit economics
Gemini becomes the OS: your agent-layer portfolio just got disintermediated
◆ Bottom line
The take.
Anthropic renting 220K GPUs from a sworn enemy, ServiceNow blowing its annual Claude budget by May with zero SLA recourse, and Cerebras popping 70% on day one are three datapoints that resolve into one portfolio instruction: the demand is real but the revenue quality is not SaaS-grade, the infrastructure layer is the one getting paid, and every coding-agent wrapper in your book just lost its margin structure with 30 days' notice. Audit portco Claude dependencies before June 15, apply a reversibility discount to any AI ARR without contractual lock-in, and concentrate new capital on the infrastructure and observability layers where the actual constraints live.
Frequently asked
- What does Anthropic leasing GPUs from xAI actually signal about compute scarcity?
- It signals that frontier compute is scarce enough to override strategic enmity. Anthropic took roughly 45% of xAI's Colossus 1 cluster — over 220,000 GPUs — from a CEO its own leadership called 'misanthropic and evil,' after planning for 10x growth and getting 80x. Renting from a declared rival is not equilibrium behavior; it validates durable scarcity and reframes xAI as an infrastructure-plus-distribution play rather than a frontier-model competitor.
- How should the June 15 Claude subscription change affect portfolio company underwriting?
- Stress-test every Claude-dependent portco assuming the 70-90% subscription-to-API arbitrage is permanently gone. Anthropic converted subscriptions into dollar-matched API credit pools, so wrappers like Cline and OpenCode that priced COGS off subsidized tokens just lost 20-40% of runway. The change is days old and most founders haven't flagged it to boards yet — request updated cohort margins before month-end.
- Why is enterprise AI ARR not equivalent to traditional SaaS revenue?
- Because it lacks the contractual scaffolding that makes SaaS revenue sticky. ServiceNow blew its full-year Anthropic budget by May with no SLAs, no per-user telemetry, and no enterprise dashboards — and a National Life Group CIO said Anthropic is 'not great for companies.' Low switching costs, FOMO procurement, and budget overruns discovered at billing time produce reversible revenue. Apply a 20-40% discount to model-layer ARR multiples where SLAs and telemetry are absent.
- Where is the investable alpha in the agent economy if horizontal plays are closing?
- In vertical depth and infrastructure layers incumbents haven't absorbed. With SAP's €100M fund, ServiceNow's Action Fabric, and Notion's platform consolidating horizontal agent-ops, the surviving wedges are agent governance, agent identity and auth (81% bot-detection bypass is its own market), evaluation and observability, and vertical-specific orchestration in regulated industries. The Series A window likely closes in 2-3 quarters as corp dev activates.
- What underwriting criteria does the Abridge outcome set for vertical AI deals?
- Three requirements: a per-use data flywheel competitors cannot backfill, at least two monetization vectors beyond the initial wedge, and an explicit non-compete posture with the dominant system-of-record. Abridge cleared all three — $550M raised in 2025, 250 health systems, 80M+ conversations, sitting on top of Epic and Cerner rather than against them. Deals that don't clear this bar face structural risk from the platform layer above.
◆ Same day, different angle
Read this day as…
◆ Recent in investor
Keep reading.
- SpaceX is pricing June 12 at one-point-seven-five trillion, roughly a hundred times revenue, into the worst tape we have seen for a listing…
- SpaceX is quietly collecting $2.17B/month in AI compute rent from Anthropic and Google — a $26B annualized run-rate that isn't in secondary…
- Anthropic edged OpenAI in enterprise billing on Ramp last week, 34.4 percent to 32.3, in the same week ServiceNow admitted it had burned its…
- ServiceNow burned its full-year Anthropic budget by May, with no SLAs, no per-user telemetry, no enterprise dashboard.
- Anthropic's June 15 pricing change closed the seventy-to-ninety percent subscription arbitrage the third-party Claude tools were quietly run…