Edition 2026-05-30 · read as Investor
AnthropicRents220KGPUsFromxAIasCerebrasPops70%
- Sources
- 36
- Words
- 1,953
- Read
- 10min
Topics Agentic AI AI Capital LLM Inference
◆ The signal
Anthropic leased 220,000 GPUs from Elon Musk's xAI, a sworn enemy, after eighty-times growth broke its infrastructure plan. Cerebras opened up seventy percent on day one at a forty-one-point-seven-billion-dollar market cap, which is either a useful read on scarcity pricing or the same story told twice. The more interesting reading is that xAI just became a landlord, and any book carrying it as a frontier model lab is now carrying something else.
◆ INTELLIGENCE MAP
01 Cerebras Day-1 Print Opens the AI Infra Exit Window
act nowCerebras closed at $311 vs. $89 last private round — a 70% pop yielding $41.7B market cap. Eclipse netted 17x. Tiger sits on $1B paper gain in months. Benchmark broke its own dogma with a $225M SPV. Fervo popped 33% to $10B+ the same week. Every late-stage AI infra mark in the book now has a live public comp.
- Market cap at close
- Eclipse return
- Tiger paper gain
- Fervo debut pop
- Modal new mark
02 Anthropic's Capacity Triage: Leasing From Enemies, Killing Arbitrage
act nowAnthropic leased Colossus 1 (220K GPUs, ~45% of xAI capacity) from Musk — who publicly called them 'evil' — because 80x growth broke their 10x plan. Same week, they converted subscriptions to dollar-matched API credits, killing the 70-90% arbitrage third-party harnesses relied on. October IPO is the forcing function. ServiceNow blew its annual Claude budget by May.
- GPUs leased from xAI
- Growth vs plan
- Harness arbitrage killed
- ServiceNow budget
- Likely IPO
- Planned growth10
- Actual growth80
03 Agent Economy Hits 59% — Platform Wars Begin
monitorVercel's first production AI Gateway index shows agentic workloads at 59% of token volume. Anthropic captures 61% of spend (premium reasoning), Google takes 38% of volume (commodity throughput). SAP deployed €100M into 'Autonomous Enterprise,' ServiceNow shipped headless Action Fabric, and Apple is gating agents through App Store governance. The platform layer is consolidating before pure-plays can define it.
- Anthropic share of $
- Google share of vol
- SAP agent fund
- Agentic workloads
04 AI Security Crosses from Narrative to Federal Procurement
monitorLiteLLM landed on CISA's KEV catalog — the first AI infrastructure component federally flagged as actively exploited. Mythos cleared both AISI attack ranges (first model to do so), with NSA winning access over CISA. DepthFirst claims 10x cost advantage over Mythos on vuln discovery. OpenAI launched Daybreak with 8 incumbent 'partners.' AI security is no longer a feature — it's a budget line.
- LiteLLM status
- Mythos vs Daybreak
- DepthFirst cost edge
- PraisonAI exploit time
- LiteLLM on KEVFirst AI infra on federal exploit catalog
- Mythos clears AISIFirst model to clear both attack ranges
- NSA wins accessIC-led procurement, not civilian
- Daybreak launchesOpenAI enters with 8 incumbent partners
05 Vertical AI Data Moats Validated at $5B+ Scale
backgroundAbridge raised $550M at $5.3B serving 250 health systems with 80M+ annual conversations — a corpus no new entrant can replicate. Health systems compressed release cycles from quarterly to monthly. The wedge-and-expand playbook (documentation → prior auth → clinical decision support) is now the vertical AI investment template. Direct ambient-scribe competitors are uninvestable; adjacent workflow layers remain open.
- Health systems live
- Annual conversations
- 2025 raise total
- Languages supported
- 01Abridge (healthcare)5.3
- 02Modal (inference)4.5
- 03Recursive SI4
- 04Mind Robotics3.4
◆ DEEP DIVES
01 Cerebras Day-1 Reprices the AI Infra Book — And Reveals the SPV Era's Economics
What Happened
Cerebras closed its first trading day at $311 per share against the eighty-nine dollars Tiger Global paid a few months earlier, a seventy percent pop that values the company at roughly $41.7 billion. It is the first material AI-hardware IPO since 2021, and the more interesting story is not the headline mark but the return distribution across the cap table.
Investor Entry Cost Basis Day-1 Outcome Eclipse Capital 2016 Series A + SPVs $146.5M 6% stake worth $2.5B (17x) Tiger Global Sep 2025 at $89 ~$1B ~$1B paper gain (3.5x in months) Benchmark 2016 + $225M SPV $269M (93% late-stage) ~$300M cash, lower multiple The Benchmark datapoint is the one to sit with. The firm most ideologically committed to small, early-only funds broke its own model to raise a $225M SPV. The late-stage add was ninety-three percent of their total cost basis, put roughly $300M of cash to work, and lowered the fund's MOIC. They did it anyway. If Benchmark is running SPVs, the LP conversation about ownership defense has changed permanently.
Why This Matters for Your Book
The de-risking event that actually opened this window was OpenAI's $20B procurement commitment in December 2025. Cerebras had pulled its earlier filing because public-market buyers would not stomach G42 (UAE) customer concentration. The fix was swapping a geopolitically taxed customer for the most credible AI buyer alive. That is the deal.
The gating criterion for any AI-hardware company going public is now a signed hyperscaler or frontier-lab procurement anchor. Technical merit without it is a down-round risk regardless of benchmarks.
Fervo Energy's 33% debut pop to $10B+ on AI data center power demand validates the adjacent category. Together these prints compress the timeline on every late-stage AI infra exit model by 6-12 months.
Cross-Source Tension
Multiple sources flag a contradiction worth pricing. The seventy percent pop is either genuine demand for Nvidia alternatives, or, the more honest version, a deliberately under-allocated book met by a retail bid that turned up because the ticker had been sitting in headlines for two years. The first reading implies the next two or three AI hardware issuers clear at comparable marks. The second implies Cerebras went first and the third issuer gets honest pricing. The working view, probably wrong but worth stating: two more names clear on these terms before selectivity returns.
Customer concentration did not go away. It got better-branded. A single $20B customer is still a single customer, even when its name is OpenAI. Any softening in OpenAI's compute trajectory hits the equity story directly. The seventy percent pop prices none of that.
Action items
- Re-mark all AI infrastructure portfolio comps using Cerebras $41.7B and Fervo $10B+ as public anchors; brief LPs on NAV implications before quarter-end
- Stand up an SPV operating playbook for top 3-5 portfolio winners likely to raise $500M+ rounds in next 12 months
- Pressure-test every AI hardware portco on hyperscaler/foundation-model anchor customer status before the next board cycle
Sources:StrictlyVC · Katie Roof · Martin Peers · The Information AM · Bloomberg Technology
02 Anthropic's Capacity Triage — What Renting From Your Enemy Reveals About the Cycle
The Three Moves That Tell One Story
In one week Anthropic did three things that, taken together, are the same thing said three ways. They leased the entire Colossus 1 cluster — 220K+ GPUs including GB200s — from Elon Musk's xAI, who has on the record called them misanthropic and evil. They collapsed every Claude subscription into a dollar-matched API credit pool, ending the 70-90% arbitrage third-party harnesses had been running on subscription tiers. They hired a CFO with an October IPO on the calendar.
This is not three strategies. It is capacity triage, margin recovery, and IPO preparation in parallel because 80x growth against a 10x plan does not leave time to sequence politely.
What the xAI Lease Tells You
The cleanest tell in compute markets is when rivals rent capacity to declared enemies. That does not happen in a glut.
- xAI is exiting the frontier race. Leasing roughly 45% of current capacity to a competitor means Grok cannot train at the frontier this cycle. No meaningful B2B or B2C revenue, developer mindshare gone to open-source DeepSeek and Qwen, and the parent just became a landlord. Repriced, this is a neocloud plus X-distribution play, not a frontier lab.
- Neocloud pricing power is real. The public-market bears calling an AI capex glut are trading against a private-market reality where Nebius printed 684% YoY Q1 growth with four-plus customers bidding per GPU.
- ServiceNow blowing its annual Claude budget by May is the enterprise version. No SLAs, no per-user telemetry, buyers discovering consumption at invoice time.
When four firms independently decide the margin is in the deployment services rather than the model — Google, OpenAI/Bain, Salesforce, ServiceNow — the margin is probably in the deployment services rather than the model.
The Pricing Change That Kills Wrapper Economics
The subscription-to-API conversion is surgical: a 200-dollar Claude plan now buys exactly 200 dollars of programmatic tokens, and the harnesses sitting on top — Cline, OpenCode, OpenClaw — lost 20-40% of effective runway since Friday. OpenAI's counter of two months of free Codex for enterprise switchers says the rest. Duopoly subsidy war, timed to Anthropic's IPO and OpenAI's enterprise defense.
Any portfolio company whose COGS quietly assumed subsidized third-party Claude access needs an updated gross-margin model this week, not next quarter. The change is four days old. Most founders have not flagged it.
The AI Observability Category Forms in Real Time
This is probably wrong, but the obvious read on ServiceNow's AI Control Tower and the external Claude-monitoring tools emerging via API is a Datadog-scale opportunity in AI FinOps — token-level cost attribution, per-user spend caps, SLA monitoring across model APIs — with a seed-to-Series A window of six to twelve months before an incumbent locks the category. The counter-thesis is that ServiceNow absorbs it. Worth pricing both.
Action items
- Re-underwrite any xAI/Grok exposure (direct secondary, funds with xAI stakes, Grok-dependent apps) as infrastructure + X-distribution plays, not frontier-model plays
- Stress-test every Claude-dependent portco's unit economics assuming 70-90% subscription arbitrage is permanently gone; request updated cohort gross margin by month-end
- Launch a sourcing sprint on AI observability/FinOps at Seed-Series A: token-cost attribution, per-user spend caps, model-API SLA monitoring
- Increase allocation sizing for neocloud / GPU-leasing opportunities (CoreWeave secondary, Crusoe, Nebius, Lambda)
Sources:The Pragmatic Engineer · AINews · Laura Bratton · Martin Peers · TLDR AI
03 59% Agentic — Where Value Accrues Now That Agents Are the Majority Case
The Production Data
Vercel published its first production AI Gateway index this week, which is to say not a benchmark and not a self-reported ARR slide but downstream production data across 200K+ teams, and the findings quietly reframe the stack:
- 59% of token volume is agentic — the chat-completion era is already the minority revenue line
- Anthropic captures 61% of spend on Opus as premium reasoning
- Google captures 38% of volume on Flash as commodity throughput
There are two different businesses inside what we have been calling foundation models, or rather, the more interesting version: Anthropic gets paid a premium for long, tool-using, expensive agent calls, and Google is absorbing cheap high-throughput work that looks excellent in a volume chart and considerably less excellent in a gross margin table.
Platform Consolidation Is Moving Faster Than Startup Timelines
Three incumbent moves landed in the same quarter, which is the sort of clustering that usually means the corp dev calendars are talking to each other. SAP put €100M into an Autonomous Enterprise fund wiring NVIDIA and Microsoft into the platform layer. ServiceNow shipped Action Fabric, decoupling logic from UI and exposing workflows as headless APIs for agents. Apple is building agent governance into the App Store, explicitly blocking agents from spinning up unapproved sub-apps.
The cloud transition was the opposite shape: AWS defined infra, ERPs caught up late, and the gap was the trade. This time the ERPs went first.
Layer Who's Winning Deal Flow Signal Platform (SAP, ServiceNow) Incumbents Corp dev activating; €100M fund is first data point Interop / Infra (MCP, identity, governance) Undefined — greenfield Highest alpha window Agent Orchestration (standalone) Fragmented M&A exits preferred over continued rounds a16z's GTM Thesis: Orchestration Gravity Replaces Data Gravity
a16z planted a flag this week behind the system-of-intelligence layer, and the Stitch check is the position. Lemkin's SaaStr anecdote is the proof point worth chewing on: 10+ human seats cut to 2 humans plus 1 API seat, with spend rising 83% from $12K to $22K and 20+ agents running on top. Seat count collapsed. The bill went up.
This is probably wrong, but the thesis is that the moat has migrated from data living in Salesforce to workflows, reasoning state, and institutional context living in the AI layer, and whether that migration is half-finished or already done is the entire question. The investable window is 12-18 months before incumbents or consensus closes it.
The alpha in agent infra is layer selection, not sector exposure. Runtime SDKs, agent identity, governance, and observability at Series A pricing — before SAP's corp dev team does the sourcing for you.
What Gets Compressed
Notion now hosts Claude and Codex as hosted teammates inside its developer platform, and Airtable is writing $10M of Hyperagent credits, which means standalone agent-orchestration startups face a distribution problem they cannot outspend. The counter-thesis is that workflow incumbents fumble execution, which is possible and has happened before, but it is not the way to bet against SAP and ServiceNow's current shipping cadence. Vertical beats horizontal from here.
Action items
- Source 3-5 agent infrastructure deals in MCP tooling, agent identity, and agent observability before SAP's corp dev does the same work at Series B pricing
- Map pipeline against 'agent-hosting platform' risk — identify which deals get displaced if Notion/Airtable/Cursor absorb the workflow
- Stress-test every mobile AI agent position against Apple's App Store governance — if Gemini Intelligence ships the feature natively in summer 2026, does the business survive?
- Request Vercel AI Gateway production index as a recurring data source; use spend/volume split as a diligence benchmark for model-layer pitches
Sources:ben's bites · a16z · TLDR IT · TLDR · Simplifying AI · TLDR AI
04 AI Security Graduates to Federal Procurement — The Series A Window Is Now
Three Catalysts in One Week
LiteLLM hitting CISA's KEV is the entry that matters most this week, and the other two items below explain why it is not isolated.
- LiteLLM landed on CISA's KEV catalog as the first LLM-routing control plane federally flagged as actively exploited, which gives a CISO the first defensible line item for AI-gateway security inside an existing budget cycle rather than a future one.
- Anthropic's Mythos cleared both UK AISI simulated attack ranges, the first model to do so, with Congress routing access through NSA rather than CISA. That is IC-led offensive procurement, not civilian defense, and the agency distinction is the entire signal.
- DepthFirst's Open Defense Initiative claimed 10x cost efficiency over Mythos on vulnerability discovery: 12 memory corruption bugs in FFmpeg for roughly $1,000 against Anthropic's roughly $10,000 for the same scan.
Add OpenAI's Daybreak launching with Cloudflare, Cisco, CrowdStrike, PANW, Oracle, Zscaler, Akamai, and Fortinet listed as 'partners,' and the shape is familiar: a platform preparing to disintermediate its partners.
The Category Split
Segment Stage Leader/Signal Investor Action AI Gateway Security Category forming (KEV validation) LiteLLM exploit, Ollama CVSS 9.1 Pull forward Series A diligence Autonomous Vuln Discovery Emerging — 10x economics proven DepthFirst vs Mythos Validate benchmark on 2-3 codebases AI-Native Identity Defense Pre-consensus $40B projected 2027 deepfake losses Build target list of 5-8 companies Agentic SOC/GRC Saturating Exaforce, Drata, Teleport in same week Raise bar — demand proprietary data moats EDR Incumbents Moat eroding AI extracts detection logic in days De-rate forward multiples The EDR Moat Is Cracking
TrustedSec ran LLMs at five commercial EDRs and found they share identical building blocks: YARA rules, Lua engines, local ML classifiers. Reverse engineering that took weeks now takes days, which is a sentence written carefully. The detection IP that justified CRWD, PANW, and S multiples is commoditizing, and the Daybreak platform layer is queueing up to collect the value that leaves.
Mozilla validated the harness thesis in the same week: 271 bugs found in Firefox using a custom agentic harness on Claude, versus Mythos finding 1 real CVE in curl with a generic scan. The moat is in orchestration and harness IP, not in model access. Or rather, the more interesting version of the moat is.
AI-security is pricing as an AI feature and transacting as a defense-tech category. The Series B window closes on whichever framing the market agrees to first.
Counterpoint
The bear case deserves airtime and probably more than a paragraph: procurement cycles are slower than capability curves, incumbents absorbed the AI layer in endpoint detection the last time around, NSA briefings are not contracts, and DepthFirst's cost claim is self-reported against one codebase. This view is probably wrong on timing, but the honest version of the call is to fund the category and size to 9-12 month enterprise sales cycles rather than the 3-month POCs being pitched.
Action items
- Pull forward AI-security deal diligence by one quarter — specifically AI-gateway firewalls, model-artifact scanning, and LLM-runtime sandboxing at Series A
- Request DepthFirst data room and validate the 10x cost claim against Mythos with independent benchmarks on 2-3 additional codebases
- Commission 2-week portfolio-wide review of any security portco whose value prop assumes human-speed attacker cadence; flag repricing candidates
- Build target list of 5-8 AI-native identity/deepfake defense companies at Series A/B before the $40B 2027 TAM number enters consensus
Sources:CyberScoop · TLDR InfoSec · The Hacker News · SANS AtRisk · Clint Gibler · The Information AM
◆ QUICK HITS
Update: Cerebras closed at $311 (70% pop) — Eclipse netted 17x, Tiger holds $1B paper gain in months; lock-up expires in ~180 days, mark accordingly
StrictlyVC
Nebius printed 684% Q1 YoY growth ($399M revenue) with $2.47B capex against $2.26B operating cash — neocloud unit economics are structurally cash-negative even at hypergrowth
The Information AM
Anthropic's June 15 third-party credit unbundling means every Claude-wrapper portco's COGS model broke last Friday — demand updated gross margins before month-end
ben's bites
Abridge raised $550M at $5.3B with 250 health systems and 80M annual conversations — the ambient clinical documentation category is closed; adjacent workflow layers remain open
Latent.Space
Fivetran index: only 15% of enterprises have data foundations for agentic AI despite spending millions — data quality and lineage cited as #1 blocker by nearly half
TLDR Data
a16z's Lemkin proof point: 2 human seats + 20 agents, spend up 83% ($12K→$22K) — consumption pricing replacing per-seat is now measurable at named accounts
a16z
Thinking Machines Lab justified its $2B raise with 0.40s turn-taking latency vs. GPT-Realtime 1.18s — voice AI is a separate investable category from text LLMs
Simplifying AI
Clarity Act odds collapsed from 80% to 55% on Polymarket after Armstrong called the draft 'unworkable' — hedge stablecoin-exposed positions into the vote
TLDR Crypto
◆ Bottom line
The take.
Anthropic rented 220,000 GPUs from Elon Musk because 80x growth broke its infrastructure — while Cerebras popped 70% to $41.7B on day one and Vercel data shows agents are now 59% of production token volume. The AI trade has forked into three speeds: the model layer is priced to perfection and killing third-party arbitrage ahead of an October IPO, the infrastructure layer just printed its first real exit comp and scarcity is confirmed by enemies renting to enemies, and the agent stack is consolidating under incumbents before startups can define it. The Series A window in AI observability, agent governance, and AI-native security is measured in quarters, not years.
Frequently asked
- What does Anthropic leasing 220K GPUs from xAI signal about the compute cycle?
- It signals that GPU scarcity is real and durable, not the glut public markets have been pricing. Rivals only rent capacity to declared enemies when they have no alternative — Anthropic's 80x growth broke its plan, and xAI had spare Colossus 1 capacity because Grok cannot compete at the frontier. The read-through: neocloud pricing power (Nebius +684% YoY) is structural, and any 'AI capex glut' short is fighting private-market reality.
- How should I reprice xAI exposure after the lease?
- Re-underwrite xAI as a neocloud-plus-X-distribution business, not a frontier model lab. Leasing roughly 45% of current capacity to a competitor is a concession that Grok cannot train at the frontier this cycle, with no meaningful revenue and developer mindshare lost to DeepSeek and Qwen. The correct comp set is CoreWeave, Crusoe, and Nebius — not Anthropic or OpenAI — and any fund marking xAI as a model lab is carrying the wrong asset.
- What's the immediate portfolio impact of Anthropic killing the Claude subscription arbitrage?
- Any portco whose COGS quietly assumed subsidized third-party Claude access via Cline, OpenCode, or similar harnesses has lost 20-40% of effective runway as of last Friday. The new dollar-matched API credit pool ends the 70-90% arbitrage these wrappers were running. Request updated cohort gross margins by month-end — the change is four days old and most founders have not recalculated.
- Does the Cerebras pop mean the next AI hardware IPOs clear at similar marks?
- Probably for the next two issuers, then selectivity returns. The 70% pop is partly real demand for Nvidia alternatives and partly a deliberately under-allocated book meeting retail bid on a two-year-old story. More importantly, the gating criterion is now a signed hyperscaler or frontier-lab anchor — OpenAI's $20B commitment is what actually unlocked the window, and technical merit without a similar anchor carries down-round risk.
- Where is the alpha in the agent stack if 59% of tokens are already agentic?
- In interop and infrastructure layers — MCP tooling, agent identity, runtime governance, and observability — at Series A pricing before SAP and ServiceNow corp dev teams source the same names at Series B. Platform incumbents went first this cycle (unlike the cloud transition), so standalone orchestration startups face a distribution problem they cannot outspend. Vertical beats horizontal, and the window is 6-12 months.
◆ Same day, different angle
Read this day as…
◆ Recent in investor
Keep reading.
- SpaceX is pricing June 12 at one-point-seven-five trillion, roughly a hundred times revenue, into the worst tape we have seen for a listing…
- SpaceX is quietly collecting $2.17B/month in AI compute rent from Anthropic and Google — a $26B annualized run-rate that isn't in secondary…
- Anthropic edged OpenAI in enterprise billing on Ramp last week, 34.4 percent to 32.3, in the same week ServiceNow admitted it had burned its…
- ServiceNow burned its full-year Anthropic budget by May, with no SLAs, no per-user telemetry, no enterprise dashboard.
- Anthropic's June 15 pricing change closed the seventy-to-ninety percent subscription arbitrage the third-party Claude tools were quietly run…