Investor daily

Edition 2026-05-15 · read as Investor

Anthropic'sxAIDealSignalsThreeInvestorRepricingEvents

Sources
36
Words
1,802
Read
9min

Topics AI Capital Agentic AI LLM Inference

◆ The signal

Anthropic leased 220K GPUs from Elon Musk's xAI — the company its CEO called 'misanthropic and evil' — because 80x growth against a 10x plan broke its compute plan. In the same week, Anthropic killed the 70-90% subscription arbitrage powering Claude wrappers (effective June 15), and ServiceNow disclosed it blew its full-year Anthropic budget by May with zero SLA recourse. Three repricing events landing simultaneously: infrastructure scarcity is real enough to bend strategy, coding-agent portco margins changed last Friday, and 'enterprise AI ARR' in your book may not be SaaS-quality revenue. Audit all three before month-end.

◆ INTELLIGENCE MAP

  1. 01

    Compute Scarcity Forces Nuclear Option: xAI Becomes Anthropic's Landlord

    act now

    Anthropic rented xAI's entire Colossus 1 cluster (220K GPUs incl. GB200s) — ~45% of xAI's capacity — confirming compute shortage severe enough to bend rival strategy. Nebius 684% Q1 growth with capex > operating cash proves neocloud economics are structurally cash-negative. xAI effectively exited the frontier race.

    220K
    GPUs leased from rival
    6
    sources
    • Anthropic growth vs plan
    • Nebius Q1 rev growth
    • Nebius capex vs op cash
    • GPU customers per unit
    1. Anthropic growth80
    2. Planned growth10
    3. Nebius rev growth684
    4. GPU demand ratio4
  2. 02

    Cerebras 70% First-Day Pop Validates SPV Era and Sets Permanent Comps

    monitor

    Cerebras closed at $311 vs $89 private round (70% IPO pop), yielding Eclipse 17x, Tiger ~$1B paper gain. Benchmark broke its own model with a $225M SPV to defend ownership — early-stage purity is dead in AI. Every late-stage AI infra mark re-rates against this print this week.

    17x
    Eclipse return on Cerebras
    4
    sources
    • First-day close
    • Last private price
    • Fully diluted mkt cap
    • Benchmark SPV size
    1. Eclipse (2016 entry)17
    2. Tiger (Sep'25 entry)3.5
    3. Benchmark (mixed)1.1
  3. 03

    Enterprise AI Revenue ≠ SaaS Revenue: The Quality Crisis Emerges

    act now

    ServiceNow exhausted its full-year Anthropic budget by May — no SLAs, no per-user telemetry, no enterprise dashboard. Google, OpenAI/Bain, and Salesforce all hiring hundreds of FDEs confirms deployment is the bottleneck. AI observability/FinOps is forming as a Datadog-scale category with no winner yet.

    5
    sources
    • Budget blown by
    • FDE orgs hiring
    • Modal valuation
    • AI readiness gap
    1. Enterprise SaaS (typical)95
    2. Enterprise AI (Anthropic)20
  4. 04

    Agent Stack Hits Majority: 59% of Tokens and a June 15 Margin Event

    monitor

    Vercel production data shows 59% of token volume is agentic. Spend/volume split: Anthropic takes 61% of dollars, Google takes 38% of volume — two different businesses inside 'foundation models.' Anthropic's June 15 credit conversion kills the 70-90% wrapper arbitrage. SAP committed €100M to autonomous enterprise.

    59%
    tokens now agentic
    6
    sources
    • Anthropic share of $
    • Google share of volume
    • SAP agent fund
    • Wrapper margin loss
    1. Agentic workloads59
    2. Chat/completion41
  5. 05

    Vertical AI Data Moats: Abridge's $5.3B Prints the Category Template

    background

    Abridge raised $550M at $5.3B servicing 250 health systems with 80M+ annual conversations — a corpus no entrant can replicate. Health systems compressed release cycles from quarterly to monthly. The 'save time → save money → save lives' wedge-and-expand is now the vertical AI underwriting template.

    $5.3B
    Abridge valuation
    2
    sources
    • Health systems live
    • Annual conversations
    • 2025 capital raised
    • Specialties covered
    1. Wedge: Save TimeClinical docs — won
    2. Expand: Save MoneyPrior auth — active
    3. Expand: Save LivesCDS — FDA unlocked Jan'26

◆ DEEP DIVES

  1. 01

    Anthropic Rents from the Enemy: Compute Scarcity Just Bent Rival Strategy

    The Strategic Concession

    Anthropic has leased the entire Colossus 1 cluster, 220,000-plus NVIDIA GPUs including GB200s, from Elon Musk's xAI. That is roughly forty-five percent of xAI's compute, rented to the company Musk publicly called "misanthropic and evil." Renting compute to a declared enemy is what a shortage bending strategy looks like. It is not what equilibrium looks like.

    Dario Amodei has now conceded that Anthropic planned for 10x growth and got 80x. The quiet Claude Code nerfs and mid-trial Pro revocations were not strategy. They were capacity triage. This is the demand signal every neocloud thesis has been pricing in, arriving with named counterparties attached.


    xAI's Retreat Is the Under-Discussed Signal

    Renting forty-five percent of your compute to a competitor is, whatever else one calls it, a concession in the frontier race. xAI has no meaningful B2B or B2C traction, trails DeepSeek and Qwen in developer surveys, and its parent just became a landlord. The honest comp set is now neoclouds plus X-distribution, not Anthropic or OpenAI.

    Any fund or secondary offering xAI at frontier-lab comps is mis-marked. Grok has no revenue, lost developer mindshare, and its compute is generating more value as rental income than as training runs.

    Neocloud Economics: Real Demand, Ugly Unit Economics

    Nebius is the public benchmark, or rather the only public benchmark we have. 684% Q1 revenue growth on $399M, guiding to three to three-point-four billion for 2026, with four-plus customers bidding per GPU. The catch sits on the cash statement: $2.47B of capex against $2.26B in operating cash flow. Structurally cash-negative, growing extraordinarily.

    This is probably wrong, but: private neocloud rounds at eight to ten times forward revenue should compress to two to three times once the market accepts that capex exceeds operating cash on a permanent basis. The demand is real. The unit economics are not venture-returnable at the current entry multiple unless the exit is a strategic acquisition.

    The 'Compute Glut' Narrative Is Dead

    Public-market bears citing overcapacity are trading against a private market in which frontier labs rent from sworn enemies. The gap between the two is the arbitrage. The trade is picking the right layer of the infra stack, not blanket exposure.

    Action items

    • Re-underwrite all xAI/Grok exposure as infrastructure + X-distribution plays, not frontier-model labs — submit revised marks to IC by Friday
    • Increase allocation sizing on neocloud/GPU-leasing positions (CoreWeave secondary, Crusoe, Nebius, Lambda) with long-duration frontier-lab contracts
    • Compress neocloud private multiples from 8-10x to 2-3x forward revenue in any active term sheet, citing Nebius capex > operating cash as public comp

    Sources:Anthropic's 80x growth broke its infra · AI infra capex hits $100B/partnership · Anthropic at $900B + Nebius +684% · AI compute is sold out 4:1

  2. 02

    Enterprise AI Revenue Is Not SaaS Revenue — And the Market Just Found Out

    The ServiceNow Wake-Up Call

    ServiceNow, which is as sophisticated an enterprise software buyer as exists on this planet, blew through its full-year Anthropic budget by May 2026. Not because Claude underdelivered. Because Anthropic provides no granular per-user or per-tool usage telemetry, no enterprise dashboards, and no SLAs worth the name. National Life Group's CIO put it without ornament: Anthropic is 'great for consumer usage but not great for companies.'

    This is probably wrong, but: the market is being asked to value Anthropic at $900B+ on the premise that enterprise revenue justifies the number, and consumer-grade plumbing does not usually justify enterprise-grade multiples. The revenue is real. The quality of that revenue — or rather, its stickiness, its predictability, and whatever contractual protection sits underneath it — is a different question, and the answer is the one that matters.


    The FDE Consensus: Deployment Is the Bottleneck

    Four organizations independently concluded the margin lives in deployment services this quarter, which is the kind of coincidence that usually isn't:

    • Google Cloud is hiring hundreds of forward-deployed engineers
    • OpenAI/Bain stood up DeployCo and acquired a consulting firm for its 150-FDE roster
    • Salesforce is staffing the same function
    • ServiceNow built AI Control Tower to sell into the problem it discovered internally
    When four firms independently conclude the margin is in deployment services rather than the model, the margin is probably in deployment services rather than the model.

    The AI Observability Category Is Forming Now

    ServiceNow's response is the instructive part: it built AI Control Tower and is now selling it to other customers panicking about their Anthropic bills. A CDIO watched the problem form inside her own P&L and decided to be the vendor rather than the victim, which is the Datadog playbook with the serial numbers barely filed off.

    The category ingredients are all present: token-level cost attribution, per-user spend caps, SLA monitoring across model APIs, and budget governance. No independent winner exists yet. The 6-12 month sourcing window is open, and what the incumbents are not doing during that window is shipping the missing telemetry themselves.

    What This Means for LLM-Layer Marks

    Enterprise AI spend is reversible in ways traditional SaaS is not. Low switching costs, no contractual lock-in (because no SLAs), FOMO-driven procurement, and budget overruns discovered only at billing time produce a cliff-shaped risk profile rather than a curve. Apply a 20-40% 'reversibility discount' to any model-layer ARR multiple where SLAs and telemetry are absent. The thesis could be wrong if Anthropic ships enterprise controls in the next two quarters, or if procurement teams decide they don't care. Neither has happened yet.

    Action items

    • Demand SLA and usage-telemetry roadmap from every model-layer portco claiming enterprise ARR — submit findings to IC within 2 weeks
    • Launch a sourcing sprint on AI observability/FinOps for AI/token-cost-attribution at Seed to Series A — target 5 meetings within 30 days
    • Apply a 20-40% reversibility discount to any LLM-layer ARR multiple in your book where SLAs, telemetry, or contractual lock-in are absent
    • Build Palantir-alumni sourcing pipeline for deployment-services founders — 3-5 conversations before end of quarter

    Sources:Anthropic has an enterprise gap · Anthropic's 80x growth broke its infra · DuckDB goes client-server, 85% agentic-AI readiness gap · a16z has published another map

  3. 03

    The Agent Economy Hit Majority — And a June 15 Margin Event Is 30 Days Out

    59% Is the Base Rate, Not the Upside Case

    Vercel published what it calls the first production-grade index of real AI usage across more than 200,000 teams, and the headline number is that agentic workloads now carry 59% of all token volume. That is not a leading indicator. It is the majority case in production, which makes the chat-completion era a minority revenue line and most of the slideware from eighteen months ago retrospectively wrong.

    The more interesting cut is the spend/volume bifurcation. Anthropic captures sixty-one percent of dollars on premium reasoning via Opus. Google moves thirty-eight percent of volume on commodity throughput via Flash. Two different businesses are now legible inside what we have been calling 'foundation models,' which validates multi-model routing as the enterprise default and retires the single-model moat thesis.


    The June 15 Margin Event

    Anthropic converted every Claude subscription into a dollar-matched API credit pool. A two-hundred-dollar plan now buys exactly two hundred dollars of programmatic tokens, and the seventy-to-ninety percent arbitrage that Cline, OpenCode and dozens of similar harnesses were running against subscription pricing is gone. OpenAI answered in hours with two months of free Codex for enterprise switchers.

    Combine that with Claude Code's /goal mode absorbing the autonomous coding category and Notion's External Agents API commoditizing the interface, and the result is a two-sided squeeze on anyone whose business model was thin-wrapping a frontier model.

    Every Claude-dependent portco that priced COGS off subsidized subscription tokens lost twenty to forty percent of runway last Friday. The change is four days old. Most founders have not flagged it to their boards yet.

    Consolidation Accelerating from Above

    The platform layer is moving to own the agent surface, and the inventory looks like this:

    PlatformMoveWhat It Absorbs
    SAP€100M Autonomous Enterprise fund + NVIDIA/MSFTAgent orchestration startups
    ServiceNowAction Fabric — headless workflows via MCPStandalone workflow automation
    NotionDeveloper Platform + Claude/Codex as teammatesAgent-hosting pure plays
    Airtable$10M Hyperagent credit programAgent-native distribution

    The pattern is consistent. Incumbents are consolidating the autonomous-enterprise category before pure-play agent startups can define it. The cloud transition ran the other direction. This time ERPs and workflow platforms moved first, which means the obvious horizontal land grab is mostly closed.

    Where the Alpha Still Lives

    This is probably wrong in one or two of these wedges, but vertical beats horizontal in agent ops from here. The investable surface is agent governance (the gap SAP and Levie keep flagging), agent identity/auth (eighty-one percent bot-detection bypass is a market by itself), evaluation and observability (the Haiku-as-judge pattern is now canonical), and vertical-specific orchestration in regulated industries. The horizontal seat at this table is taken.

    Action items

    • Stress-test every Claude-dependent portco's gross margin assuming the 70-90% subscription arbitrage is permanently gone — request updated cohort margins by May 30
    • Source 3-5 agent infrastructure deals in MCP governance, agent identity, and evaluation tooling at Seed/Series A pricing — before SAP's corp dev team prices you out
    • Map pipeline against 'agent-hosting platform' displacement risk — flag any deal that gets absorbed if Notion/Airtable/Cursor ships the feature natively
    • Request Vercel AI Gateway production index as recurring data source and use spend/volume split as diligence benchmark for every model-layer pitch

    Sources:Vercel's first production AI Gateway index · Anthropic is squeezing its pre-IPO round · SAP wrote a check for one hundred million euros · a16z has published another map · Anthropic shipped Claude Code

  4. 04

    Vertical AI Moats Are Built from Data, Not Models: The Abridge Template

    The Category Winner Proof Point

    Abridge raised $550M in 2025 alone (two hundred fifty million early in the year, three hundred million in June at a $5.3B mark), and now services 250 large US health systems with 80M+ projected patient-clinician conversations this year across 28 languages and 50 specialties. The interesting part, or rather the part worth underwriting against, is the data moat that compounds with every patient visit at a rate no foundation model or new entrant can backfill after the fact.

    Health systems, which are historically the slowest enterprise buyers on earth, compressed Abridge's release cycles from quarterly or biannual to monthly. That single observation rewrites the velocity assumption in every healthcare SaaS DCF built in the last five years.


    The Three-Act Underwriting Framework

    Abridge's structure validates a template that probably extends to other regulated verticals, with the usual caveat that templates tend to work until they don't:

    1. Save Time (documentation) is the wedge that gets the initial sale to the CMIO.
    2. Save Money (prior auth, billing) is the expansion that captures the CFO budget, which is where the actual dollars live.
    3. Save Lives (clinical decision support) is long-dated optionality that the FDA's January 2026 CDS guidance may or may not unlock; treat it as a maybe.

    Each act monetizes the same proprietary conversation asset against a different buyer persona in the same account. You sell once and expand against data you accumulated for free as a byproduct of the first product, which is the actual structural advantage.

    The Non-Compete Posture Is the Key Insight

    Abridge positioned itself as a 'clinical intelligence layer' rather than an EHR replacement, explicitly non-competing with Epic and Cerner, which is the single biggest existential risk for any healthcare AI company and the one Abridge declined to take. The EHR remains the system of record and Abridge sits on top of it, which is now table stakes for any vertical AI deal in regulated industries.

    What I'd underwrite for in vertical AI deals: a per-use data flywheel competitors cannot access, at least two monetization vectors beyond the wedge, and an explicit non-compete posture with the dominant system-of-record.

    Where the Adjacent Alpha Sits

    The ambient documentation category is closed. Three adjacent chapters remain open, in roughly descending order of how much I'd pay to be early:

    • Payer-side prior authorization, where Abridge attacks from the provider side and the payer counterparty is greenfield.
    • Nursing and short-form clinical workflows, where a thirty-second visit is a different product entirely from a fifteen-minute physician encounter.
    • Specialty verticals outside US large systems (veterinary, dental, behavioral health, international markets) that Abridge won't prioritize for 24+ months.

    Action items

    • Kill or de-prioritize any active deals on direct ambient-scribe competitors targeting US large health systems — redirect pipeline to adjacent workflow layers
    • Update vertical AI underwriting framework to require per-use data flywheel, ≥2 monetization vectors, and explicit non-compete with dominant system-of-record
    • Build thesis memo on payer-side prior-auth automation with target list of 8-12 startups — schedule meetings within 60 days

    Sources:Abridge at $5.3B: the healthcare AI vertical just printed a category winner · Anthropic has an enterprise gap

◆ QUICK HITS

  • Update: Cerebras closed at $311 (70% first-day pop) vs $89 private round — Eclipse netted 17x, Tiger sitting on $1B paper gain; Modal at $4.5B is the cleanest private read-through for re-marking AI infra books this week

    Cerebras printed a seventy percent first-day pop

  • a16z published GTM thesis: value migrating from system-of-record to system-of-intelligence layer; Lemkin's proof point — 80% fewer seats, 83% higher spend ($12K→$22K), 20+ agents running — is the unit-economics template for consumption-based AI pricing

    a16z has published another map of where value accrues in GTM software

  • DepthFirst claims 10x cost advantage over Anthropic's Mythos on vulnerability discovery (12 bugs in FFmpeg for ~$1K vs ~$10K at Mythos) — autonomous vuln discovery is commoditizing before incumbents finish drawing moats

    AI-native security is splitting into two businesses

  • Anthropic's Mythos became first model to clear both UK AISI attack ranges; NSA (not CISA) reportedly winning access — signals offensive/IC-led procurement, not civilian defensive distribution; budget code forming

    Anthropic's Mythos cleared AISI this week

  • Google Gemini Intelligence embeds autonomous agents directly into Android OS for summer 2026 — any mobile AI assistant startup whose GTM assumes non-native distribution needs a contingency plan before WWDC

    Gemini becomes the OS: your agent-layer portfolio just got disintermediated

  • Fervo Energy IPO popped 33% to $10B+ on AI datacenter power demand — but only 115MW of 658MW pipeline has signed hyperscaler offtake; require contracted capacity as Series B gating criterion for energy-for-AI deals

    AI infra capex hits $100B/partnership

  • DuckDB shipped Quack client-server protocol breaking out of embedded-only — direct threat to Spark/Glue-heavy ETL vendors on sub-TB workloads; ask portco CEOs what % of ARR comes from sub-TB customers

    DuckDB goes client-server, 85% agentic-AI readiness gap

  • Thinking Machines Lab justified $2B raise with 0.40s turn-taking latency vs OpenAI's 1.18s — voice/interaction AI is separating into a distinct investable category from text LLMs with different benchmarks and unit economics

    Gemini becomes the OS: your agent-layer portfolio just got disintermediated

◆ Bottom line

The take.

Anthropic renting 220K GPUs from a sworn enemy, ServiceNow blowing its annual Claude budget by May with zero SLA recourse, and Cerebras popping 70% on day one are three datapoints that resolve into one portfolio instruction: the demand is real but the revenue quality is not SaaS-grade, the infrastructure layer is the one getting paid, and every coding-agent wrapper in your book just lost its margin structure with 30 days' notice. Audit portco Claude dependencies before June 15, apply a reversibility discount to any AI ARR without contractual lock-in, and concentrate new capital on the infrastructure and observability layers where the actual constraints live.

— Promit, reading as Investor ·

Frequently asked

What does Anthropic leasing GPUs from xAI actually signal about compute scarcity?
It signals that frontier compute is scarce enough to override strategic enmity. Anthropic took roughly 45% of xAI's Colossus 1 cluster — over 220,000 GPUs — from a CEO its own leadership called 'misanthropic and evil,' after planning for 10x growth and getting 80x. Renting from a declared rival is not equilibrium behavior; it validates durable scarcity and reframes xAI as an infrastructure-plus-distribution play rather than a frontier-model competitor.
How should the June 15 Claude subscription change affect portfolio company underwriting?
Stress-test every Claude-dependent portco assuming the 70-90% subscription-to-API arbitrage is permanently gone. Anthropic converted subscriptions into dollar-matched API credit pools, so wrappers like Cline and OpenCode that priced COGS off subsidized tokens just lost 20-40% of runway. The change is days old and most founders haven't flagged it to boards yet — request updated cohort margins before month-end.
Why is enterprise AI ARR not equivalent to traditional SaaS revenue?
Because it lacks the contractual scaffolding that makes SaaS revenue sticky. ServiceNow blew its full-year Anthropic budget by May with no SLAs, no per-user telemetry, and no enterprise dashboards — and a National Life Group CIO said Anthropic is 'not great for companies.' Low switching costs, FOMO procurement, and budget overruns discovered at billing time produce reversible revenue. Apply a 20-40% discount to model-layer ARR multiples where SLAs and telemetry are absent.
Where is the investable alpha in the agent economy if horizontal plays are closing?
In vertical depth and infrastructure layers incumbents haven't absorbed. With SAP's €100M fund, ServiceNow's Action Fabric, and Notion's platform consolidating horizontal agent-ops, the surviving wedges are agent governance, agent identity and auth (81% bot-detection bypass is its own market), evaluation and observability, and vertical-specific orchestration in regulated industries. The Series A window likely closes in 2-3 quarters as corp dev activates.
What underwriting criteria does the Abridge outcome set for vertical AI deals?
Three requirements: a per-use data flywheel competitors cannot backfill, at least two monetization vectors beyond the initial wedge, and an explicit non-compete posture with the dominant system-of-record. Abridge cleared all three — $550M raised in 2025, 250 health systems, 80M+ conversations, sitting on top of Epic and Cerner rather than against them. Deals that don't clear this bar face structural risk from the platform layer above.

◆ Same day, different angle

Read this day as…

◆ Recent in investor

Keep reading.