Why did ServiceNow exhaust its full-year Anthropic budget by May?

ServiceNow burned through its budget because there are no SLAs, no per-user telemetry, and no enterprise dashboard to govern Claude consumption. Without contractual lock-in or usage controls, even sophisticated buyers can't pace spend, which means Anthropic's enterprise ARR is structurally reversible at near-zero switching cost.

What does Anthropic renting 220,000 GPUs from xAI actually signal?

It signals a capacity crisis, not a strategic alliance. Anthropic planned for 10x growth and got 80x, forcing it to lease Colossus 1 from Elon Musk — who publicly called the company 'misanthropic and evil.' You don't rent from a declared enemy unless the plumbing has broken and there are no other options at scale.

Where should investors look for alpha if the model layer is overpriced?

One layer down: AI observability, FinOps for AI, token-cost attribution, and AI-native security harnesses. ServiceNow is already selling an AI Control Tower to the same CDIOs panicking over Claude bills, Modal is raising at $4.5B, and LiteLLM hitting CISA's KEV catalog has converted AI security into a budget line.

How should the 59% agentic token-volume figure change deal underwriting?

It invalidates any thesis priced on chat-completion as the base case. Agent workloads dominate production, with Anthropic capturing 61% of spend on tool-heavy reasoning and Google taking 38% of volume on commodity throughput — two different businesses that shouldn't share a multiple. Standalone orchestration startups face absorption risk; neutral infrastructure and vertical integrators with orchestration gravity are the survivors.

What makes Abridge the template for vertical AI underwriting?

Abridge satisfies three non-negotiables: a per-use data flywheel (80M+ annual patient-clinician conversations), at least two monetization vectors across buyer personas (documentation, prior auth, decision support), and an explicit non-compete posture toward the dominant system of record by treating Epic/Cerner as a filesystem rather than a replacement target. At a $5.3B mark with 250 health systems live, it's the comp benchmark for every new vertical AI memo.

Edition 2026-06-04 · read as Investor

Anthropic'sEnterpriseLeadRunsonConsumer-GradePlumbing

Sources: 36
Words: 1,667
Read: 8min

Topics AI Capital Agentic AI LLM Inference

◆ The signal

ServiceNow burned its full-year Anthropic budget by May, with no SLAs, no per-user telemetry, no enterprise dashboard. Ramp's data the same week put Anthropic ahead of OpenAI in enterprise billing share, 34.4 to 32.3, and Anthropic rented 220,000 GPUs from Musk's xAI because eighty-times growth broke the plumbing. The new enterprise leader is selling thirty billion dollars of ARR on consumer-grade infrastructure. The alpha, probably, is one layer down.

◆ INTELLIGENCE MAP

01
Enterprise AI Revenue Quality Crisis
act now
Anthropic passed OpenAI on Ramp billing (34.4% vs 32.3%) but ServiceNow exhausted its annual Claude budget by May due to zero usage telemetry. No SLAs, no granular dashboards, CIOs calling it 'not great for companies.' Revenue is real but reversible — treat model-layer ARR as consumer-grade until proven otherwise.
34.4%
Anthropic enterprise share
12
sources
- Anthropic share
- OpenAI share
- Anthropic growth vs plan
- GPUs rented from xAI
1. Anthropic34.4
2. OpenAI32.3
3. Google15
4. Other18.3
02
Agent Economy Crosses Production Majority
monitor
Vercel's production AI Gateway shows 59% of token volume is now agentic workloads. Spend/volume bifurcation is the tell: Anthropic captures 61% of spend (expensive reasoning calls), Google captures 38% of volume (commodity throughput). a16z publicly bet on 'system of intelligence' over system of record with Stitch investment. SAP committed €100M.
59%
agentic token share
6
sources
- Agentic token volume
- Anthropic spend share
- Google volume share
- SAP autonomous fund
1. Agentic workloads59
2. Chat/completion41
03
AI Security Splits: LiteLLM Hits KEV, Daybreak Absorbs Incumbents
monitor
LiteLLM became the first AI-infrastructure component on CISA's Known Exploited Vulnerabilities list. DepthFirst claims 10x cost advantage over Anthropic's Mythos on vulnerability discovery. OpenAI launched Daybreak with 8 incumbent security 'partners' — the pre-disintermediation posture. Category is forming: fund the runtime/harness layer, not generic wrappers.
10x
cost advantage claimed
6
sources
- LiteLLM status
- MDASH CVEs shipped
- Daybreak partners
- Exploit window
1. DepthFirst (12 bugs)1
2. Mythos (0 bugs)10
04
GTM Software: Orchestration Gravity Replaces Data Gravity
background
a16z's Stitch investment codifies the thesis: AI workflow orchestration captures more value than the CRM underneath it. Lemkin proof point: 2 human seats replacing 10+, spend up 83% ($12K→$22K), 20+ agents running on top. Seat-based SaaS isn't dying — it's being replaced by higher-ARPU consumption. Window to own the intelligence layer is 12-18 months.
83%
spend increase per account
4
sources
- Human seats cut
- Spend increase
- Agents deployed
- Salesforce TAM
1. Before (10 seats)12
2. After (2 seats + agents)22
05
AI Infra IPO Tape: First-Day Pops Set Permanent Comps
monitor
Cerebras closed at $311 vs $89 private round (70% first-day pop, ~$42B market cap). Eclipse netted 17x. Fervo Energy +33% to $10B+. Benchmark broke its own model with a $225M SPV. Late-stage AI infra marks must re-rate this week — but 70% pops almost always retrace, and Tiger's $1B paper gain is locked for 180 days.
17x
Eclipse return on Cerebras
5
sources
- Cerebras first-day pop
- Cerebras market cap
- Fervo debut pop
- Benchmark SPV
1. Eclipse (2016 entry)17
2. Tiger (Sep '25 entry)3.5
3. Benchmark (late-stage)1.1

◆ DEEP DIVES

01
Enterprise AI Revenue Is Not SaaS Revenue — And the Market Just Proved It
The Revenue Quality Problem Nobody Is Marking
ServiceNow, which is one of the most sophisticated enterprise software buyers on earth, exhausted its full-year Anthropic budget by May 2026. Not because Claude underperformed, but because there is no granular per-user, per-tool telemetry and no SLAs worth the name. National Life Group's CIO put it less politely: Anthropic is 'great for consumer usage but not great for companies.'
This matters because Anthropic just passed OpenAI on Ramp's billing data, 34.4% vs 32.3%, the first documented leadership change in enterprise AI, and over the trailing year Anthropic quadrupled business adoption while OpenAI grew 0.3%. The revenue underneath that lead is reversible at near-zero switching cost.
The Capacity Crisis Beneath the Revenue
Anthropic planned for 10x growth and got 80x, which is a forecasting error, or rather the more interesting version, a forecasting error that bends strategy. Hence the silent Claude Code nerfs, the mid-trial Pro revocations, the corporate account bans, and then the unthinkable: Anthropic leased xAI's entire Colossus 1 cluster (220K+ GPUs) from Elon Musk, the person who publicly called them 'misanthropic and evil.' When you rent from a declared enemy, you are not making a strategic choice. You are short capacity.
At the same time, every subscription became a dollar-matched API credit pool, so a $200 plan now buys $200 of programmatic tokens, which closes the 70-90% arbitrage that Cline and OpenCode had been running on the harness side. This is margin recovery timed to an October IPO, with a new CFO already in the chair.
Enterprise AI ARR is not SaaS ARR. It has no SLAs, no lock-in, no telemetry, and customers blow annual budgets by Q2. Price it accordingly.
The Forward-Deployed-Engineer Consensus
Four firms independently conceded the same point this week: deployment, not model quality, is the bottleneck. Google Cloud is hiring hundreds of forward-deployed engineers, OpenAI stood up DeployCo with Bain, and Salesforce and ServiceNow are staffing the same function. The Palantir playbook is now consensus, which means the interesting category is one layer over, in the AI observability and FinOps layer forming in real time.
This is probably wrong, but: ServiceNow is already selling an AI Control Tower to the same customers panicking about their Anthropic bills, and Modal is raising at $4.5B. The next Datadog-scale category lives wherever CDIOs are both buying and building the same tool.
Action items
- Demand SLA + usage-telemetry roadmap from every model-layer portco claiming enterprise ARR; apply 20-40% reversibility discount where absent
- Launch sourcing sprint on AI observability / FinOps-for-AI / token-cost-attribution at Seed to Series A within 30 days
- Stress-test every Claude-dependent portco's gross margin assuming subscription arbitrage is permanently gone
- Map FDE-layer investment targets sourced from Palantir alumni network before PE-backed DeployCo bids them away
Sources:Laura Bratton · The Pragmatic Engineer · AINews · TLDR AI · Morning Brew · The Hustle
02
59% Agentic — The Token Economy Just Flipped and Your Pipeline Is Priced to the Old One
The First Production Data
Vercel finally published the first production-grade AI Gateway index across more than two hundred thousand teams, and the headline that ought to rewrite a few AI memos is that 59% of all token volume is now agentic workloads. Chat completion, which is what most of these decks were sold on, is now the minority case in production. The agent thesis is not arriving. It is most of what is running.
The split between spend and volume is the more interesting puzzle. Anthropic captures 61% of spend on long, tool-heavy reasoning calls, while Google captures 38% of volume on the cheap, high-throughput commodity work. Two different businesses are sitting inside the phrase 'foundation models,' and they do not deserve the same multiple.
The Platform Land Grab
Three incumbent moves landed in the same quarter, all pointed roughly the same way:
- Anthropic's June 15 third-party credit unbundling — ends subsidized experimentation through downstream tools
- Notion's Developer Platform — Claude, Codex, Cursor, and Devin as hosted teammates inside the workspace
- SAP's €100M Autonomous Enterprise Fund — wired into NVIDIA and Microsoft at the platform layer
This is probably wrong, but the read is that standalone agent-orchestration startups of the LangChain or CrewAI class face absorption risk inside two to three quarters. The counter-thesis is that orchestration is hard enough that incumbents buy rather than build. Either way, the value migrates to neutral infrastructure doing multi-model routing, observability and eval, or to vertical integrators sitting on proprietary workflow data. Standalone orchestration is the layer that gets squeezed.
Agent workloads are 59% of production token volume. The winners are neutral infrastructure and vertical integrators, not standalone orchestration layers.
a16z's GTM Thesis Sets the Clock
a16z has now publicly declared that the 'system of intelligence' layer captures more value than the system of record beneath it, and put the Stitch cheque behind the claim. The Lemkin number makes the pitch concrete, or rather makes the consumption-based GTM legible in one customer: 2 human seats replacing 10+, spend up 83% from $12K to $22K, 20+ agents running on top. Fewer seats, higher ARPU, more compute. That is the deck.
What the actor is not doing, while spending here, is funding horizontal AI CRM wrappers, which is the trade that has already started compressing. The moat has moved from data gravity to orchestration gravity: institutional context, workflow memory, multi-system synthesis. The investable window is twelve to eighteen months before incumbents or consensus close it, and the thesis has been right for several quarters and could be wrong by the next earnings cycle. The alpha, if there is any left, sits in vertical-specific orchestration with measurable outputs.
Action items
- Request Vercel AI Gateway production index as a recurring data source; use spend/volume split as a diligence benchmark for every model-layer pitch
- Re-underwrite Claude-dependent developer tools before June 15 credit unbundling — model the API-rate fallback gross margin scenario
- Rerank AI-GTM pipeline by orchestration moat depth — prioritize narrow high-frequency workflows with accumulating institutional context over horizontal copilots
- Exit or hedge horizontal agent-ops exposure; vertical beats horizontal from here
Sources:ben's bites · a16z · TLDR IT · TLDR · AINews

AI Security Becomes a Budget Line: LiteLLM on KEV, Daybreak Absorbs Partners, DepthFirst Claims 10x

The Category-Formation Event

LiteLLM is the first AI-infrastructure component to land on CISA's Known Exploited Vulnerabilities catalog, which is interesting less as a vulnerability story than as the regulatory-validation moment that converts 'AI security' from pitch deck to enterprise budget line. In the same cycle Ollama shipped a CVSS 9.1 model-loader bug enabling data exfiltration via malicious GGUF files, and PraisonAI's auth-bypass was weaponized within four hours of disclosure. Four hours is not a window. It is a confession.

The AI-infra stack is sitting roughly where enterprise SaaS sat in 2014 on security maturity, which is to say: not great. Every dollar of AI-infra ARR is creating latent demand for AI-sec ARR, and the Series A window for runtime-layer winners is open now. It will not stay open long.

The Economics Split

Two models are emerging and the middle gets squeezed, which is the usual story but worth watching anyway.

Approach	Economics	Evidence
Specialized harness (DepthFirst)	~$1K for 12 memory corruption bugs in FFmpeg	10x claimed advantage; anchor logos landing
Frontier model (Anthropic Mythos)	~$10K for 0 bugs across hundreds of scans	But cleared both AISI attack ranges — capability lead
Platform absorption (OpenAI Daybreak)	Bundled with GPT-5.5 + Codex Security	Launched with 8 incumbent 'partners' — pre-disintermediation posture

The Daybreak partner list — Cloudflare, Cisco, CrowdStrike, PANW, Oracle, Zscaler, Akamai, Fortinet — is the tell. When a platform launches with every major incumbent listed as a friend, what you are looking at is pre-disintermediation rather than partnership. Cloudflare cutting twenty percent in the same week is one of those partners already reading the wind.

Cyber-AI is pricing as an AI feature and transacting as a defense-tech category. The Series B window closes on whichever framing the market agrees to first.

The EDR Moat is Cracking

TrustedSec ran LLMs at five commercial EDRs. Reverse engineering that previously took weeks now takes days, and all five products rest on the same architectural furniture — YARA rules, Lua engines, local ML classifiers. The detection IP that justified CRWD and PANW multiples is commoditizing in public. Microsoft's MDASH, meanwhile, shipped 16 validated Windows CVEs in a single Patch Tuesday, which is the first production evidence that autonomous vuln discovery works at hyperscaler scale.

Mozilla found 271 bugs in Firefox with a custom agentic harness on top of Claude Opus 4.6. Anthropic's Mythos found 1 real CVE in curl with a generic scan. The moat is in the harness, not the model. Fund accordingly.

Action items

Run portfolio-wide exposure sweep for LiteLLM (1.81.16–1.83.7), Traefik, Argo CD, MOVEit, PAN-OS, and Ivanti EPMM — escalate any internet-exposed instances this week
Open diligence on DepthFirst and map the AI-native AppSec harness category — apply the 'Mozilla-vs-curl test' to every inbound AI security pitch
Build target list of 5-8 AI-native identity/deepfake defense companies at Series A/B before the $40B 2027 TAM number enters consensus pricing
Add OpenAI Daybreak to competitive tracker for every security portfolio company — require 2-sentence answer to 'why aren't you a Daybreak connector in 18 months'

Sources:CyberScoop · TLDR InfoSec · Clint Gibler · The Hacker News · SANS AtRisk · The Information AM

04
Vertical AI Playbook Proven: Abridge at $5.3B Shows Where Moats Actually Live
The Category Winner Template
Abridge raised $550M in 2025 ($250M early year, $300M at a $5.3B mark) with 250 US health systems live and 80M+ patient-clinician conversations annually. That is a data moat no foundation model or new entrant can buy or scrape. Health systems — historically the slowest enterprise buyers — compressed release cycles from quarterly to monthly for this product.
The three-act structure is the investable template: save time (clinical documentation, won) → save money (prior auth, active) → save lives (clinical decision support, unlocked by January 2026 FDA guidance). Each act monetizes the same proprietary conversation asset against a different buyer persona.
The Underwriting Framework Update
Abridge validates three non-negotiable criteria for any vertical AI deal:
1. Per-use data flywheel — every customer interaction accrues proprietary training data competitors can't access
2. ≥2 monetization vectors beyond the initial wedge, targeting different buyer personas in the same account
3. Explicit non-compete posture with the dominant system of record (EHR, CRM, ERP) — companies pitching 'replacement' of incumbent platforms in regulated verticals are largely uninvestable at scale
The critical positioning: Abridge frames itself as a 'clinical intelligence layer' rather than an EHR replacement, treating Epic/Cerner as a filesystem. That neutralizes the single biggest existential risk in healthcare AI.
Vertical AI moats are built from proprietary interaction data, not model quality. Back the data flywheels, fade the AI wrappers, pre-empt the adjacent workflow layers before the same playbook prices them in.
Where the Adjacent Alpha Lives
The category-leader print doesn't close the healthcare AI book — it opens adjacent chapters:
- Payer-side prior authorization — Abridge attacks from the provider side; the payer counterparty remains greenfield with state-fragmented policies creating a parsing moat
- Nursing and short-form workflows — visits can be 30 seconds, a fundamentally different product surface
- Specialty verticals outside US large systems — veterinary, dental, behavioral health, international markets Abridge won't prioritize for 24+ months
- Picks-and-shovels — Kafka, Temporal, CRDTs, event-driven orchestration, memory stores, LLM judges — infrastructure serving multiple vertical AI winners
Action items
- Kill or de-prioritize active deals on direct ambient-scribe competitors targeting US large health systems; redirect sourcing to adjacent workflow layers
- Build thesis memo on payer-side prior-auth automation with target list of 8-12 startups to meet within the quarter
- Update vertical AI underwriting framework to require per-use data flywheel, ≥2 monetization vectors, and explicit non-compete posture vs. dominant system of record
- Track Abridge secondary market activity; pre-empt any tertiary opportunity at $8-12B range as the IPO comp re-rates the entire vertical AI book
Sources:Latent.Space

◆ QUICK HITS

Update: Cerebras closed first day at $311 (70% pop) vs $89 private round — Eclipse netted 17x, Tiger sitting on $1B paper gain locked 180 days; use $42B market cap as the new permanent comp for AI-infra marks
Katie Roof
Anthropic rented xAI's entire Colossus 1 cluster (220K+ GPUs) from Musk — the loudest demand signal in AI and a de facto concession that xAI is exiting the frontier race to become a compute landlord
The Pragmatic Engineer
Nebius printed 684% Q1 revenue growth but disclosed $2.47B capex vs $2.26B operating cash — the neocloud model is structurally cash-negative; compress private multiples from 8-10x to 2-3x forward revenue
Martin Peers
Anthropic's June 15 third-party credit unbundling eliminates the 70-90% subscription arbitrage that Cline and OpenCode ran — any wrapper on subscription tokens lost 20-40% of runway since Friday
AINews
Google Gemini Intelligence embeds autonomous task execution directly into Android (97%+ share markets) — any mobile AI agent startup assuming non-OS-level distribution needs a contingency before WWDC/summer launch
Simplifying AI
DuckDB's new Quack client-server protocol breaks it out of embedded jail — direct threat to Spark/Glue-heavy ETL vendors on sub-TB workloads; stress-test portfolio exposure
TLDR Data
Only 15% of enterprises have data foundations for agentic AI (Fivetran index) despite spending millions — data quality and lineage cited as #1 blocker by nearly half; cleanest picks-and-shovels setup since early Snowflake era
TLDR Data
Apple's third creator-tools acquisition (Patchflyer/Color.io, after Pixelmator and MotionVFX) confirms a deliberate roll-up into vertically-integrated Creator Studio — reprice Adobe tail risk
TLDR Design

◆ Bottom line

The take.

Enterprise AI's new leader has consumer-grade plumbing — no SLAs, no telemetry, customers blowing annual budgets by May — while 59% of production token volume has silently gone agentic and the security stack just landed its first federal exploitation notice. The model layer is priced to perfection on revenue that reverses in a quarter; the alpha has migrated to observability, agent infrastructure, and AI-security runtime plays that are still priced at Series A. Reprice the quality of your AI revenue assumptions this week, not next quarter.

Frequently asked

Why did ServiceNow exhaust its full-year Anthropic budget by May?: ServiceNow burned through its budget because there are no SLAs, no per-user telemetry, and no enterprise dashboard to govern Claude consumption. Without contractual lock-in or usage controls, even sophisticated buyers can't pace spend, which means Anthropic's enterprise ARR is structurally reversible at near-zero switching cost.
What does Anthropic renting 220,000 GPUs from xAI actually signal?: It signals a capacity crisis, not a strategic alliance. Anthropic planned for 10x growth and got 80x, forcing it to lease Colossus 1 from Elon Musk — who publicly called the company 'misanthropic and evil.' You don't rent from a declared enemy unless the plumbing has broken and there are no other options at scale.
Where should investors look for alpha if the model layer is overpriced?: One layer down: AI observability, FinOps for AI, token-cost attribution, and AI-native security harnesses. ServiceNow is already selling an AI Control Tower to the same CDIOs panicking over Claude bills, Modal is raising at $4.5B, and LiteLLM hitting CISA's KEV catalog has converted AI security into a budget line.
How should the 59% agentic token-volume figure change deal underwriting?: It invalidates any thesis priced on chat-completion as the base case. Agent workloads dominate production, with Anthropic capturing 61% of spend on tool-heavy reasoning and Google taking 38% of volume on commodity throughput — two different businesses that shouldn't share a multiple. Standalone orchestration startups face absorption risk; neutral infrastructure and vertical integrators with orchestration gravity are the survivors.
What makes Abridge the template for vertical AI underwriting?: Abridge satisfies three non-negotiables: a per-use data flywheel (80M+ annual patient-clinician conversations), at least two monetization vectors across buyer personas (documentation, prior auth, decision support), and an explicit non-compete posture toward the dominant system of record by treating Epic/Cerner as a filesystem rather than a replacement target. At a $5.3B mark with 250 health systems live, it's the comp benchmark for every new vertical AI memo.

◆ Same day, different angle

Read this day as…

◆ Recent in investor

Anthropic'sEnterpriseLeadRunsonConsumer-GradePlumbing

◆ INTELLIGENCE MAP

◆ DEEP DIVES

The Revenue Quality Problem Nobody Is Marking

The Capacity Crisis Beneath the Revenue

The Forward-Deployed-Engineer Consensus

The First Production Data

The Platform Land Grab

a16z's GTM Thesis Sets the Clock

The Category-Formation Event

The Economics Split

The EDR Moat is Cracking

The Category Winner Template

The Underwriting Framework Update

Where the Adjacent Alpha Lives

◆ QUICK HITS

The take.

Frequently asked

◆ RELATED THREADS