Edition 2026-05-28 · read as Investor
CerebrasIPOPopMasksAnthropicBudgetBlowupsatSaaSCo's
- Sources
- 36
- Words
- 1,600
- Read
- 8min
Topics Agentic AI AI Capital LLM Inference
◆ The signal
Cerebras popped 70% on day one to $311 while ServiceNow quietly blew its full-year Anthropic budget by May — with zero SLAs and no usage telemetry. Your AI marks are going up on proven exit liquidity while the revenue quality underneath the application layer is silently deteriorating. Anthropic's June 15 credit conversion eliminates the 70-90% token arbitrage that Claude-dependent wrappers relied on. The window to sell into strength and stress-test portfolio revenue quality is this week, not next quarter.
◆ INTELLIGENCE MAP
01 Enterprise AI Revenue Is Not SaaS Revenue
act nowServiceNow exhausted its full-year Anthropic budget by May with no SLAs, no per-user telemetry, and no contractual lock-in. Anthropic's June 15 credit conversion kills the 70-90% subscription arbitrage Claude wrappers depend on. OpenAI countered with 2-month free Codex — both labs buying developer lock-in ahead of Anthropic's likely October IPO.
- Budget blown by
- Credit conversion
- Likely IPO
- Wrapper margin hit
- ServiceNow budget blownMay 2026
- Credit conversion liveJune 15
- OpenAI Codex promo endsJuly 2026
- Anthropic IPO targetOctober 2026
02 AI Infra Exit Window: First-Day Proof Points
monitorCerebras closed at $311 (70% pop), delivering Eclipse 17x and Tiger $1B paper gain. Fervo popped 33% to $10B+ on AI power demand. Benchmark raised a $225M SPV to defend Cerebras ownership — the first institutional admission that SPVs are now structural, not optional. Two of the next three filers will clear; the third tests whether the bid is selective.
- Cerebras day-1 pop
- Fervo day-1 pop
- Benchmark SPV
- Tiger paper gain
03 Agent Economy Crosses 59% — Category Repricing Live
monitorVercel's production AI Gateway shows agentic workloads hit 59% of token volume — no longer a thesis, the majority case. Anthropic captures 61% of spend (premium reasoning), Google takes 38% of volume (commodity throughput). SAP committed €100M, ServiceNow shipped headless Action Fabric, and Notion launched a developer platform. The agent infrastructure window is 2-3 quarters before incumbents absorb it.
- Anthropic spend share
- Google volume share
- SAP agent fund
- Absorption window
04 AI Security Crosses KEV Threshold — Budget Line Confirmed
monitorLiteLLM hit CISA's KEV catalog — first AI-infra component flagged as actively exploited. MDASH produced 16 validated Windows CVEs in one Patch Tuesday. DepthFirst claims 10x cost advantage over Mythos on vulnerability discovery. OpenAI's Daybreak launched with 8 incumbent 'partners' in the pre-disintermediation pattern. Series A window for AI-native security is open now.
- LiteLLM status
- DepthFirst vs Mythos
- Daybreak partners
- PraisonAI exploit time
- DepthFirst (FFmpeg)1000
- Mythos (FFmpeg)10000
05 GTM Value Migration: Record to Intelligence Layer
backgrounda16z published its GTM thesis with a live Stitch check: system of intelligence captures majority value over the system of record. Lemkin proof point — 80% fewer human seats, 83% higher spend ($12K→$22K), 20+ agents running. Salesforce and HubSpot ($149B combined) face ARPU expansion but seat-count erosion. The investable window for vertical orchestration is 12-18 months before incumbents or consensus close it.
- Seat reduction
- CRM TAM at risk
- Agent-to-seat ratio
- Window remaining
- Before (10 seats)12000
- After (2 seats + agents)22000
◆ DEEP DIVES
01 Enterprise AI Revenue Quality Is Structurally Fragile — Reprice the Application Layer
The Revenue Looks Great Until You Read the Contract
The least comfortable fact in AI investing this week is that enterprise AI ARR does not behave like SaaS ARR, and the evidence keeps arriving from buyers who would prefer it didn't. ServiceNow, arguably the most sophisticated enterprise buyer alive, blew through its full-year Anthropic budget by May 2026. National Life Group's CIO described Anthropic as 'great for consumer usage but not great for companies.' The reason is banal: there is no granular per-user telemetry, no SLAs worth the name, and no contractual switching costs anywhere in the stack.
This matters because the thirty billion dollar ARR figure everyone is marking Anthropic against assumes enterprise-grade durability, and the plumbing underneath it is not enterprise-grade.
The June 15 Margin Event
Anthropic's decision to convert every Claude subscription into a dollar-matched API credit pool eliminates the 70-90% arbitrage that third-party harnesses (Cline, OpenCode, the various Claude wrappers) have been quietly running for months. A $200/month plan now buys exactly $200 of programmatic tokens, which means no more leveraging subscription-tier access for production-scale inference at a fraction of API rates.
OpenAI answered within days with two months of free Codex for enterprise switchers. Set against Ramp's April data showing Anthropic at 34.4% of business spend versus OpenAI at 32.3%, this looks like a subsidy war timed to pre-IPO margin recovery. Or rather, the more interesting version: a subsidy war both sides need to be seen losing money on, for different reasons.
Every Claude-dependent portfolio company whose unit economics were built on subscription-tier token access has lost 20-40% of runway since last Friday, a change that is four days old and which most founders have not yet flagged.
The Deployment Services Arms Race Confirms It
The industry has quietly conceded what Palantir proved twenty years ago, which is that deployment is the bottleneck, not model capability. Google is hiring hundreds of forward-deployed engineers, OpenAI stood up DeployCo with Bain, and Salesforce and ServiceNow are staffing the same function under different names. When four firms independently decide the margin lives in deployment services, the margin lives in deployment services.
The second-order effect is a new category — AI observability and FinOps — validated by ServiceNow's own AI Control Tower selling into the same customers panicking about their Anthropic bills. The company that burned its budget is now selling the budget-monitoring tool, which is not coincidence so much as a CDIO who watched the category form inside her own P&L and decided to charge for the lesson.
What This Means for the Book
The under-discussed risk is the reversibility of enterprise AI spend: absent SLAs there are no switching costs, absent usage telemetry there is no early warning, and FOMO-driven procurement (which describes most 2025-2026 AI spend) produces a cliff-shaped risk profile rather than the smooth retention curves SaaS multiples assume. This is probably wrong in a handful of names, but the working rule is to apply a 20-40% reversibility discount to any LLM-layer ARR where SLAs and telemetry are absent.
Action items
- Request updated gross-margin models from every Claude-dependent portfolio company reflecting the June 15 credit conversion — any wrapper running COGS against subscription tokens needs a revised cohort model by end of month
- Demand SLA + usage-telemetry roadmap from any portfolio company pitching enterprise AI ARR at next board meeting
- Build a sourcing sprint on AI observability / FinOps (token-cost-attribution, per-user spend caps, SLA monitoring) at Seed-Series A
- Apply 20-40% reversibility discount to any LLM-layer ARR in your marks where SLAs/telemetry/contractual lock-in are absent
Sources:Anthropic has an enterprise gap · Anthropic squeezing its pre-IPO round · Anthropic is reportedly running at thirty billion dollars · Anthropic's 80x growth broke its infra
02 Agent Workloads Are the Majority Case — Infrastructure Window Is 2-3 Quarters
59% Is Not a Thesis. It's the Base Rate.
Vercel published the first production-grade AI Gateway index this week — not a lab benchmark, not self-reported ARR, but downstream production data across 200K+ teams. The headline: agentic workloads now carry 59% of all token volume. The chat-completion era is officially the minority case in production.
More revealing is the spend-versus-volume bifurcation. Anthropic captures 61% of spend via Opus on premium reasoning calls. Google takes 38% of volume via Flash as commodity throughput. These are two different businesses emerging inside what everyone has been calling 'foundation models.' The pricing power lives with Anthropic. The scale lives with Google. Multi-model routing is not a sophistication — it's the default architecture.
The Incumbents Are Moving First This Cycle
Unlike the cloud transition where AWS and Azure defined infrastructure while ERPs caught up late, this time the incumbents are moving first:
- SAP committed €100M to an Autonomous Enterprise fund with Nvidia and Microsoft wired in
- ServiceNow shipped Action Fabric — headless APIs designed for agents to consume, decoupling logic from UI
- Notion launched a developer platform hosting Claude, Codex, Cursor, Decagon, Warp, and Devin as teammates
- Airtable launched Hyperagent with a $10M inference credit program targeting 500 founders
This means the question of where agent-layer value accrues looks materially different from the cloud parallel. Pure-play agent orchestration layers face absorption risk from both below (model labs climbing up) and above (workflow platforms climbing down).
The window for agent infrastructure at reasonable Series A pricing is 2-3 quarters. After that, SAP's corp dev team, Notion's platform, and Anthropic's own tooling absorb the category into existing surfaces.
The a16z GTM Thesis Sharpens the Allocation
a16z published its full GTM thesis with a live check (Stitch): the system of intelligence layer captures majority value over the system of record. The Lemkin proof point makes the economics concrete: Salesforce from 10+ human seats to 2 humans + 1 API seat, while spend rose 83% ($12K → $22K) and 20+ agents run on top. The seat count collapsed. The bill went up.
The investable wedge is narrow: vertical-specific orchestration layers with institutional-context accumulation. The moat has migrated from data gravity to orchestration gravity — institutional context, workflow memory, and multi-system synthesis are the new lock-in. Horizontal 'AI copilot for sales' plays walk straight into Salesforce's API-first counter-punch.
What Survives Absorption
Three characteristics define what the incumbents cannot replicate in a quarter: (1) proprietary action surfaces — a regulated workflow, a dataset, an integration graph; (2) accumulated institutional context that compounds with usage; (3) vertical depth the incumbent's horizontal platform cannot be bothered to defend. Everything else is a feature waiting to be shipped.
Action items
- Source 3-5 agent infrastructure deals in MCP tooling, agent identity/auth, agent observability, and knowledge-graph governance this quarter — before SAP's corp dev activation compresses entry points
- Map pipeline against 'agent-hosting platform' absorption risk — identify which deals get displaced if Notion/Airtable/ServiceNow absorb the workflow
- Request consumption/agent-usage metrics in all active AI-GTM diligence — specifically agent-to-seat ratio, API call volume growth, and NRR on existing logos
- Rerank AI-GTM pipeline by orchestration moat depth — prioritize narrow high-frequency workflows with measurable outputs over horizontal 'AI copilot for sales' plays
Sources:a16z has published another map · Vercel's first production AI Gateway index · SAP put one hundred million euros · Anthropic shipped Claude Code
03 AI Security Hit Its KEV Moment — The Series A Window Opens Now
LiteLLM Lands on the KEV Catalog
CISA added LiteLLM, an AI Gateway routing control plane, to its Known Exploited Vulnerabilities catalog this week. This is the first time an LLM-routing component has been federally flagged as actively exploited. Combine that with Ollama's GGUF model-loader bug (CVSS 9.1, data exfiltration via malicious model files) and OpenClaw shipping six critical CVEs in a single cycle, and the AI-infra stack sits at roughly the security maturity of enterprise SaaS in 2014.
This looks like a vulnerability story and is not. Or rather, the more interesting version is that it is the regulatory validation event that turns "AI security" from a pitch slide into an enterprise budget line with procurement authority attached.
Concurrent Disruption Signals
The category is being disrupted from three directions at once, and the wedges do not all have to clear for the thesis to work.
Economics: DepthFirst's Open Defense Initiative landed FFmpeg, Envoy, and Kata as anchor tenants, claiming 12 memory corruption bugs for ~$1,000 in compute versus Anthropic's Mythos missing the same bugs at ~$10,000 across hundreds of scans. If the 10x claim holds at scale, wrapper economics collapse for anyone building security workloads on rented frontier-model time.
Platform absorption: OpenAI's Daybreak launched with Cloudflare, Cisco, CrowdStrike, Palo Alto Networks, Oracle, Zscaler, Akamai, and Fortinet listed as "partners." That is the pre-disintermediation pattern, every incumbent named as a friend in the quarter before the platform eats them.
Autonomous discovery at scale: Microsoft's MDASH produced 16 validated Windows CVEs in a single Patch Tuesday, the first auditable evidence that multi-model vulnerability discovery works at hyperscaler production scale rather than in conference demos.
When attackers weaponize AI-framework CVEs in 4 hours (PraisonAI) and defenders need weeks to patch, the economics of selling security on a human-speed cadence break. The fundable layer is the one that matches machine speed.
The Investment Map
The category splits into four fundable wedges, and the diligence test on each is whether the moat is harness design or merely model access.
- AI-gateway security: MCP firewalls, LLM posture management, model-artifact scanning. LiteLLM on KEV is the "MongoDB ransomware moment" that creates enterprise procurement authority.
- Autonomous vulnerability discovery with specialized harnesses: the Mozilla-versus-curl delta is the diligence test. Mozilla found 271 Firefox bugs with a custom agentic harness; Mythos found 1 real CVE in curl with a generic scan. Harness design is the moat, not model access.
- Agent runtime security: an 81% bot-detection bypass rate plus 4-hour exploit windows on AI frameworks equals a defender-empty category with near-certain demand.
- Supply-chain provenance: Shai-Hulud's leaked source documents industrial-grade Sigstore forgery. The current trust chain is known-breakable.
EDR Moat Erosion
TrustedSec ran LLMs against five commercial EDRs and compressed reverse engineering from weeks to days; all five products rest on identical architectural furniture (YARA rules, Lua engines, local ML classifiers). Current multiples on CrowdStrike, Palo Alto, and SentinelOne are priced for the old moat. Detection-rule IP is becoming a commodity input. This is probably wrong on timing, but the durable positions look like distribution, data scale, and harness-heavy integration layers.
Action items
- Pull forward AI-security diligence by one quarter — specifically AI-gateway firewalls, model-artifact scanners, and LLM-runtime sandbox companies at Series A
- Request DepthFirst data room and validate 10x cost claim against Mythos with independent benchmark on 2-3 additional codebases
- Add OpenAI Daybreak to competitive tracker for every security portfolio company — require founders to articulate why their product isn't a Daybreak connector in 18 months
- Run portfolio-wide exposure sweep for LiteLLM (1.81.16–1.83.7), Traefik, Argo CD, and MOVEit Automation — escalate any that are internet-exposed
Sources:Cybersec alpha: AI-infra CVEs hit KEV · DepthFirst's Open Defense Initiative · Anthropic's Mythos cleared AISI · The EDR moat is cracking · Microsoft's MDASH producing 16 patched Windows flaws
◆ QUICK HITS
Update: Cerebras closed day one at $311 (70% pop) — Eclipse netted 17x from 2016 entry, Tiger sits on ~$1B paper from $89 entry months ago; Modal's $4.5B round is now the cleanest private read-through
Cerebras printed a seventy percent first-day pop
Benchmark raised a $225M SPV for Cerebras despite being 'the patron saint of small early-stage funds' — late-stage adds were 93% of total cost basis and lowered the fund multiple; SPV infrastructure is now structurally required for top-5 winners
Cerebras printed a seventy percent first-day pop
Abridge raised $550M at $5.3B servicing 250 health systems with 80M+ conversations — the ambient clinical scribe category is closed; alpha moved to payer-side prior auth, nursing workflows, and specialty verticals
Abridge at $5.3B: the healthcare AI vertical just printed a category winner
Google's Gemini Intelligence embeds autonomous task execution directly into Android (97%+ share in key markets, summer 2026 rollout) — every mobile AI assistant startup whose wedge is 'agent for your phone' is now a feature, not a company
Gemini becomes the OS: your agent-layer portfolio just got disintermediated
Fivetran readiness index confirms only 15% of enterprises have data foundations for agentic AI despite spending millions — data quality and lineage cited as #1 blocker by nearly half; picks-and-shovels window open at Series A/B
Data infra thesis update: DuckDB goes client-server, 85% agentic-AI readiness gap
Figure ran an 8-hour fully autonomous humanoid shift at human-parity speed with fleet coordination and self-maintenance — the first credible 'labor-hour' proof point; expect 1X, Physical Intelligence, and Apptronik rounds to mark against this comp
Anthropic squeezing its pre-IPO round
a16z's AI liability framework is coordinated lobbying against absolute-liability proposals — if developer liability for downstream misuse gets established, open-weight foundation models become uninsurable; stress-test deals at 15-25% revenue absorbed by litigation reserves
a16z is spending partner attention on AI liability policy
Nebius guided to 6x revenue growth ($530M→$3.4B) with 4+ customers bidding per GPU and $2.47B capex against $2.26B operating cash — neoclouds are structurally cash-negative even at 684% growth; compress private multiples from 8-10x to 2-3x forward
AI compute is sold out 4:1
◆ Bottom line
The take.
Enterprise AI revenue is growing at headline rates while the plumbing underneath it — no SLAs, no telemetry, no contractual lock-in — means that ARR is reversible in ways SaaS ARR never was. The AI exit window is proven liquid (Cerebras 70% pop, Eclipse 17x), agentic workloads are now 59% of production tokens, and AI-infra security just hit CISA's actively-exploited list for the first time. The trade this week: sell model-layer exposure into the strongest primary demand you'll see, stress-test every wrapper's unit economics before June 15 kills their margin arbitrage, and concentrate new capital where incumbents cannot absorb the category in two quarters — agent infrastructure governance and AI-native security.
Frequently asked
- How does Anthropic's June 15 credit conversion change wrapper economics?
- It eliminates the 70-90% token arbitrage that Claude-dependent wrappers ran by leveraging subscription-tier access for production inference. After June 15, a $200/month plan buys exactly $200 of programmatic tokens. Any portfolio company whose COGS model assumed subscription-tier token rates has effectively lost 20-40% of runway, and most founders have not yet flagged it.
- What discount should be applied to LLM-layer ARR without SLAs or usage telemetry?
- A 20-40% reversibility discount is the working rule. ServiceNow's full-year Anthropic budget blowout by May, combined with CIO sentiment that Claude is 'not great for companies,' demonstrates that enterprise AI spend reverses quickly absent contractual lock-in. Without SLAs, telemetry, or switching costs, retention curves look cliff-shaped rather than SaaS-smooth.
- Why is the agent infrastructure investment window only 2-3 quarters?
- Incumbents are moving first this cycle, unlike the cloud transition. SAP committed €100M with Nvidia and Microsoft, ServiceNow shipped Action Fabric, Notion launched a developer platform hosting Claude/Codex/Cursor, and Airtable launched Hyperagent. Pure-play orchestration layers face absorption from both model labs climbing up and workflow platforms climbing down, compressing Series A entry points fast.
- What makes LiteLLM's KEV listing an investable inflection rather than a vulnerability story?
- It is the first time a federal catalog has flagged an LLM-routing component as actively exploited, which converts AI security from a pitch slide into an enterprise budget line with procurement authority. Combined with Ollama's CVSS 9.1 model-loader bug and OpenClaw's six critical CVEs, it functions as the 'MongoDB ransomware moment' that triggers re-rating of AI-gateway firewalls and model-artifact scanners at Series A.
- What characteristics let an agent-layer company survive incumbent absorption?
- Three: proprietary action surfaces (regulated workflows, integration graphs, exclusive datasets), accumulated institutional context that compounds with usage, and vertical depth horizontal platforms cannot be bothered to defend. The moat has migrated from data gravity to orchestration gravity. Horizontal 'AI copilot for sales' plays walk into Salesforce's API-first counter-punch; narrow high-frequency workflows with measurable outputs do not.
◆ Same day, different angle
Read this day as…
◆ Recent in investor
Keep reading.
- SpaceX is pricing June 12 at one-point-seven-five trillion, roughly a hundred times revenue, into the worst tape we have seen for a listing…
- SpaceX is quietly collecting $2.17B/month in AI compute rent from Anthropic and Google — a $26B annualized run-rate that isn't in secondary…
- Anthropic edged OpenAI in enterprise billing on Ramp last week, 34.4 percent to 32.3, in the same week ServiceNow admitted it had burned its…
- ServiceNow burned its full-year Anthropic budget by May, with no SLAs, no per-user telemetry, no enterprise dashboard.
- Anthropic's June 15 pricing change closed the seventy-to-ninety percent subscription arbitrage the third-party Claude tools were quietly run…