How should I adjust gross margin assumptions for Claude-dependent portfolio companies?

Rebuild COGS models using full API-rate billing as the base case, since the dollar-matched credit conversion eliminated the 70-90% subsidy third-party harnesses like Cline, Codebuff, and OpenCode were running on. Companies that haven't flagged the hit have likely lost 20-40% of effective runway since May 12-13. Demand written answers by month-end on whether they're defending via proprietary workflow data, open-source distribution, or bounded-execution security.

Why is ServiceNow's Anthropic budget blowout a signal beyond one customer?

It proves enterprise AI spend is structurally unmonitored and contractually reversible. Anthropic ships no per-user telemetry, no SLAs, and no enterprise dashboard, so even sophisticated buyers like ServiceNow burned a full-year budget by May. That's why Google Cloud, OpenAI/Bain's DeployCo, and Salesforce are all staffing forward-deployed engineers — deployment, not capability, is the bottleneck, and the margin is migrating to that layer.

What discount should be applied to LLM-layer ARR multiples right now?

Apply a 20-40% reversibility haircut to enterprise ARR at any model-layer company lacking SLAs, telemetry, or contractual lock-in. The on-record CIO complaints (Romack, Mehta, National Life) are the sentiment that precedes RFP cycles, not renewal cycles, so the cliff-shaped churn risk isn't yet priced in. Renewals through 2026 may hold, but the mispricing is worth acting on before it resolves.

Where is the cleanest entry point in agent infrastructure given incumbent moves?

The interop and governance layer — MCP gateways, agent identity and auth, agent observability, and knowledge-graph tooling — is the seed-to-Series A window with a 6-12 month runway before SAP's €100M fund and ServiceNow's Action Fabric compress entry multiples. Standalone orchestration is already fragmenting toward M&A exits, and platform-layer bets are publicly re-rated. Vertical workflow AI offers a longer 12-18 month window where incumbent priority is lower.

What does the Lemkin Salesforce data point actually validate?

It validates consumption-based GTM at higher ARPU against collapsing seat counts: 10+ human seats compressed to 2 humans plus 1 API seat, while spend rose 83% from $12K to $22K with 20+ agents underneath. This is the template for upcoming SaaS earnings calls, so seat-based SaaS positions should be stress-tested at constant, +30%, and -30% seat-value scenarios. Positions that fail need trimming before the pattern goes mainstream.

Edition 2026-05-24 · read as Investor

AnthropicCreditShiftGutsAICodingToolMargins

Sources: 36
Words: 1,375
Read: 7min

Topics AI Capital Agentic AI LLM Inference

◆ The signal

Anthropic converted every Claude subscription into dollar-matched API credits four days ago, eliminating the 70-90% cost arbitrage that third-party harnesses (Cline, Codebuff, OpenCode) were running on — and most portfolio companies haven't flagged the margin hit yet. In the same week, ServiceNow disclosed it blew its full-year Anthropic budget by May because no SLA or usage telemetry exists. Enterprise AI revenue is simultaneously more expensive to generate and less defensible than anyone's model assumes. Rebuild your portco cost models this week — not next quarter.

Key facts

On May 12-13, 2026, Anthropic converted every Claude subscription into dollar-matched API credits, eliminating the 70-90% cost arbitrage used by third-party harnesses like Cline, Codebuff, and OpenCode.
OpenAI countered Anthropic's repricing by offering enterprise switchers two months of free Codex within a 30-day window.
ServiceNow exhausted its full-year Anthropic budget by May 2026 because Anthropic ships no per-user telemetry, no SLAs, and no enterprise spend dashboard.
Ramp's April 2026 data showed Anthropic at 34.4% of business AI spend versus OpenAI's 32.3%, the first documented leadership flip ahead of Anthropic's targeted October IPO.
Vercel's AI Gateway index found 59% of token volume is now agentic workloads, with Anthropic capturing 61% of spend and Google's Flash taking 38% of volume.

◆ INTELLIGENCE MAP

01
Enterprise AI Revenue ≠ SaaS Revenue
act now
ServiceNow exhausted its full-year Claude budget by May with no SLA, no per-user telemetry, and no contractual lock-in. Google, OpenAI/Bain, and Salesforce all hiring hundreds of FDEs confirms deployment — not model capability — is the bottleneck. AI ARR at 80x multiples carries a reversibility discount nobody is applying.
0%
SLA coverage on Claude
6
sources
- Budget exhausted by
- FDE hiring orgs
- Modal valuation
- OpenAI DeployCo size
1. Traditional SaaS ARR85
2. AI Model ARR45
02
Anthropic's Pre-IPO Margin Recovery Crushes Dev Tool Economics
act now
Anthropic's subscription-to-API-credit conversion eliminates the arbitrage third-party harnesses exploited. OpenAI countered with 2-month free Codex for enterprise switchers. Ramp shows Anthropic at 34.4% vs OpenAI 32.3% in business spend — the first documented leadership flip. October IPO target means this pricing structure is permanent.
70-90%
arbitrage eliminated
8
sources
- Anthropic biz share
- OpenAI biz share
- Likely IPO date
- Codex free promo
1. Anthropic34.4
2. OpenAI32.3
3. Google15
4. Other18.3
03
Agent Infrastructure: Incumbents Drawing the Category Map
monitor
SAP committed €100M to an autonomous enterprise fund. ServiceNow shipped Action Fabric (headless APIs for agent consumption). Vercel's production index shows 59% of token volume is agentic, with Anthropic taking 61% of spend while Google takes 38% of volume. The incumbents are defining this category before pure-plays can name it.
59%
agentic token volume
5
sources
- SAP fund size
- Anthropic spend share
- Google volume share
- Lemkin seat reduction
1. Agentic workloads59
2. Chat/completion41
04
AI Security Gets Its First Budget Line Item
background
LiteLLM hit CISA's KEV catalog — the first AI-infra component federally flagged as actively exploited. DepthFirst claims 10x cost efficiency over Mythos on vulnerability discovery. OpenAI launched Daybreak with 8 incumbent 'partners' in the pre-disintermediation pattern. EDR detection logic now extractable by LLMs in days, not weeks.
10x
DepthFirst cost edge
6
sources
- Microsoft MDASH CVEs
- Mozilla bugs found
- Daybreak partners
- Deepfake loss TAM
1. DepthFirst (FFmpeg)12
2. Mythos (FFmpeg)0
3. Mozilla harness (FF)271

◆ DEEP DIVES

01
Anthropic's Pricing Move Breaks Dev Tool Economics — 30 Days to Rebuild Margins
What Happened
On May 12-13, Anthropic converted every Claude subscription into a dollar-matched API credit pool, which is a polite way of saying a two-hundred-dollar plan now buys exactly two hundred dollars of programmatic tokens at standard API rates and not a token more. The arbitrage that third-party coding harnesses (Cline, OpenCode, OpenClaw) were quietly running on subscription-tier usage was somewhere between seventy and ninety percent. It is now zero. Within hours OpenAI countered with two months of free Codex for enterprise switchers inside a thirty-day window, which tells you which company was nervous about which.
Read this against Ramp's April data showing Anthropic at 34.4% of business spend versus OpenAI's 32.3%, the first documented leadership flip, and against Anthropic's likely October IPO and its fresh CFO hire. The read is narrow: margin recovery dressed as developer generosity, timed to pre-IPO diligence.
Why This Matters for Your Book
Any portfolio company whose COGS model quietly assumed subsidized subscription tokens has lost twenty to forty percent of effective runway since Friday. The change is four days old. Most founders have not flagged it because it landed as a policy update rather than a pricing announcement, which is the entire point.
Every Claude-dependent developer tool in your portfolio is worth less today than it was last Friday. The question is whether your marks reflect that yet.
The squeeze is two-sided, which is what makes it interesting rather than merely painful. Anthropic is metering from below through the credit conversion. Notion's External Agents API, now hosting Claude, Codex, Cursor, Decagon, Warp, and Devin in the same workspace, is commoditizing the interface layer from above. Margin compression and distribution displacement, in the same week.
The Counter-Thesis
There are two versions where this does not kill the wrapper layer, and they deserve a hearing. First, the credit conversion includes a +50% Claude Code limits increase through July 13, which is a temporary subsidy that buys roughly two months of cover. Second, enterprise procurement still prefers specialist vendors over model-API features, the same dynamic that kept Twilio alive when AWS shipped competitive messaging. That thesis has been right before. It is also the argument every incumbent makes the quarter before bundling arrives.
What To Do
The through-line is unglamorous: demand updated gross-margin models from every Claude-dependent portco by end of month, using API-rate billing as the base case. Founders who built on the arbitrage owe you a written answer on which of three paths they are taking — vertical moat via proprietary workflow data, open-source distribution on the Cline model, or bounded-execution security on the Cursor model. Anything that does not clear one of those is a pass or a mark-down.
Action items
- Request updated gross-margin models from every Claude-dependent portfolio company assuming full API-rate billing by May 31
- Accelerate Anthropic secondary/pre-IPO allocation decisions before book-building begins in August
- Map portfolio against Notion External Agents API displacement risk — identify which deals get absorbed if workspace platforms host the agents directly
Sources:AINews · TLDR AI · ben's bites · Daily Dose of DS · StrictlyVC
02
Enterprise AI Revenue Is Structurally Fragile — Apply the Reversibility Discount
The Proof Point
ServiceNow, which is not exactly a naive enterprise software buyer, burned through its full-year Anthropic budget by May 2026. Not because Claude failed to deliver. Because Anthropic ships no per-user telemetry, no SLAs, and no enterprise dashboard that would have passed muster at a mid-tier SaaS vendor in 2014. National Life Group's CIO put it plainly: Anthropic is 'great for consumer usage but not great for companies.'
This is the company the market is currently valuing at north of nine hundred billion dollars on the thesis that enterprise revenue carries the number. Consumer-grade plumbing does not usually carry enterprise-grade multiples, or rather, it does, until the renewal conversation.
The Pattern Is Larger Than Anthropic
Four firms reached the same conclusion independently, which is the part worth noticing: deployment is the bottleneck, not model capability.
- Google Cloud — hiring hundreds of forward-deployed engineers
- OpenAI/Bain — stood up DeployCo, bought a consulting firm for its 150-FDE roster
- Salesforce — staffing the same function
- ServiceNow — shipping AI Control Tower to monitor the very spend it cannot otherwise control
When the industry quietly concedes what Palantir worked out twenty years ago, the margin moves to the deployment layer. The models become the cost center.
Anthropic just taught the market that enterprise AI ARR is not SaaS ARR. The switching costs are near zero, the contractual lock-in is absent, and the budgets are unmonitored. Treat accordingly.
The Investable Gap
The AI observability and FinOps category is forming in public, which is rare and worth watching. ServiceNow's AI Control Tower is the validation and the competition in the same product. Token-level cost attribution, per-user spend caps, SLA monitoring across model APIs: no independent category winner exists yet. Modal at four and a half billion dollars is the nearest infrastructure comp. Whoever builds the Datadog of AI model consumption captures the budget ServiceNow accidentally proved is sitting there unmonitored.
What This Means for Marks
Any LLM-layer company pitching enterprise ARR at eighty times revenue without SLAs, telemetry, or contractual lock-in deserves a 20-40% reversibility discount. This is probably wrong if renewals hold through 2026, but the on-record CIO quotes (Romack, Mehta, National Life) are the sentiment that precedes RFP cycles, not renewal cycles. The risk profile is sticky until it isn't. That is the mispricing worth acting on before it resolves.
Action items
- Demand SLA and usage-telemetry roadmap from every model-layer company in portfolio pitching enterprise ARR — flag absent answers as a downgrade
- Launch a sourcing sprint on AI observability/FinOps (token-cost attribution, per-user caps, SLA monitoring) — target Seed to Series A before the category names a winner
- Apply a 20-40% reversibility haircut to AI-layer ARR multiples in all active valuations where SLAs and telemetry are absent
Sources:Laura Bratton · The Pragmatic Engineer · Martin Peers · Bloomberg Technology · a16z

Agent Infrastructure: Incumbents Are Defining the Category Before Startups Can Name It

The Convergence

In a single week: SAP committed €100M to an autonomous-enterprise fund that wires NVIDIA and Microsoft into the platform layer, ServiceNow shipped Action Fabric which decouples logic from UI and exposes workflows as headless APIs for agents, Notion launched a developer platform with Claude and Codex as hosted teammates, and Airtable committed $10M of Hyperagent inference credits to 500 agent-native founders. The incumbents are drawing the borders of the autonomous-enterprise category before the pure-plays get to name it. That is usually how categories end up named after the incumbents.

Vercel's first production AI Gateway index gives this some empirical grounding, or rather the only grounding anyone has bothered to publish: 59% of token volume is now agentic workloads. Anthropic takes 61% of spend with Opus as premium reasoning. Google takes 38% of volume with Flash as commodity throughput. Two different businesses are now visible inside what we have all been politely calling foundation models.

The a16z GTM Thesis — And Its Timing Problem

a16z has published the case that value migrates from the system of record (Salesforce at $140B, HubSpot at $9B) to the system of intelligence, meaning the orchestration layer that reads the CRM, decides what to do, and does it. The exhibit is Jason Lemkin cutting Salesforce from 10+ human seats to 2 humans plus 1 API seat while spend rose 83% ($12K→$22K) with 20+ agents underneath.

The seat count collapsed. The bill went up. That is the consumption-based GTM pitch in one customer — and it validates higher ARPU against lower headcount.

The problem with publishing a thesis is that consensus is forming right now. When a16z says it out loud, Tier 1 pipelines rerank inside one to two quarters, and entry multiples for AI-native GTM stop being interesting at precisely the moment everyone agrees the category exists. This is probably wrong, but it usually isn't.

Where the Window Sits

Layer	Winner	Window	Risk
Platform (SAP, ServiceNow)	Incumbents	Public re-rating	Already priced
Interop/governance (MCP, agent identity)	Undefined	Seed/A — 6-12 months	Absorption by platforms
Orchestration standalone	Fragmented	Closing — M&A exits preferred	Platform compression
Vertical workflow AI	Domain specialists	Prime entry — 12-18 months	Low incumbent priority

The cleaner alpha this quarter sits one layer over, in the interop and governance layer: MCP gateways, agent identity and auth, agent observability, knowledge-graph tooling. SAP's partner fund will deploy here eventually. The question is whether you are on the cap table at seed pricing before corp dev activation compresses entry. After that you are buying the same thesis at a different price.

Action items

Source 3-5 agent infrastructure deals in MCP tooling, agent identity/auth, and agent observability before SAP's fund deploys into the category
Stress-test every Claude-dependent third-party integration against Anthropic's June 15 credit unbundling — model gross margin impact before the pricing change hits renewals
Run a portfolio revalidation on seat-based SaaS positions using Lemkin's multiplicative framework — model seat value at constant, +30%, and -30% scenarios

Sources:a16z · TLDR IT · ben's bites · TLDR · Simplifying AI · TLDR AI

◆ QUICK HITS

Update: Cerebras closed first day at $311 (70% pop vs $185 IPO price) — Eclipse netted 17x, Tiger sitting on $1B paper gain, Benchmark's $225M SPV returned ~$300M but lowered fund multiple
Katie Roof
xAI leased its entire Colossus 1 cluster (220K+ GPUs) to Anthropic — the loudest demand signal in AI and a de facto concession that xAI is exiting the frontier race; reprice any Grok-dependent exposure as infra + distribution, not frontier lab
The Pragmatic Engineer
Abridge raised $550M at $5.3B with 250 health systems and 80M+ annual conversations — closed the ambient clinical documentation category; vertical AI data moats at this scale are unreplicable
Latent.Space
LiteLLM added to CISA's Known Exploited Vulnerabilities catalog — first AI-infrastructure control plane federally flagged as actively exploited; the AI-security category just got its regulatory validation event
SANS AtRisk
Google Gemini Intelligence embeds autonomous task execution into Android OS with summer 2026 rollout — every mobile AI agent startup positioned as 'AI layer on top of Android' is now a feature, not a company
Simplifying AI
Nebius guided to $3.0-3.4B revenue (6x growth from $530M) with 4+ customers bidding per GPU and $2.47B capex vs $2.26B operating cash — neocloud demand is real but unit economics are structurally cash-negative
Martin Peers
DepthFirst's Open Defense Initiative claims 10x cost efficiency over Anthropic's Mythos (12 FFmpeg bugs for ~$1K vs $10K) — autonomous vulnerability discovery is commoditizing before incumbents finish building moats
TLDR InfoSec
Only 15% of enterprises have data foundations for agentic AI at scale per Fivetran index — data quality and lineage cited as #1 blocker by nearly half; the cleanest picks-and-shovels setup since early Snowflake era
TLDR Data

◆ Bottom line

The take.

Enterprise AI revenue is structurally fragile — ServiceNow blew its full-year Claude budget by May with zero SLAs or telemetry, Anthropic's credit conversion just killed the 70-90% arbitrage powering your dev-tool portfolio companies, and incumbents are defining the agent-infrastructure category before pure-plays can name it. The three moves this week: rebuild every Claude-dependent portco's gross margin model before June, apply a 20-40% reversibility discount to any AI ARR without contractual lock-in, and concentrate new capital on the observability and governance layers that fix what the model providers won't.

Frequently asked

How should I adjust gross margin assumptions for Claude-dependent portfolio companies?: Rebuild COGS models using full API-rate billing as the base case, since the dollar-matched credit conversion eliminated the 70-90% subsidy third-party harnesses like Cline, Codebuff, and OpenCode were running on. Companies that haven't flagged the hit have likely lost 20-40% of effective runway since May 12-13. Demand written answers by month-end on whether they're defending via proprietary workflow data, open-source distribution, or bounded-execution security.
Why is ServiceNow's Anthropic budget blowout a signal beyond one customer?: It proves enterprise AI spend is structurally unmonitored and contractually reversible. Anthropic ships no per-user telemetry, no SLAs, and no enterprise dashboard, so even sophisticated buyers like ServiceNow burned a full-year budget by May. That's why Google Cloud, OpenAI/Bain's DeployCo, and Salesforce are all staffing forward-deployed engineers — deployment, not capability, is the bottleneck, and the margin is migrating to that layer.
What discount should be applied to LLM-layer ARR multiples right now?: Apply a 20-40% reversibility haircut to enterprise ARR at any model-layer company lacking SLAs, telemetry, or contractual lock-in. The on-record CIO complaints (Romack, Mehta, National Life) are the sentiment that precedes RFP cycles, not renewal cycles, so the cliff-shaped churn risk isn't yet priced in. Renewals through 2026 may hold, but the mispricing is worth acting on before it resolves.
Where is the cleanest entry point in agent infrastructure given incumbent moves?: The interop and governance layer — MCP gateways, agent identity and auth, agent observability, and knowledge-graph tooling — is the seed-to-Series A window with a 6-12 month runway before SAP's €100M fund and ServiceNow's Action Fabric compress entry multiples. Standalone orchestration is already fragmenting toward M&A exits, and platform-layer bets are publicly re-rated. Vertical workflow AI offers a longer 12-18 month window where incumbent priority is lower.
What does the Lemkin Salesforce data point actually validate?: It validates consumption-based GTM at higher ARPU against collapsing seat counts: 10+ human seats compressed to 2 humans plus 1 API seat, while spend rose 83% from $12K to $22K with 20+ agents underneath. This is the template for upcoming SaaS earnings calls, so seat-based SaaS positions should be stress-tested at constant, +30%, and -30% seat-value scenarios. Positions that fail need trimming before the pattern goes mainstream.

◆ Same day, different angle

Read this day as…

◆ Recent in investor

AnthropicCreditShiftGutsAICodingToolMargins

◆ INTELLIGENCE MAP

◆ DEEP DIVES

What Happened

Why This Matters for Your Book

The Counter-Thesis

What To Do

The Proof Point

The Pattern Is Larger Than Anthropic

The Investable Gap

What This Means for Marks