Why does Cursor building Composer 2 on Kimi K2.5 matter for the AI moat thesis?

It signals that a flagship US developer-tools company is routing production workloads through Chinese open-source infrastructure, which undermines the thesis that proprietary frontier model access constitutes a durable moat. When a venture darling makes that call, frontier capability is moving into the commodity column, and moats shift toward distribution, proprietary data, and workflow lock-in.

How should I think about gross margins on AI application companies in the portfolio?

Re-underwrite at 17-30% gross margin sensitivity rather than the 70% SaaS comps assume. Personalized inference breaks caching and multi-tenancy leverage, reasoning models burn 10-100x more tokens per query, and three frontier labs shipped human-heavy Forward-Deployed Engineer motions in the same week — confirming services-shaped delivery, not clean software economics.

What is the immediate risk for funds holding Anthropic exposure through secondary platforms?

Anthropic has publicly voided transfers routed through Hiive, Forge, Sydecar, Lionheart Ventures, Open Doors Partners, and Unicorns Exchange, meaning SPV paper through these venues may not be recognized by the issuer. That creates a 20-30% markdown risk and a materiality issue for this quarter's NAV — legal opinions on transfer enforceability should be obtained before month-end marks.

Where is the open investment window in the agent stack right now?

Agent governance, authorization, verification, and memory layers are still pre-consensus, with no named category leader. Microsoft Agent 365 and Cloudflare Artifacts are claiming pieces, but vendor-neutral governance and 'Okta for agents' authorization remain greenfield for the next 12-18 months before Series A multiples inflate 3-4x once the analyst category is named.

Does the Microsoft-OpenAI restructure change the Microsoft AI thesis?

Yes, materially. Capping Microsoft's revenue share at $38B and freeing OpenAI to diversify across AWS and Google unwinds the exclusivity premium that has been priced into MSFT since 2023. Combined with Anthropic's AWS GA launch at full feature parity, AWS is now first-class infrastructure for two of the three top labs, and the exclusivity survives only in form through 2032.

Edition 2026-05-14 · read as Investor

ChineseLabsErodeHyperscalerPricingPowerat20xLowerCost

Sources: 32
Words: 1,890
Read: 9min

Topics AI Capital LLM Inference Agentic AI

◆ The signal

Chinese labs are shipping frontier-adjacent models at roughly ten to twenty-eight times lower cost while holding fifty to seventy percent gross margins, and the US hyperscalers are committing north of a hundred billion dollars to compute on the assumption that pricing power holds. It probably does not, or rather, it holds in the places the Chinese labs cannot reach, which is a smaller set than the capex plans assume. Cursor built Composer 2 on Moonshot's Kimi K2.5. That is the tell.

Key facts

Chinese AI labs are shipping models 6-8 months behind the frontier at 10-28x lower prices while holding 50-70% gross margins, with DeepSeek V4 Pro priced at $0.43 per million input tokens.
Cursor built Composer 2 on MoonshotAI's Kimi K2.5, signaling a flagship US developer-tools company is routing workloads through Chinese open-source infrastructure.
AI-native software gross margins appear to cap near 17% versus the 70% SaaS comp, as reasoning models burn 10-100x more tokens per query and enterprise deployment requires Forward-Deployed Engineers.
Anthropic voided secondary transfers routed through Hiive, Forge, Sydecar, Lionheart Ventures, Open Doors Partners, and Unicorns Exchange, while public AI exposure vehicles trade at 2-3x NAV of underlying private shares.
Microsoft's revised OpenAI agreement capped its revenue share at $38 billion and freed OpenAI to diversify across AWS and Google, while Anthropic launched on AWS with full feature parity.

◆ INTELLIGENCE MAP

01
China's Efficiency Moat Reprices the US AI Stack
act now
Chinese labs extract 4-7x more intelligence per FLOP, ship models 6-8 months behind frontier at 10-28x lower prices while sustaining 50-70% margins. Token volume in China (~9 quadrillion/mo) is 2x US (~4 quadrillion/mo). DeepSeek raised external capital for the first time — the most efficient player is scaling.
10-28x
price gap at parity
3
sources
- China token volume
- US token volume
- Capability gap
- China 2025 funding
- US 2025 funding
1. DeepSeek V4 Pro0.43
2. GLM-51
3. Kimi K2.50.95
4. Claude Opus 4.612
02
AI App Margin Ceiling: 17% Gross, Not 70% SaaS
act now
Multiple sources converge on a structural math problem: personalized AI inference kills caching/multi-tenancy leverage, reasoning models burn 10-100x tokens, and deployment requires FDE-heavy services. The modeled ceiling: 1M users × $120 ARPU = 17% gross margin. Only four escape routes exist: cost-cloning (no moat), niche, luxury ($20K+ seats), or vertical integration into atoms.
17%
AI-native gross margin cap
6
sources
- SaaS comparison
- AI-native modeled
- Net margin at scale
- Token spend growth
1. Traditional SaaS70
2. AI-Native Apps17
03
Anthropic SPV Enforceability Crisis + 2-3x NAV Retail Froth
monitor
Anthropic publicly voided secondary transfers through Hiive, Forge, Sydecar, Lionheart, Open Doors, and Unicorns Exchange — a liquidity event in reverse. Simultaneously, publicly traded AI exposure vehicles trade at 2-3x NAV of underlying private shares. This creates a rare window: sell into the retail premium before enforcement closes the exits.
2-3x
NAV premium on AI funds
4
sources
- Platforms voided
- Retail vehicle premium
- OpenAI secondary mark
- MSFT-OpenAI rev cap
1. Private share FMV100
2. Retail vehicle price250
3. Post-enforcement SPV70
04
Stagflation Tape Threatens Private AI Marks
monitor
April CPI hit 3.8% against 3.6% wage growth — first negative real wages since 2023. Warsh confirmed 51-45, takes chair Friday. Traders pivoted from pricing cuts to a rate hike by year-end. 10Y jumped to 4.46%. Late-stage AI marks holding at 40-100x ARR cannot survive a sustained rate move without a forced re-pricing event.
3.8%
April CPI headline
1
sources
- CPI headline
- Wage growth
- Core CPI
- Energy YoY
1. Wage growth3.6
2. CPI3.8
3. Core CPI2.8
4. 10Y yield4.46
05
Orbital Compute Gets First Commercial Counterparties
background
Google is in active talks with SpaceX for Project Suncatcher launch capacity. Anthropic wants 'multiple gigawatts' of orbital compute. Varda signed its first commercial pharma deal with United Therapeutics. Three hyperscalers simultaneously exploring orbital compute suggests terrestrial power is a harder constraint than filings admit. Picks-and-shovels window is pre-consensus.
5
sources
- Suncatcher launch
- Anthropic demand
- OEMs exploring
1. SpaceX-Google talksNow
2. Suncatcher launchEarly 2027
3. Anthropic orbital2027-2028
4. SpaceX IPOWeeks away

◆ DEEP DIVES

01
China's Efficiency Arbitrage: Your US AI Book Is Mispriced by 10-28x
The Core Problem
A tour of 14 Chinese AI labs produced something closer to a unit-economics report than an AI story. Export controls did not suppress Chinese AI. They industrialized its efficiency. The labs are extracting 4-7x more intelligence per FLOP than naive scaling predicts, shipping models 6-8 months behind the frontier at 10-28x lower prices, and holding 50-70% gross margins while doing it. CAISI, the US government's own assessment, agrees the capability gap is roughly stable at 6-8 months. The unit-economics gap is structural and widening.
The Numbers That Break the Thesis
DeepSeek V4 Pro clears at $0.43/M input tokens. GLM-5 at $1.00. Kimi K2.5 at $0.95. Claude Opus 4.6's premium tier runs 11-28x higher for comparable-quality work, which is a lot of multiplier to defend. Z.ai reports 50% gross margins at those prices. MiniMax reports 70%. These are not subsidized loss-leaders, or rather, the more interesting version: they are profitable operations running on 8x less compute.
Chinese token volume has reached ~9 quadrillion tokens/month against roughly 4 quadrillion in the US and West. The consumption side is already 2x. Market structure is the other half of the puzzle: 14+ frontier-class labs in China, including Meituan, Ant, and ByteDance, against 5 in the US, fragmenting the chip pool and collectively out-iterating through open-source.
The Bellwether Signal
Cursor built Composer 2 on MoonshotAI's Kimi K2.5. A flagship US developer-tools company, one that sits in dozens of venture portfolios, is already routing workloads through Chinese open-source infrastructure. This is not a margin decision. It is capitulation on the proprietary-frontier-model moat.
When a darling portfolio candidate builds its flagship product on Chinese open-source, the thesis that "access to frontier models" constitutes a moat is no longer supported by the evidence.
Portfolio Implications
The $285B that flowed into US AI startups in 2025 against $12.4B in China was priced on the thesis that capital intensity creates durable pricing power. Chinese efficiency is that thesis's biggest counterexample on the board. Asking whether China will reach the frontier is the wrong framing. Adequate capability at 1/20th the cost collapses premium pricing in most enterprise use cases. That is the actual question, and mostly the answer is no.
CAISI, the lab tour, and Cursor's routing decision all point the same way. The durable moats now live in distribution + proprietary data + workflow lock-in, not raw model capability. Frontier model access belongs in the commodity column in most thesis docs this quarter, if LPs are reading carefully.
The Contrarian Read
China is not catching up on capability. The gap is stable. What is actually happening is that China is winning on unit economics while the US wins on capability, and those are different axes. Premium pricing survives where frontier capability is differentially necessary: complex reasoning and safety-critical agents, plus frontier coding (Claude on F4). Outside that perimeter, the Chinese price floor becomes the global price floor inside 12 months. This is probably wrong in the tails. It is the base case everywhere else.
Action items
- Run gross-margin stress test across every AI portfolio company assuming inference costs drop 80% in 12 months
- Source and diligence 3-5 'efficient inference' startups (kernel optimization, MoE serving, distillation-as-a-service) within 30 days
- Update sector thesis doc: reclassify 'frontier model access' from moat to commodity; elevate 'distribution + proprietary data + workflow lock-in'
- Track DeepSeek external-capital round valuation and syndicate as key comp benchmark
Sources:Azeem Azhar, Exponential View · TLDR Founders · TLDR AI · StrictlyVC

The 17% Margin Ceiling: AI Apps Are Consumer Goods Wearing SaaS Pricing

The Math That Kills SaaS Comps

Six independent sources this week landed on the same uncomfortable arithmetic, which most IC memos are still pretending they haven't read: AI-native software appears to cap at roughly 17% gross margin, or eleven percent net at scale, rather than the seventy percent that every SaaS comp in the deck quietly assumes. The mechanism is not mysterious. Personalized inference kills the caching and multi-tenancy leverage that made SaaS SaaS, reasoning models burn 10-100x more tokens per query (which offsets the per-token price declines everyone keeps citing as salvation), and enterprise deployment keeps needing Forward-Deployed Engineers. The gross margin shape is Palantir, not Snowflake. That is the trade.

The tell is that the evidence is now multi-directional. Google, OpenAI and Anthropic all shipped human-heavy deployment motions in the same week — FDEs, a thing literally branded 'Deployment Companies,' and PE joint ventures. When three frontier labs independently bolt on a consulting arm in the same seven days, they are not announcing a strategy. They are confessing that enterprise AI does not self-serve. The narrative premium in the stock prices still assumes clean software economics. The delivery model is services.

Four Escape Routes — Only Two Are Venture-Scale

Route	Venture-Scale?	Example	Margin Profile
Cost-clone (race to zero)	No	Commodity wrappers	Sub-10%
Niche/lifestyle	No	Bootstrapped AI tools	30-40%
Luxury software ($20K+ seats)	Yes	Bloomberg Terminal model	40-60%
Vertical integration (atoms/services)	Yes	Forus ($1B, Rx automation)	40-60% blended

The a16z Thesis Tell

Seema Amble at a16z went on record calling structural disruption on Salesforce, SAP and Workday, flagging manufacturing, construction, field-service and accounting as the AI-native SoR opportunity. Salesforce's 'headless product' launch reads correctly as defensive, or rather, as the most defensive thing an incumbent can do: you do not rebrand your APIs as products unless you are worried about whoever is building the UI above them. Per-seat pricing breaks structurally when the seat is an agent.

If a founder can't credibly answer 'which of the four escape routes are you?' — it's a margin trap, no matter the ARR growth rate.

The Counter-Narrative

Aaron Levie at Box has the opposite trade, and it is not a silly one. His view is that data-layer SaaS benefits from agent proliferation — agent activity moves from ten percent of workload today to ninety percent within three years, which drives multiplicative demand on data platforms through a seats + consumption stacking model. Engineering job openings are at a three-year high against 46K March layoffs, so the labor story is redistribution, not destruction. Token spend going from roughly one percent to ten percent of company opex opens a greenfield 'AI FinOps' category that currently has no incumbent.

The resolution, probably wrong but worth stating: data-moat incumbents win, thin wrappers die. The SaaS trade is not universally short. It is bifurcated. Overweight the data layer (Snowflake, Databricks, Box-class). Underweight thin-workflow SaaS without a consumption-pricing option. The rest is noise.

Action items

Re-underwrite every AI application company in portfolio and active pipeline using 17-30% gross margin sensitivity, not 70%
Require every AI portco to present a consumption-tier roadmap before 2027 renewal cycles at next board meeting
Source 3-5 seed/Series A companies building AI FinOps / token allocation governance tooling this quarter
Accelerate diligence on AI-native SoR in a16z-flagged verticals (manufacturing, construction, field-service, accounting)

Sources:TLDR Founders · a16z · The Information AM · Casey Newton · Not Boring · StrictlyVC

03
Anthropic SPV Crisis + Retail NAV Froth = Sell Into the Premium Now
The Enforceability Problem
Anthropic has publicly voided secondary transfers routed through Hiive, Forge, Sydecar, Lionheart Ventures, Open Doors Partners, and Unicorns Exchange, which is either housekeeping or — to use a more honest description — a NAV question for any fund carrying Anthropic exposure through those platforms. The gap between what an SPV says it owns and what the issuer will recognize has widened meaningfully in the last month. If the company refuses to honor the transfer, the paper marks down quietly regardless of what the secondary market is printing.
Three ways this goes. The secondary market formalizes and SPV holders are fine. Issuers win enforcement and paper marks down twenty to thirty percent. Or governance noise drags restrictions into a broader sector retrace. The middle outcome is the one being underpriced.
The Retail Froth Signal
Meanwhile, the publicly traded AI exposure vehicles offering retail synthetic exposure to Anthropic, OpenAI, and SpaceX now trade at two to three times the NAV of their underlying private shares, some of them doubling within days. This is the textbook signature of a marginal-buyer shift: institutional discipline has been priced out and retail FOMO is setting the clearing print.
When retail is paying 2-3x NAV for private AI exposure while the issuer is voiding the platforms that sold it to them, the trade is to distribute into the premium before the enforcement catches up.
The Microsoft-OpenAI Restructure
The revised agreement capped Microsoft's revenue share at thirty-eight billion dollars and freed OpenAI to diversify infrastructure across AWS and Google. That quietly unwinds every Microsoft AI moat thesis written since 2023. Anthropic's simultaneous AWS GA launch with full feature parity means AWS is now first-class frontier infrastructure for two of the three most valuable labs. The Microsoft AI exclusivity premium is over in substance, even as it is preserved in form through 2032.
The Opportunity Window
For anyone sitting on Anthropic, OpenAI, or SpaceX secondaries, the combination of peak demand, company-issued platform warnings, and a 2-3x retail vehicle premium is a textbook trim signal. This is probably wrong if the retail bid deepens rather than exhausts, but in-kind distributions, structured secondaries, and direct sales into retail-driven vehicles are the rare window where selling consensus pays above private FMV. The shelf life is weeks, not quarters.
What Changes This Week
Anyone holding Anthropic exposure via the six named platforms should obtain a legal opinion on transfer enforceability before month-end NAV marks. It is a materiality issue for this quarter's reporting, not a line item for the next LP meeting.
Action items
- Audit all portfolio and fund-level Anthropic exposure held via SPVs through Hiive, Forge, Sydecar, Lionheart, Open Doors, Unicorns Exchange — obtain legal opinion on enforceability by May 23
- Run exit-path analysis on all frontier AI secondary positions against the 2-3x retail vehicle premium — model in-kind distribution or structured sale
- Downgrade Microsoft AI exclusivity premium in all models using MSFT-OpenAI axis — the $38B cap + AWS/GCP unlock structurally ends exclusivity
- Prepare LP communication on Anthropic SPV enforceability risk before quarter-end if exposure exceeds materiality threshold
Sources:StrictlyVC · AI Breakfast · Techpresso · AINews · Martin Peers

Agent Governance Is a Category Before It Has a TAM — Front-Run It

The Governance Layer Gets Claimed

Microsoft shipped Agent 365 plus the Work IQ API this week and told anyone paying attention that the governance and orchestration layer for multi-agent enterprise workflows belongs to them. Cloudflare put Artifacts into beta with Git-like versioning, rollback, and audit for AI agent outputs. Versa shipped agent policy controls. GitLab restructured into 60 teams around the same problem. This is what a category looks like in the six to twelve months before Gartner writes the Magic Quadrant.

The demand signal is visible in the same week. Datadog's telemetry across more than 1,000 organizations shows agent framework adoption has doubled year on year, which is the headline number. Underneath it, multi-model is now the default posture and LLM tech debt is compounding faster than teams retire it. Atlassian's survey has 74% of IT leaders citing AI security as a major barrier, while 79% of the same group are exploring AI incident management. Adoption is outrunning controls.

Where This Sits in the Stack

Layer	Status	Investor Window
Agent orchestration (Claude Code, Cursor)	Late-stage, consensus	Closed at good prices
Agent governance/audit	Category forming	Open 12-18 months
Agent verification	Unsolved — no clear leader	Greenfield
Agent memory/state	Seed-stage (Agentmemory, Statewright)	Pre-seed entry open

The 'Okta for Agents' Opportunity

a16z calls agent authorization "one of the hardest unsolved problems," which is the language partners use when they want the category named after their portfolio company. No named leader exists yet. The shape rhymes with the 2010 Okta setup, where seed checks priced the category before it had a label. This is probably wrong, but the counter-thesis worth respecting is that identity categories sometimes take a decade to resolve. Build the target list anyway.

The supply-chain attacks that landed the same week sharpen the urgency, or rather, make it legible to people who had been ignoring it. Mini Shai-Hulud hit Mistral, TanStack, and UiPath packages with persistence in Claude Code and VS Code settings files, across 121 compromised npm packages, including what is being called the first confirmed AI-assisted zero-day. Agents with code deployment authority plus compromised packages equals a board-level risk most portfolios have not audited.

Agent governance is roughly where container orchestration was in 2015: multiple vendors shipping into it, no narrative dominates yet, TAM still pre-consensus. By the time the analyst note lands, Series A pricing will be 3-4x current levels.

Action items

Build a shortlist of 10+ vendor-neutral agent governance / observability / policy-enforcement startups within 30 days
Issue portfolio-wide advisory: require SBOM + agent-identity controls for any portco running production agentic workflows
Source 2-3 AI code verification startups (formal specs, AI-checks-AI pipelines, agent output validation) this quarter
Initiate conversations with Cloudflare Artifacts team to understand their agent versioning roadmap and buy-vs-build calculus for startups in the space

Sources:TLDR IT · CyberScoop · Risky.Biz · TLDR DevOps · The Pragmatic Engineer · TLDR

◆ QUICK HITS

Update: Anthropic ~$950B valuation confirmed across multiple sources — company actively voiding SPV transfers through 6 named platforms; enforce legal review on any exposure held through these vehicles before month-end
StrictlyVC
OpenAI deprecated finetuning APIs — value migrating from model customization to agent runtimes; mark down finetuning-platform portfolio exposure and reallocate toward 'Git-for-agents' primitives
AINews
Cursor cut vector DB costs 95% by switching from incumbent to turbopuffer — first-gen vector DBs (Pinecone/Weaviate class) face churn risk as AI coding tools scale past early workloads
The Pragmatic Engineer
Warsh confirmed 51-45 as Fed governor, takes chair Friday — April CPI at 3.8% vs wages 3.6% means first negative real wages since 2023; close any open bridge or extension rounds before June FOMC
Morning Brew
AWS Redshift RG eliminated $5/TB Spectrum scanning fee and cut vCPU cost 30% — reprice any active data warehouse or lakehouse deal against this new hyperscaler floor
TLDR DevOps
a16z's Amp hit $1.3B valuation as a compute-pooling vehicle backed by YC and unnamed cloud providers — if it works, hyperscaler pricing power softens AND wrapper defensibility erodes simultaneously
StrictlyVC
Datadog telemetry from 1,000+ orgs confirms multi-model is default posture and agent framework adoption doubled — Amazon 'tokenmaxxing' episode proves usage metrics are contaminated, demand task-completion proof in diligence
TLDR
Varda signed first commercial pharma deal with United Therapeutics for microgravity drug crystallization — space manufacturing crossed from thesis to revenue in one contract
The Download from MIT Technology Review

◆ Bottom line

The take.

Chinese AI labs are profitable at 10-28x below US frontier pricing while holding 50-70% margins — meaning either Western prices compress or Western multiples do, and you have about 12 months to decide which. Run the margin stress test across every AI portfolio company this week: if the business doesn't survive at Chinese price floors, it's a markdown waiting to happen. The durable moats are now distribution, proprietary data, and workflow lock-in — not model access, not compute scale, and definitely not the 70% gross margins your SaaS comps still assume.

Frequently asked

Why does Cursor building Composer 2 on Kimi K2.5 matter for the AI moat thesis?: It signals that a flagship US developer-tools company is routing production workloads through Chinese open-source infrastructure, which undermines the thesis that proprietary frontier model access constitutes a durable moat. When a venture darling makes that call, frontier capability is moving into the commodity column, and moats shift toward distribution, proprietary data, and workflow lock-in.
How should I think about gross margins on AI application companies in the portfolio?: Re-underwrite at 17-30% gross margin sensitivity rather than the 70% SaaS comps assume. Personalized inference breaks caching and multi-tenancy leverage, reasoning models burn 10-100x more tokens per query, and three frontier labs shipped human-heavy Forward-Deployed Engineer motions in the same week — confirming services-shaped delivery, not clean software economics.
What is the immediate risk for funds holding Anthropic exposure through secondary platforms?: Anthropic has publicly voided transfers routed through Hiive, Forge, Sydecar, Lionheart Ventures, Open Doors Partners, and Unicorns Exchange, meaning SPV paper through these venues may not be recognized by the issuer. That creates a 20-30% markdown risk and a materiality issue for this quarter's NAV — legal opinions on transfer enforceability should be obtained before month-end marks.
Where is the open investment window in the agent stack right now?: Agent governance, authorization, verification, and memory layers are still pre-consensus, with no named category leader. Microsoft Agent 365 and Cloudflare Artifacts are claiming pieces, but vendor-neutral governance and 'Okta for agents' authorization remain greenfield for the next 12-18 months before Series A multiples inflate 3-4x once the analyst category is named.
Does the Microsoft-OpenAI restructure change the Microsoft AI thesis?: Yes, materially. Capping Microsoft's revenue share at $38B and freeing OpenAI to diversify across AWS and Google unwinds the exclusivity premium that has been priced into MSFT since 2023. Combined with Anthropic's AWS GA launch at full feature parity, AWS is now first-class infrastructure for two of the three top labs, and the exclusivity survives only in form through 2032.

◆ Same day, different angle

Read this day as…

◆ Recent in investor

ChineseLabsErodeHyperscalerPricingPowerat20xLowerCost

◆ INTELLIGENCE MAP

◆ DEEP DIVES

The Core Problem

The Numbers That Break the Thesis

The Bellwether Signal

Portfolio Implications

The Contrarian Read

The Math That Kills SaaS Comps

Four Escape Routes — Only Two Are Venture-Scale

The a16z Thesis Tell

The Counter-Narrative

The Enforceability Problem

The Retail Froth Signal

The Microsoft-OpenAI Restructure

The Opportunity Window

What Changes This Week