Investor daily

Edition 2026-05-14 · read as Investor

ChineseLabsErodeHyperscalerPricingPowerat20xLowerCost

Sources
32
Words
1,890
Read
9min

Topics AI Capital LLM Inference Agentic AI

◆ The signal

Chinese labs are shipping frontier-adjacent models at roughly ten to twenty-eight times lower cost while holding fifty to seventy percent gross margins, and the US hyperscalers are committing north of a hundred billion dollars to compute on the assumption that pricing power holds. It probably does not, or rather, it holds in the places the Chinese labs cannot reach, which is a smaller set than the capex plans assume. Cursor built Composer 2 on Moonshot's Kimi K2.5. That is the tell.

◆ INTELLIGENCE MAP

  1. 01

    China's Efficiency Moat Reprices the US AI Stack

    act now

    Chinese labs extract 4-7x more intelligence per FLOP, ship models 6-8 months behind frontier at 10-28x lower prices while sustaining 50-70% margins. Token volume in China (~9 quadrillion/mo) is 2x US (~4 quadrillion/mo). DeepSeek raised external capital for the first time — the most efficient player is scaling.

    10-28x
    price gap at parity
    3
    sources
    • China token volume
    • US token volume
    • Capability gap
    • China 2025 funding
    • US 2025 funding
    1. DeepSeek V4 Pro0.43
    2. GLM-51
    3. Kimi K2.50.95
    4. Claude Opus 4.612
  2. 02

    AI App Margin Ceiling: 17% Gross, Not 70% SaaS

    act now

    Multiple sources converge on a structural math problem: personalized AI inference kills caching/multi-tenancy leverage, reasoning models burn 10-100x tokens, and deployment requires FDE-heavy services. The modeled ceiling: 1M users × $120 ARPU = 17% gross margin. Only four escape routes exist: cost-cloning (no moat), niche, luxury ($20K+ seats), or vertical integration into atoms.

    17%
    AI-native gross margin cap
    6
    sources
    • SaaS comparison
    • AI-native modeled
    • Net margin at scale
    • Token spend growth
    1. Traditional SaaS70
    2. AI-Native Apps17
  3. 03

    Anthropic SPV Enforceability Crisis + 2-3x NAV Retail Froth

    monitor

    Anthropic publicly voided secondary transfers through Hiive, Forge, Sydecar, Lionheart, Open Doors, and Unicorns Exchange — a liquidity event in reverse. Simultaneously, publicly traded AI exposure vehicles trade at 2-3x NAV of underlying private shares. This creates a rare window: sell into the retail premium before enforcement closes the exits.

    2-3x
    NAV premium on AI funds
    4
    sources
    • Platforms voided
    • Retail vehicle premium
    • OpenAI secondary mark
    • MSFT-OpenAI rev cap
    1. Private share FMV100
    2. Retail vehicle price250
    3. Post-enforcement SPV70
  4. 04

    Stagflation Tape Threatens Private AI Marks

    monitor

    April CPI hit 3.8% against 3.6% wage growth — first negative real wages since 2023. Warsh confirmed 51-45, takes chair Friday. Traders pivoted from pricing cuts to a rate hike by year-end. 10Y jumped to 4.46%. Late-stage AI marks holding at 40-100x ARR cannot survive a sustained rate move without a forced re-pricing event.

    3.8%
    April CPI headline
    1
    sources
    • CPI headline
    • Wage growth
    • Core CPI
    • Energy YoY
    1. Wage growth3.6
    2. CPI3.8
    3. Core CPI2.8
    4. 10Y yield4.46
  5. 05

    Orbital Compute Gets First Commercial Counterparties

    background

    Google is in active talks with SpaceX for Project Suncatcher launch capacity. Anthropic wants 'multiple gigawatts' of orbital compute. Varda signed its first commercial pharma deal with United Therapeutics. Three hyperscalers simultaneously exploring orbital compute suggests terrestrial power is a harder constraint than filings admit. Picks-and-shovels window is pre-consensus.

    5
    sources
    • Suncatcher launch
    • Anthropic demand
    • OEMs exploring
    1. SpaceX-Google talksNow
    2. Suncatcher launchEarly 2027
    3. Anthropic orbital2027-2028
    4. SpaceX IPOWeeks away

◆ DEEP DIVES

  1. 01

    China's Efficiency Arbitrage: Your US AI Book Is Mispriced by 10-28x

    The Core Problem

    A tour of 14 Chinese AI labs produced something closer to a unit-economics report than an AI story. Export controls did not suppress Chinese AI. They industrialized its efficiency. The labs are extracting 4-7x more intelligence per FLOP than naive scaling predicts, shipping models 6-8 months behind the frontier at 10-28x lower prices, and holding 50-70% gross margins while doing it. CAISI, the US government's own assessment, agrees the capability gap is roughly stable at 6-8 months. The unit-economics gap is structural and widening.

    The Numbers That Break the Thesis

    DeepSeek V4 Pro clears at $0.43/M input tokens. GLM-5 at $1.00. Kimi K2.5 at $0.95. Claude Opus 4.6's premium tier runs 11-28x higher for comparable-quality work, which is a lot of multiplier to defend. Z.ai reports 50% gross margins at those prices. MiniMax reports 70%. These are not subsidized loss-leaders, or rather, the more interesting version: they are profitable operations running on 8x less compute.

    Chinese token volume has reached ~9 quadrillion tokens/month against roughly 4 quadrillion in the US and West. The consumption side is already 2x. Market structure is the other half of the puzzle: 14+ frontier-class labs in China, including Meituan, Ant, and ByteDance, against 5 in the US, fragmenting the chip pool and collectively out-iterating through open-source.

    The Bellwether Signal

    Cursor built Composer 2 on MoonshotAI's Kimi K2.5. A flagship US developer-tools company, one that sits in dozens of venture portfolios, is already routing workloads through Chinese open-source infrastructure. This is not a margin decision. It is capitulation on the proprietary-frontier-model moat.

    When a darling portfolio candidate builds its flagship product on Chinese open-source, the thesis that "access to frontier models" constitutes a moat is no longer supported by the evidence.

    Portfolio Implications

    The $285B that flowed into US AI startups in 2025 against $12.4B in China was priced on the thesis that capital intensity creates durable pricing power. Chinese efficiency is that thesis's biggest counterexample on the board. Asking whether China will reach the frontier is the wrong framing. Adequate capability at 1/20th the cost collapses premium pricing in most enterprise use cases. That is the actual question, and mostly the answer is no.

    CAISI, the lab tour, and Cursor's routing decision all point the same way. The durable moats now live in distribution + proprietary data + workflow lock-in, not raw model capability. Frontier model access belongs in the commodity column in most thesis docs this quarter, if LPs are reading carefully.


    The Contrarian Read

    China is not catching up on capability. The gap is stable. What is actually happening is that China is winning on unit economics while the US wins on capability, and those are different axes. Premium pricing survives where frontier capability is differentially necessary: complex reasoning and safety-critical agents, plus frontier coding (Claude on F4). Outside that perimeter, the Chinese price floor becomes the global price floor inside 12 months. This is probably wrong in the tails. It is the base case everywhere else.

    Action items

    • Run gross-margin stress test across every AI portfolio company assuming inference costs drop 80% in 12 months
    • Source and diligence 3-5 'efficient inference' startups (kernel optimization, MoE serving, distillation-as-a-service) within 30 days
    • Update sector thesis doc: reclassify 'frontier model access' from moat to commodity; elevate 'distribution + proprietary data + workflow lock-in'
    • Track DeepSeek external-capital round valuation and syndicate as key comp benchmark

    Sources:Azeem Azhar, Exponential View · TLDR Founders · TLDR AI · StrictlyVC

  2. 02

    The 17% Margin Ceiling: AI Apps Are Consumer Goods Wearing SaaS Pricing

    The Math That Kills SaaS Comps

    Six independent sources this week landed on the same uncomfortable arithmetic, which most IC memos are still pretending they haven't read: AI-native software appears to cap at roughly 17% gross margin, or eleven percent net at scale, rather than the seventy percent that every SaaS comp in the deck quietly assumes. The mechanism is not mysterious. Personalized inference kills the caching and multi-tenancy leverage that made SaaS SaaS, reasoning models burn 10-100x more tokens per query (which offsets the per-token price declines everyone keeps citing as salvation), and enterprise deployment keeps needing Forward-Deployed Engineers. The gross margin shape is Palantir, not Snowflake. That is the trade.

    The tell is that the evidence is now multi-directional. Google, OpenAI and Anthropic all shipped human-heavy deployment motions in the same week — FDEs, a thing literally branded 'Deployment Companies,' and PE joint ventures. When three frontier labs independently bolt on a consulting arm in the same seven days, they are not announcing a strategy. They are confessing that enterprise AI does not self-serve. The narrative premium in the stock prices still assumes clean software economics. The delivery model is services.

    Four Escape Routes — Only Two Are Venture-Scale

    RouteVenture-Scale?ExampleMargin Profile
    Cost-clone (race to zero)NoCommodity wrappersSub-10%
    Niche/lifestyleNoBootstrapped AI tools30-40%
    Luxury software ($20K+ seats)YesBloomberg Terminal model40-60%
    Vertical integration (atoms/services)YesForus ($1B, Rx automation)40-60% blended

    The a16z Thesis Tell

    Seema Amble at a16z went on record calling structural disruption on Salesforce, SAP and Workday, flagging manufacturing, construction, field-service and accounting as the AI-native SoR opportunity. Salesforce's 'headless product' launch reads correctly as defensive, or rather, as the most defensive thing an incumbent can do: you do not rebrand your APIs as products unless you are worried about whoever is building the UI above them. Per-seat pricing breaks structurally when the seat is an agent.

    If a founder can't credibly answer 'which of the four escape routes are you?' — it's a margin trap, no matter the ARR growth rate.

    The Counter-Narrative

    Aaron Levie at Box has the opposite trade, and it is not a silly one. His view is that data-layer SaaS benefits from agent proliferation — agent activity moves from ten percent of workload today to ninety percent within three years, which drives multiplicative demand on data platforms through a seats + consumption stacking model. Engineering job openings are at a three-year high against 46K March layoffs, so the labor story is redistribution, not destruction. Token spend going from roughly one percent to ten percent of company opex opens a greenfield 'AI FinOps' category that currently has no incumbent.

    The resolution, probably wrong but worth stating: data-moat incumbents win, thin wrappers die. The SaaS trade is not universally short. It is bifurcated. Overweight the data layer (Snowflake, Databricks, Box-class). Underweight thin-workflow SaaS without a consumption-pricing option. The rest is noise.

    Action items

    • Re-underwrite every AI application company in portfolio and active pipeline using 17-30% gross margin sensitivity, not 70%
    • Require every AI portco to present a consumption-tier roadmap before 2027 renewal cycles at next board meeting
    • Source 3-5 seed/Series A companies building AI FinOps / token allocation governance tooling this quarter
    • Accelerate diligence on AI-native SoR in a16z-flagged verticals (manufacturing, construction, field-service, accounting)

    Sources:TLDR Founders · a16z · The Information AM · Casey Newton · Not Boring · StrictlyVC

  3. 03

    Anthropic SPV Crisis + Retail NAV Froth = Sell Into the Premium Now

    The Enforceability Problem

    Anthropic has publicly voided secondary transfers routed through Hiive, Forge, Sydecar, Lionheart Ventures, Open Doors Partners, and Unicorns Exchange, which is either housekeeping or — to use a more honest description — a NAV question for any fund carrying Anthropic exposure through those platforms. The gap between what an SPV says it owns and what the issuer will recognize has widened meaningfully in the last month. If the company refuses to honor the transfer, the paper marks down quietly regardless of what the secondary market is printing.

    Three ways this goes. The secondary market formalizes and SPV holders are fine. Issuers win enforcement and paper marks down twenty to thirty percent. Or governance noise drags restrictions into a broader sector retrace. The middle outcome is the one being underpriced.

    The Retail Froth Signal

    Meanwhile, the publicly traded AI exposure vehicles offering retail synthetic exposure to Anthropic, OpenAI, and SpaceX now trade at two to three times the NAV of their underlying private shares, some of them doubling within days. This is the textbook signature of a marginal-buyer shift: institutional discipline has been priced out and retail FOMO is setting the clearing print.

    When retail is paying 2-3x NAV for private AI exposure while the issuer is voiding the platforms that sold it to them, the trade is to distribute into the premium before the enforcement catches up.

    The Microsoft-OpenAI Restructure

    The revised agreement capped Microsoft's revenue share at thirty-eight billion dollars and freed OpenAI to diversify infrastructure across AWS and Google. That quietly unwinds every Microsoft AI moat thesis written since 2023. Anthropic's simultaneous AWS GA launch with full feature parity means AWS is now first-class frontier infrastructure for two of the three most valuable labs. The Microsoft AI exclusivity premium is over in substance, even as it is preserved in form through 2032.

    The Opportunity Window

    For anyone sitting on Anthropic, OpenAI, or SpaceX secondaries, the combination of peak demand, company-issued platform warnings, and a 2-3x retail vehicle premium is a textbook trim signal. This is probably wrong if the retail bid deepens rather than exhausts, but in-kind distributions, structured secondaries, and direct sales into retail-driven vehicles are the rare window where selling consensus pays above private FMV. The shelf life is weeks, not quarters.

    What Changes This Week

    Anyone holding Anthropic exposure via the six named platforms should obtain a legal opinion on transfer enforceability before month-end NAV marks. It is a materiality issue for this quarter's reporting, not a line item for the next LP meeting.

    Action items

    • Audit all portfolio and fund-level Anthropic exposure held via SPVs through Hiive, Forge, Sydecar, Lionheart, Open Doors, Unicorns Exchange — obtain legal opinion on enforceability by May 23
    • Run exit-path analysis on all frontier AI secondary positions against the 2-3x retail vehicle premium — model in-kind distribution or structured sale
    • Downgrade Microsoft AI exclusivity premium in all models using MSFT-OpenAI axis — the $38B cap + AWS/GCP unlock structurally ends exclusivity
    • Prepare LP communication on Anthropic SPV enforceability risk before quarter-end if exposure exceeds materiality threshold

    Sources:StrictlyVC · AI Breakfast · Techpresso · AINews · Martin Peers

  4. 04

    Agent Governance Is a Category Before It Has a TAM — Front-Run It

    The Governance Layer Gets Claimed

    Microsoft shipped Agent 365 plus the Work IQ API this week and told anyone paying attention that the governance and orchestration layer for multi-agent enterprise workflows belongs to them. Cloudflare put Artifacts into beta with Git-like versioning, rollback, and audit for AI agent outputs. Versa shipped agent policy controls. GitLab restructured into 60 teams around the same problem. This is what a category looks like in the six to twelve months before Gartner writes the Magic Quadrant.

    The demand signal is visible in the same week. Datadog's telemetry across more than 1,000 organizations shows agent framework adoption has doubled year on year, which is the headline number. Underneath it, multi-model is now the default posture and LLM tech debt is compounding faster than teams retire it. Atlassian's survey has 74% of IT leaders citing AI security as a major barrier, while 79% of the same group are exploring AI incident management. Adoption is outrunning controls.

    Where This Sits in the Stack

    LayerStatusInvestor Window
    Agent orchestration (Claude Code, Cursor)Late-stage, consensusClosed at good prices
    Agent governance/auditCategory formingOpen 12-18 months
    Agent verificationUnsolved — no clear leaderGreenfield
    Agent memory/stateSeed-stage (Agentmemory, Statewright)Pre-seed entry open

    The 'Okta for Agents' Opportunity

    a16z calls agent authorization "one of the hardest unsolved problems," which is the language partners use when they want the category named after their portfolio company. No named leader exists yet. The shape rhymes with the 2010 Okta setup, where seed checks priced the category before it had a label. This is probably wrong, but the counter-thesis worth respecting is that identity categories sometimes take a decade to resolve. Build the target list anyway.

    The supply-chain attacks that landed the same week sharpen the urgency, or rather, make it legible to people who had been ignoring it. Mini Shai-Hulud hit Mistral, TanStack, and UiPath packages with persistence in Claude Code and VS Code settings files, across 121 compromised npm packages, including what is being called the first confirmed AI-assisted zero-day. Agents with code deployment authority plus compromised packages equals a board-level risk most portfolios have not audited.

    Agent governance is roughly where container orchestration was in 2015: multiple vendors shipping into it, no narrative dominates yet, TAM still pre-consensus. By the time the analyst note lands, Series A pricing will be 3-4x current levels.

    Action items

    • Build a shortlist of 10+ vendor-neutral agent governance / observability / policy-enforcement startups within 30 days
    • Issue portfolio-wide advisory: require SBOM + agent-identity controls for any portco running production agentic workflows
    • Source 2-3 AI code verification startups (formal specs, AI-checks-AI pipelines, agent output validation) this quarter
    • Initiate conversations with Cloudflare Artifacts team to understand their agent versioning roadmap and buy-vs-build calculus for startups in the space

    Sources:TLDR IT · CyberScoop · Risky.Biz · TLDR DevOps · The Pragmatic Engineer · TLDR

◆ QUICK HITS

  • Update: Anthropic ~$950B valuation confirmed across multiple sources — company actively voiding SPV transfers through 6 named platforms; enforce legal review on any exposure held through these vehicles before month-end

    StrictlyVC

  • OpenAI deprecated finetuning APIs — value migrating from model customization to agent runtimes; mark down finetuning-platform portfolio exposure and reallocate toward 'Git-for-agents' primitives

    AINews

  • Cursor cut vector DB costs 95% by switching from incumbent to turbopuffer — first-gen vector DBs (Pinecone/Weaviate class) face churn risk as AI coding tools scale past early workloads

    The Pragmatic Engineer

  • Warsh confirmed 51-45 as Fed governor, takes chair Friday — April CPI at 3.8% vs wages 3.6% means first negative real wages since 2023; close any open bridge or extension rounds before June FOMC

    Morning Brew

  • AWS Redshift RG eliminated $5/TB Spectrum scanning fee and cut vCPU cost 30% — reprice any active data warehouse or lakehouse deal against this new hyperscaler floor

    TLDR DevOps

  • a16z's Amp hit $1.3B valuation as a compute-pooling vehicle backed by YC and unnamed cloud providers — if it works, hyperscaler pricing power softens AND wrapper defensibility erodes simultaneously

    StrictlyVC

  • Datadog telemetry from 1,000+ orgs confirms multi-model is default posture and agent framework adoption doubled — Amazon 'tokenmaxxing' episode proves usage metrics are contaminated, demand task-completion proof in diligence

    TLDR

  • Varda signed first commercial pharma deal with United Therapeutics for microgravity drug crystallization — space manufacturing crossed from thesis to revenue in one contract

    The Download from MIT Technology Review

◆ Bottom line

The take.

Chinese AI labs are profitable at 10-28x below US frontier pricing while holding 50-70% margins — meaning either Western prices compress or Western multiples do, and you have about 12 months to decide which. Run the margin stress test across every AI portfolio company this week: if the business doesn't survive at Chinese price floors, it's a markdown waiting to happen. The durable moats are now distribution, proprietary data, and workflow lock-in — not model access, not compute scale, and definitely not the 70% gross margins your SaaS comps still assume.

— Promit, reading as Investor ·

Frequently asked

Why does Cursor building Composer 2 on Kimi K2.5 matter for the AI moat thesis?
It signals that a flagship US developer-tools company is routing production workloads through Chinese open-source infrastructure, which undermines the thesis that proprietary frontier model access constitutes a durable moat. When a venture darling makes that call, frontier capability is moving into the commodity column, and moats shift toward distribution, proprietary data, and workflow lock-in.
How should I think about gross margins on AI application companies in the portfolio?
Re-underwrite at 17-30% gross margin sensitivity rather than the 70% SaaS comps assume. Personalized inference breaks caching and multi-tenancy leverage, reasoning models burn 10-100x more tokens per query, and three frontier labs shipped human-heavy Forward-Deployed Engineer motions in the same week — confirming services-shaped delivery, not clean software economics.
What is the immediate risk for funds holding Anthropic exposure through secondary platforms?
Anthropic has publicly voided transfers routed through Hiive, Forge, Sydecar, Lionheart Ventures, Open Doors Partners, and Unicorns Exchange, meaning SPV paper through these venues may not be recognized by the issuer. That creates a 20-30% markdown risk and a materiality issue for this quarter's NAV — legal opinions on transfer enforceability should be obtained before month-end marks.
Where is the open investment window in the agent stack right now?
Agent governance, authorization, verification, and memory layers are still pre-consensus, with no named category leader. Microsoft Agent 365 and Cloudflare Artifacts are claiming pieces, but vendor-neutral governance and 'Okta for agents' authorization remain greenfield for the next 12-18 months before Series A multiples inflate 3-4x once the analyst category is named.
Does the Microsoft-OpenAI restructure change the Microsoft AI thesis?
Yes, materially. Capping Microsoft's revenue share at $38B and freeing OpenAI to diversify across AWS and Google unwinds the exclusivity premium that has been priced into MSFT since 2023. Combined with Anthropic's AWS GA launch at full feature parity, AWS is now first-class infrastructure for two of the three top labs, and the exclusivity survives only in form through 2032.

◆ Same day, different angle

Read this day as…

◆ Recent in investor

Keep reading.