Leader daily

Edition 2026-05-22 · read as Leader

OffensiveAIJustBrokeYourPatchSLAandThreatModel

Sources
36
Words
1,725
Read
9min

Topics Agentic AI AI Capital AI Regulation

◆ The signal

Two data points from this week sit awkwardly together. Anthropic's Mythos cleared both UK AISI end-to-end cyber attack simulations, and TrustedSec showed AI compressing commercial EDR reverse engineering from weeks to days across all five products tested. The defensive premise that offensive AI lags human operators broke in public. Patch SLAs calibrated to a 30-day weaponization window now have to explain a PraisonAI vulnerability that saw active exploitation in four hours. Last quarter's security budget was written against an adversary that no longer exists.

◆ INTELLIGENCE MAP

  1. 01

    Security Architecture Breaks: Full Network Takeover + EDR Transparency

    act now

    AI offensive capability crossed a discontinuity: Mythos cleared both AISI hardest cyber ranges (a first), EDR agents are now architecturally transparent via AI-assisted reversing, and active exploitation windows collapsed to 4 hours. The 18-year NGINX RCE and Foxconn's 8TB exfiltration confirm the defender's model assumed costs that no longer exist.

    4 hrs
    exploit window
    8
    sources
    • EDR reversal time
    • NGINX latent RCE
    • Foxconn data stolen
    • AI honeypot attacks/mo
    • CISA KEV AI entries
    1. Old exploit dev30
    2. New exploit dev0.17
    3. Patch SLA (critical)7
    4. Actual exposure0.17
  2. 02

    Compute Market Being Locked Up Through Bilateral Megadeals

    monitor

    xAI leased 220K GPUs (45% of Colossus) to Anthropic — Musk funding a competitor he publicly despises. Cerebras priced at $56B on a $20B OpenAI anchor. Fervo Energy debuted at $10B+. Microsoft's OpenAI tab passed $100B. The marginal unit of frontier compute now has a named buyer for the decade. The spot market assumption in most AI roadmaps is being deleted.

    $56B
    Cerebras IPO valuation
    7
    sources
    • xAI GPUs leased
    • Cerebras first-day pop
    • OpenAI-Cerebras deal
    • Microsoft→OpenAI total
    • Fervo Energy IPO
    1. Microsoft→OpenAI100
    2. OpenAI→Cerebras20
    3. Fervo (energy)10
    4. Cerebras market cap56
  3. 03

    Agent Execution Layer: Platform Wars Declared Simultaneously

    monitor

    SAP (€100M fund + Knowledge Graph), ServiceNow (headless Action Fabric via MCP), Apple (agent App Store gating), Google (Gemini Intelligence on 3B+ Android devices this summer), and Amazon (Buy for Me cross-retailer agent) all declared claims on the agent execution layer in the same window. The question is no longer whether agents run your workflows. It's whose platform they run through.

    59%
    AI traffic is agentic
    7
    sources
    • Agent token share
    • Android market share
    • GTM value migration
    • SAP fund size
    • Bot bypass rate
    1. Agentic workloads59
    2. Chat/human workloads41
  4. 04

    Enterprise AI Governance Vacuum: Spending Outruns Controls

    act now

    ServiceNow blew its full-year Anthropic budget by May. 85% of organizations are spending millions on agentic AI without adequate data foundations. AI liability is being written in courts now — before legislation exists. Duolingo's retreat quantified the 'slop tax' at 20%. The operating model assumes governance that doesn't exist yet.

    85%
    orgs without AI foundations
    6
    sources
    • Budget blown by
    • Data-ready orgs
    • AI content slop tax
    • Anthropic growth vs plan
    • Data problems: tooling
    1. Enterprise AI readiness15
  5. 05

    AI-Native Org Design: Middle Management Economics Invert

    background

    Lovable dissolved its growth management layer and found VPs voluntarily taking IC roles for autonomy over authority. Cisco's stock rose 15% on AI orders the same day it cut 4,000 jobs. 103,000 tech layoffs by mid-May already approaches 2025's full-year total. The coordination layer that justified management headcount is being compressed to near-zero cost.

    103K
    tech layoffs by mid-May
    4
    sources
    • 2025 full-year cuts
    • Cuts by mid-May
    • Cisco job cuts
    • Cloudflare layoffs
    • VP→IC time savings
    1. Mid-May 2026 layoffs103000
    2. Full year 2025124000

◆ DEEP DIVES

  1. 01

    Your EDR Is Glass and the Adversary Has a 4-Hour Clock — The Security Operating Model Needs Rebuilding

    The capability threshold moved a step function this quarter

    The defensive architecture most security programs are running on was invalidated this week, and the evidence arrived from independent directions at once. TrustedSec ran LLMs against five commercial EDR products and found all five share identical architectural patterns: YARA-style rules, behavioral logic, allowlists, prefilters, and local ML classifiers. Work that used to occupy a skilled reverse engineer for weeks now takes days. Anthropic's Mythos became the first model to clear both UK AISI simulated attack ranges, the benchmarks built specifically to test autonomous offensive cyber capability. OpenAI's GPT-5.5 cleared one of two. Both are outperforming an exponential trend line that was already doubling every few months.

    Then the 4-hour exploitation window on PraisonAI: disclosure to active targeting in the time it takes to schedule a change-advisory meeting. An 18-year-old RCE in NGINX's rewrite module, present since 2007, surfaced alongside it, affecting nearly every modern web application.

    The patch window used to be measured in months because attackers needed months. Now it is measured in months because procurement needs months; the attacker side moved while the defender side did not.

    The AI infrastructure stack is under active exploitation

    CISA added five AI tooling vulnerabilities to the Known Exploited Vulnerabilities catalog in a single week. LiteLLM (unauthenticated database queries), Ollama (GGUF model loader data exfiltration at CVSS 9.1), and OpenClaw (six simultaneous critical CVEs) are all being exploited in production. A Raspberry Pi honeypot dressed as an AI endpoint was indexed by Shodan in 3 hours and absorbed 113,000 requests in a month, with tooling that evolved mid-experiment to detect honeypots.

    In the same window, Foxconn lost 8 terabytes of confidential designs from Apple, Intel, Google, and Nvidia to the Nitrogen ransomware group. The assumption that contract manufacturers held manageable supply-chain custody risk just proved aspirational.

    The defender's response

    Microsoft stood up MDASH (multi-model AI vulnerability discovery) and found 16 exploitable flaws in a single Patch Tuesday. Mozilla found 271 bugs in Firefox 150 using Claude Mythos with custom harnesses, against curl's 1 CVE from generic scanning. The variable is harness design, not model quality. The offensive application of these same capabilities by threat actors is a 12-18 month timeline, not a theoretical one.

    Congress is routing Mythos access through NSA rather than CISA, which tells you which mission the government has prioritized. The private sector is on its own for the defensive application.


    What this forces

    A reasonable skeptic would point out that EDR vendors have weathered every prior architectural critique and shipped through it. The reasonable skeptic is correct about the past. What the skeptic does not explain is why a posture calibrated to an adversary that needed human researchers to chain exploits should hold against one that does not. The compensating controls, identity and network telemetry and behavioral analytics above the endpoint, are the ones that matter in the next eighteen months. The endpoint agent is no longer the load-bearing control.

    Action items

    • Commission a red team exercise specifically targeting your EDR with AI-assisted reverse engineering tools within 30 days
    • Compress critical vulnerability patch SLAs from 30-day to 72-hour maximum for internet-facing assets by end of Q3
    • Audit all AI infrastructure tooling (LiteLLM, Ollama, model registries) for security controls by end of month — most were adopted without security review
    • Invest in custom AI vulnerability scanning harnesses for your 3 most critical codebases this quarter, following Mozilla's pattern

    Sources:Clint Gibler · The Information AM · CyberScoop · The Hacker News · SANS AtRisk · TLDR InfoSec

  2. 02

    Compute Is Being Pre-Sold for the Decade — The Spot Market Assumption Just Died

    xAI Leased Its Crown Jewels to a Competitor

    The most revealing infrastructure signal this week is a leasing deal. Elon Musk agreed to lease 220,000 GPUs — 45% of xAI's Colossus cluster — to Anthropic, a company he has publicly called "misanthropic and evil." A reasonable skeptic would call this opportunistic capital management. The reasonable skeptic is partly correct. What the skeptic does not explain is why financial logic would overwhelm competitive logic this visibly unless Grok never achieved meaningful traction and the lease revenue exceeds what the GPUs could earn running xAI's own products. The frontier lab population is contracting in private before it contracts in headlines.

    When the financial logic of renting your GPU cluster to a rival exceeds running your own model on it, the question of how many viable frontier labs exist has been answered — just not publicly.

    The Bilateral Deal Era

    Cerebras priced its IPO at $56 billion, sixteen percent above range, with a seventy percent first-day pop. The catalyst was a quiet $20 billion procurement commitment from OpenAI in December 2025. One anchor customer turned a regulatory cautionary tale into the most successful tech IPO in five years. Tiger Global entered at $89 a share in September 2025 and saw $311 on day one, a 249% return in eight months.

    Fervo Energy debuted above $10 billion, surging thirty-three percent on day one, the largest clean energy IPO tied explicitly to AI datacenter demand. Google's option for 3 gigawatts from Fervo against only 658 MW currently contracted is the number that matters. That is sixty-plus data centers from a single power supplier.

    These are not market signals about demand. They are contractual facts about supply allocation. Nebius reports 4+ customers competing for every GPU it brings online while growing revenue 684%. The marginal buyer arriving in 2027 will get compute. The marginal buyer will not get 2025 terms.

    Microsoft's $100B Bet and What It Prices In

    Court filings in the Musk lawsuit disclosed that Microsoft has spent over $100 billion on OpenAI infrastructure by June 2026, against roughly thirty billion in direct revenue. OpenAI has committed another $280 billion to Microsoft servers going forward. The closest historical analog is the fiber buildout of the late 1990s, with one critical difference. Cisco's AI orders moving from $5 billion to $9 billion in the same window suggests the demand is real this time, not yet, but real.

    Fewer than five entities on earth can play at this scale: Microsoft/OpenAI, Google, Amazon, Meta, and the Nvidia ecosystem. That is not a procurement relationship. It is an alliance that sets long-run degrees of freedom for everyone else.


    The Planning Consequence

    The board-deck version of this story is that compute is getting expensive. The complete version is more useful. Infrastructure cost assumptions from six months ago are now likely 30-50% too optimistic, and the window for securing favorable multi-year compute terms is closing while most enterprise AI roadmaps remain scoped quarter to quarter. Power contracts signed in 2026 will determine competitive position in 2028 through 2030. The compute market is being divided through relationship-based bilateral commitments, not open market competition, and the firms that treat the next two quarters as a rehearsal for the next decade will find the decade already allocated.

    Action items

    • Audit your compute procurement strategy this quarter — determine if multi-year commitments are needed before capacity gets locked through bilateral deals
    • Explore whether becoming a 'transformational customer' for an emerging compute or energy company could secure strategic advantage
    • Accelerate any in-progress M&A conversations with AI infrastructure targets before the IPO window fully reprices expectations
    • Secure long-term power supply agreements for any planned data center expansion — treat energy as a strategic asset with 2-3 year procurement horizons

    Sources:The Pragmatic Engineer · StrictlyVC · Katie Roof · The Information AM · Martin Peers · Bloomberg Technology

  3. 03

    Five Platforms Declared Claims on the Agent Execution Layer in the Same Week — Your Architecture Decision Is Being Made For You

    The Collision Is Here

    SAP and ServiceNow stopped talking past each other this week. Both are now explicitly pitching as the execution layer where AI agents touch systems of record and actually do things. SAP is building a vertically integrated Knowledge Graph that makes its own agents contextually superior inside SAP's data universe. ServiceNow adopted MCP (Model Context Protocol) servers as the communication standard for its headless Action Fabric, telling the ecosystem that agents reach ServiceNow through an open protocol.

    These are incompatible theories of how the agent economy organizes: open interoperability versus data-moat integration. Both can win in different segments. The unresolved problem is that agents acting across finance, HR, IT, and procurement need one authoritative place to reconcile state. Two authoritative places is zero authoritative places.

    Apple and Google Claim the Consumer Layer

    Apple is inserting itself at the agent layer on iOS, specifically targeting agents that spin up smaller apps on the spot after Apple has already approved the parent app. The framing is safety. The mechanism is governance that prevents agents from routing around the 30% tax. For any company shipping AI agents on iOS, this is a new constraint layer that has to be priced into product economics before WWDC makes it fait accompli.

    Google's Gemini Intelligence ships this summer on Galaxy S26 and Pixel 10, extending to watches, cars, glasses, and laptops. In markets where Android holds 97%+ share, like India, that converts an installed base of 3+ billion devices into an agent platform. The grocery-list demo, where the agent reads a screenshot and builds the cart, turns the app into infrastructure and the agent into the interface.

    A product that is technically API-addressable but sits outside the default agent's routing table will be bypassed the same way sites outside the default search index were bypassed in 2005.

    Amazon Moves to Own the Transaction

    Amazon's Buy for Me enables AI-mediated purchases from competing retailers inside Amazon's agent surface. A reasonable skeptic would call it a feature. The reasonable skeptic is wrong. It is a claim on the transaction layer of the open web. Merchants who assumed Amazon competed for the listing now find it competing for the checkout while routing orders to their own fulfillment. The margin math deserves modeling before the next planning cycle.

    Where Value Settles

    a16z published research estimating $150 billion of GTM value migrating from CRM to the AI orchestration layer. The supporting data point is the one worth staring at: a single customer running twenty-plus agents saw 80% fewer human seats but 83% higher total spend. Seat-based pricing is breaking. Consumption-based agent pricing is forming. The CRM stops being where work happens and becomes where work is recorded.

    The Vercel production data confirms it: 59% of all AI token volume is now agentic workloads. The interface layer of software is unbundling from the application layer faster than most roadmaps assume. The orchestration surface, where workflows, permissions, and institutional memory live, is where switching costs accumulate. A model can be swapped in an afternoon. An agent graph wired into twelve internal systems cannot.

    Action items

    • Conduct an 'agent readiness' audit of your platform architecture within 60 days — determine if third-party AI agents can discover, invoke, and orchestrate your workflows without a human UI
    • Evaluate MCP as a strategic investment for your platform roadmap and build or integrate MCP server capabilities this quarter
    • Model per-action/per-outcome pricing scenarios against your current seat-based revenue and run a pilot with 3-5 customers by Q4
    • Decide whether your product is the platform agents route through or the capability agents call — then align architecture, pricing, and partnerships to that decision by end of Q3

    Sources:TLDR IT · TLDR · Techpresso · Simplifying AI · a16z · TLDR Design

◆ QUICK HITS

  • Update: Anthropic revenue reached $30B ARR (up from $9B four months ago, 120x growth in 24 months) — the fastest enterprise displacement in software history, confirming single-vendor AI dependency is now a first-order strategic risk

    StrictlyVC

  • ServiceNow's CDIO confirms they blew their full-year Anthropic budget by May — building an 'AI Control Tower' workaround they're now selling to other enterprises, signaling AI cost governance is an emerging product category

    Laura Bratton

  • a16z published comprehensive AI liability blueprint advocating user-liability defaults and damages caps — the largest VC firm is spending real political capital ($115.5M in 2026 midterms) to shape the regime before courts set precedent

    a16z AI Policy Brief

  • Lovable dissolved its growth management layer and VPs are voluntarily taking IC roles — former VP shipping in hours what cross-functional squads took weeks, spending 90% of time on high-value building vs. coordination tax

    Lenny's Newsletter

  • Abridge raised at $5.3B as 'clinical intelligence layer' with 80-100M+ medical conversations — the wedge-and-expand playbook in a market that's 20% of US GDP, with prior authorization compressed from 45 days to minutes

    Latent.Space

  • Update: Trump-Xi summit structured as chips-for-rare-earths swap with 25% revenue extraction on H200 sales to 10+ Chinese firms — a monetization template, not a decoupling framework

    The Download from MIT Technology Review

  • Training efficiency breakthroughs are compounding — Nous Research 2-3x (token superposition), NVIDIA Star Elastic 360x, Datology 17x — changing custom model build-vs-buy math for any enterprise with proprietary data

    AINews

  • Duolingo retreated from blanket AI mandate with CEO quantifying a ~20% 'slop tax' on AI-generated content at scale — performative adoption looks identical to productive adoption for two quarters before diverging

    TLDR Marketing

  • White House internal fight between ODNI and Commerce over AI model assessment authority — if IC wins, pre-release government evaluation becomes a de facto licensing regime for frontier AI, extending release timelines by months

    Risky.Biz

◆ Bottom line

The take.

The security model, the compute market, and the platform layer all moved this week — not incrementally but structurally. AI offensive capability cleared full network takeover for the first time while EDR architectures became transparent to AI-assisted reversing in days. Compute is being locked up through $20-100B bilateral deals that delete the spot-market assumption most AI roadmaps depend on. And five major platforms simultaneously declared claims on the agent execution layer that will determine who your customers interact with versus who becomes invisible infrastructure underneath. The three decisions being made this quarter whether you make them or not: where detection actually lives when the endpoint is transparent, whether your compute access survives a world of named bilateral buyers, and which side of the agent boundary your product sits on in 2028.

— Promit, reading as Leader ·

Frequently asked

Why are 30-day patch SLAs no longer defensible for internet-facing assets?
Because the weaponization window has collapsed below the patch cycle. PraisonAI saw active exploitation within four hours of disclosure, and CISA added five AI tooling vulnerabilities to its Known Exploited catalog in a single week. SLAs calibrated to a 30-day attacker timeline now operate inside a confirmed exposure window, which is why 72-hour maximums for critical internet-facing assets are becoming the new floor.
What changed about EDR that makes endpoint agents no longer load-bearing?
TrustedSec demonstrated that AI compresses commercial EDR reverse engineering from weeks to days, and all five products tested share identical architectural patterns: YARA-style rules, behavioral logic, allowlists, prefilters, and local ML classifiers. Once those patterns are transparent to an attacker's model, the endpoint agent stops being a meaningful detection control. Identity, network telemetry, and behavioral analytics above the endpoint are the controls that actually carry weight over the next 18 months.
What does xAI leasing 220,000 GPUs to Anthropic actually signal about the compute market?
It signals that the frontier-lab population is contracting privately before it contracts publicly, and that bilateral lease economics now exceed the value of running your own model on your own cluster. When a CEO leases 45% of a flagship cluster to a rival he has publicly disparaged, the implication is that Grok did not achieve traction and that compute itself has become the more valuable asset. For buyers, it confirms that spot-market assumptions are dead and multi-year terms are the only reliable path to capacity.
How should a product team decide whether to be an agent platform or an agent-callable capability?
The decision should be driven by where switching costs accumulate in your category and whether you control a system of record or a workflow surface. Platforms like ServiceNow and SAP are claiming the orchestration layer where agents reconcile state across systems, while Apple, Google, and Amazon are claiming the consumer transaction layer. Sitting between those positions as a point solution with a nice UI is no longer stable, because agents will route around any product that is not either the orchestrator or a first-class callable capability.
Why is seat-based SaaS pricing breaking, and what replaces it?
Seat-based pricing breaks when one human operator running 20+ agents drives 80% fewer seats but 83% higher total spend, as a16z documented. The work is migrating from human users to agent invocations, so revenue has to follow consumption, actions, or outcomes rather than logins. Vendors who do not have a consumption model ready before their largest customers ask will see revenue compress as seat counts fall faster than expansion can offset.

◆ Same day, different angle

Read this day as…

◆ Recent in leader

Keep reading.