Leader daily

Edition 2026-06-06 · read as Leader

Anthropic'sMythosBreakstheEDRThreatModelOvernight

Sources
36
Words
1,432
Read
7min

Topics Agentic AI AI Capital LLM Inference

◆ The signal

Anthropic's Mythos cleared both UK AISI simulated attack ranges this week, a first, while TrustedSec demonstrated that all five major commercial EDR products share architectures an AI reverse-engineers in days rather than weeks. The defensive stack was priced against an adversary that needed human researchers, months, and bespoke tooling. That adversary is now an overnight LLM prompt. The patch window has not compressed. The threat model has been replaced.

◆ INTELLIGENCE MAP

  1. 01

    Defensive Security Architecture Failure

    act now

    Full network takeover by AI, EDR transparency to AI reverse-engineering, 4-hour exploit weaponization, AI infrastructure on CISA KEV, and Foxconn's 8TB breach collectively invalidate the cost-asymmetry assumption that underpins most security budgets. The defender's response window has collapsed below most organizations' patch SLAs.

    4hrs
    exploit weaponization window
    8
    sources
    • EDR reversal time
    • AISI ranges cleared
    • Honeypot attacks/month
    • Foxconn data exfil
    1. EDR Bypass (old)90
    2. EDR Bypass (now)3
    3. Exploit Window (old)30
    4. Exploit Window (now)0.17
  2. 02

    Enterprise Execution Layer War: SAP vs. ServiceNow vs. the AI Hyperscalers

    monitor

    SAP's €100M fund + Knowledge Graph and ServiceNow's MCP-based Action Fabric represent incompatible architectures for AI agent execution. The question is no longer which vendor owns your data — it's which vendor agents write to when they commit actions. ServiceNow blew its full-year Anthropic budget by May, revealing governance isn't keeping pace.

    €100M
    SAP agent fund
    5
    sources
    • Agents bypass bots
    • Agentic token share
    • ServiceNow budget
    • MCP adoption
    1. Agentic workloads59
    2. Conversational41
  3. 03

    AI Infrastructure Phase Transition: Compute Becomes Financial Instrument

    monitor

    Cerebras IPO'd at $56B (70% first-day pop) on a $20B OpenAI commitment. Fervo Energy debuted at $10B+ (33% pop) on AI power demand. xAI leased 45% of Colossus to Anthropic. Compute is being pre-sold in $10B+ blocks, financialized as a tradeable asset, and priced out of reach for late entrants. The spot-market assumption in most AI budgets is obsolete.

    $56B
    Cerebras IPO valuation
    7
    sources
    • OpenAI-Cerebras deal
    • Fervo valuation
    • GPU demand ratio
    • xAI Colossus leased
    1. Microsoft→OpenAI100
    2. Cerebras IPO56
    3. OpenAI→Cerebras20
    4. Fervo Energy10
  4. 04

    AI Liability Regime Being Written — In Courts, Not Congress

    background

    a16z published the most comprehensive AI liability blueprint yet (user-liability defaults, damages caps), while active litigation could impose massive penalties on developers for downstream misuse before any legislation exists. The regime that wins determines whether open-source AI remains insurable. The window to influence is quarters, not years.

    $115M
    a16z political spend
    4
    sources
    • Clarity Act odds
    • a16z 2026 donations
    • Framework passage
    • Open-source threat
    1. Q3 2026Active court cases setting precedent
    2. Q4 2026a16z framework lobbying peak
    3. H1 2027Federal preemption vote likely
    4. H2 2027Regime hardens — options close
  5. 05

    Org Design Becomes Competitive Weapon: The Coordination Cost Collapse

    background

    Lovable dissolved its growth management layer 5 months ago — ex-VPs now ship enterprise features solo in hours. The economic case for middle management collapses when AI eliminates coordination overhead. Top talent is voluntarily choosing autonomy over authority, and the companies losing them are the ones with the highest manager-to-maker ratios.

    90%
    time on high-value work
    4
    sources
    • Lovable model age
    • HI-C high-value time
    • Cumulative tech layoffs
    • Duolingo slop tax
    1. Traditional (VP + team)15
    2. AI-native (HI-C solo)1

◆ DEEP DIVES

  1. 01

    Your Security Architecture Was Priced Against Last Year's Adversary — Full Network Takeover Changes the Category

    The Capability Discontinuity

    Tuesday's briefing reported an 81% autonomous hack rate and called it the trend. This week's data is a different category. Anthropic's Mythos became the first model to clear both UK AISI simulated attack ranges — full network takeover, not persistence. OpenAI's GPT-5.5-cyber completed one. The UK AI Security Institute confirms this is above the exponential line that was already doubling every few months. Congress is holding closed-door Mythos demos. The access is going to NSA rather than CISA, which tells you which mission the government has decided is the priority.

    The EDR Transparency Problem

    A reasonable skeptic would argue that endpoint vendors still have depth-of-defense the attacker has to grind through. The skeptic is partially right and structurally wrong. TrustedSec ran LLMs against five commercial EDR products and found all five are architecturally identical: YARA-style rules, behavioral logic, allowlists, prefilters, scripted engines (some readable as Lua after a single decryption pass), and local ML classifiers. Work that took skilled reversers weeks now takes days. The defensive model assumed obscurity bought time. The time is gone.

    The security model of the defensive stack was built on the premise that the cost of understanding the agent exceeded the value of bypassing it for most adversaries. That premise is no longer true for a growing share of the threat population.

    The Exploit Window Has Collapsed

    PraisonAI went from disclosure to active exploitation in 4 hours. An 18-year undetected RCE in NGINX's rewrite module proves even foundational infrastructure escapes audit. A Raspberry Pi honeypot dressed as AI infrastructure was indexed by Shodan in 3 hours and absorbed 113,000+ attacks per month, with attacker tooling evolving mid-experiment to detect and evade the honeypot itself. The patch SLA most organizations operate against was set in a different decade.

    The Supply Chain Dimension

    Foxconn lost 8TB of confidential designs from Apple, Google, Intel, and Nvidia through a single breach. AI infrastructure tooling — LiteLLM, Ollama, OpenClaw — is now on CISA's Known Exploited Vulnerabilities catalog. The AI gateway went from experiment to production at most firms without passing through security review.

    Where Sources Agree and Diverge

    All eight sources covering this theme agree that the defender's cost-asymmetry advantage has inverted. They diverge on timeline. Some suggest 12-18 months before these capabilities are broadly available to threat actors. Others point to the open-sourcing of Shai-Hulud and industrialized guardrail bypass as evidence the window is already closed. The conservative assumption — that current patch SLAs and an EDR-centric model survive unchanged through 2027 — is the one no source supports.

    Action items

    • Commission a red-team exercise specifically targeting your EDR with AI-assisted reverse engineering to measure actual detection gap against the TrustedSec findings
    • Compress critical vulnerability patch SLAs to 72 hours maximum for internet-facing assets, with 24-hour target for AI infrastructure
    • Conduct emergency inventory of all AI infrastructure tooling (LiteLLM, Ollama, model registries, AI gateways) and validate against CISA KEV list
    • Shift detection investment toward identity, network telemetry, and behavioral analytics above the endpoint layer over the next two quarters
    • Brief the board on AI cyber capability discontinuity — frame as a threat model replacement, not a patch cycle

    Sources:Clint Gibler · The Information AM · AINews · CyberScoop · SANS AtRisk · The Hacker News

  2. 02

    The Agent Execution Layer Is the New Platform Control Point — And the Land Grab Started This Week

    Two Incompatible Architectures, One Decision Window

    SAP and ServiceNow stopped talking past each other this week. Both now claim the execution layer, the surface where AI agents commit writes to systems of record. SAP's answer is a €100M fund and a vertically integrated Knowledge Graph that makes its own agents contextually superior inside SAP's data universe. ServiceNow's answer is a headless Action Fabric exposed via MCP (Model Context Protocol) that any agent can call. These are not adjacent positions. They are incompatible theories of how the agent economy organizes.

    Agents that act across finance, HR, IT, and procurement need one authoritative place to reconcile state. Two authoritative places is zero authoritative places.

    MCP Is Becoming the De Facto Standard

    ServiceNow adopting MCP servers as the communication standard for Action Fabric pulls the ecosystem toward that protocol. When a company with workflow gravity across IT, HR, and customer service declares that agents talk to it via MCP, the weight of that declaration compounds. Notion's developer platform launched the same week with explicit agent-hosting positioning. The protocol question is being answered by adoption, not by committee.

    The Budget Blowout Signal

    A reasonable skeptic would call the next data point an outlier. ServiceNow's CDIO admitted the company blew its full-year Anthropic budget by May. Anthropic offers no SLAs, no usage telemetry, and no comment on enterprise cost overruns. That is not immaturity. It is a deliberate choice to optimize capability over governance. ServiceNow's response was to build its own AI Control Tower and sell it to other enterprises. The market is routing around vendor deficiency.

    The Pricing Model Shift

    SAP is not charging per-seat for autonomous finance agents. It is positioning around workflow execution value. ServiceNow's headless architecture implies consumption-based pricing on agent API calls. Vercel's production data settles the direction: 59% of all AI token volume is now agentic workloads, not human conversations. Per-seat pricing structurally cannot capture this value. The transition from seat-based to action-based revenue is being decided this year, not this decade.

    What This Means for the Architecture Decision

    The decision is not which vendor wins the category. Neither will, cleanly. The decision is which vendor owns the execution layer for the processes that cannot stop, and which one gets relegated to integration. SAP's claim is strongest where the process is the transaction, meaning order-to-cash and record-to-report. ServiceNow's claim is strongest where the process is the workflow across systems. Most enterprises have both shapes. The run-both instinct will not survive the first quarter in which agents need to commit writes to both at once.

    Action items

    • Conduct an 'agent readiness' audit — map which of your platforms can be discovered, invoked, and orchestrated by third-party AI agents without a human UI
    • Evaluate MCP as a strategic investment for your platform roadmap — build or integrate MCP server capabilities before Q4
    • Stand up an AI governance function with authority over tool/vendor rationalization before Q3 budgeting
    • Model the financial impact of per-action/per-outcome pricing on your revenue if agents reduce human seat consumption by 50%+

    Sources:TLDR IT · Laura Bratton · a16z · TLDR · Simplifying AI · ben's bites

  3. 03

    Compute Is Being Financialized — The Spot-Market Assumption Just Got Deleted

    Three IPOs That Changed the Capital Structure of AI

    The reasonable read of last week is that three IPOs repriced the inputs to every AI infrastructure plan, not just three balance sheets. Cerebras priced at $56B fully diluted, sixteen percent above the raised range, with a seventy percent first-day pop. That is the most successful tech IPO in five years, and the catalyst was not the technology. It was a $20B procurement commitment from OpenAI that converted a regulatory cautionary tale into market validation. Fervo Energy debuted above $10B with a thirty-three percent first-day pop, framed openly as an AI datacenter trade. Google holds an option for 3 gigawatts from Fervo against 658MW currently contracted. That is sixty-plus data center facilities from a single energy supplier.

    Compute as Financial Instrument

    xAI is leasing 45% of its Colossus cluster (220,000 GPUs) to Anthropic, and Musk has publicly called Anthropic misanthropic and evil. The financial logic has overwhelmed the competitive logic. Grok never achieved meaningful traction, and the lease revenue exceeds what Grok could generate from the same silicon. Excess infrastructure is moving onto the lease market. Enterprise compute economics could shift on a twelve-to-eighteen-month horizon as a result.

    A company that is quietly preparing to stand beside its flagship partner rather than behind it is a company that has already concluded the single-vendor story does not survive contact with the next contract cycle.

    The Numbers That Reframe the Budget

    MetricValueImplication
    Microsoft → OpenAI total commitment$100B+This is the floor for platform-scale AI, not the ceiling
    Nebius revenue growth684% YoY4:1 demand-to-GPU ratio persists
    Anthropic ARR trajectory$9B → $30B in ~4 monthsEnterprise switching velocity is unprecedented
    Anthropic total capital raised$75BHardware-scale capital producing software-scale growth

    What This Means Operationally

    Most infrastructure plans rest on a quiet assumption that capacity will be available somewhere, at some price, when the workload arrives. That line item is being deleted. Supply is being pre-sold in bilateral blocks of $10B and up. The firms shipping AI product on schedule are the ones that locked capacity twelve-to-eighteen months ago. A plan written last quarter that assumed three viable suppliers in eighteen months now has one and a half.

    A reasonable skeptic would say Microsoft's behavior is idiosyncratic, not a market signal. The reasonable skeptic is correct on the first point and wrong on the second. Microsoft is preparing to stand beside OpenAI rather than behind it, and actively exploring other AI startup deals. That is a company that has concluded single-provider dependency does not survive the next cycle. If the best-positioned buyer in the world is hedging, the argument for your team not hedging is weaker than it was last Monday.

    Action items

    • Audit compute procurement strategy and determine if multi-year committed capacity is required — model the cost differential between locking now vs. spot-pricing in 12 months
    • Explore whether becoming a 'transformational customer' for an emerging AI chip or infrastructure company could secure strategic compute advantage
    • Accelerate any in-progress M&A conversations with AI infrastructure targets before IPO window fully reprices seller expectations
    • Secure long-term power supply agreements or partnerships for any planned data center expansion

    Sources:The Information AM · TLDR AI · Martin Peers · StrictlyVC · Bloomberg Technology · Katie Roof

◆ QUICK HITS

  • Update: Anthropic's Mythos cleared BOTH UK AISI attack ranges (full network takeover) — first model to do so, outpacing the exponential trend that was already doubling every few months

    CyberScoop

  • Anthropic's June 15 pricing restructure caps third-party tools (Cursor, Zed) at plan-value credits then bills API rates — a platform tax that reprices every product built on Claude through intermediaries

    ben's bites

  • Apple preparing App Store agent framework for WWDC — agents that 'spin up smaller apps on the spot' will face approval gates and fee extraction, creating a new constraint layer for consumer AI products on iOS

    Techpresso

  • Google's Gemini Intelligence ships this summer on Galaxy S26 and Pixel 10 — converting 97%+ Android market share in key markets into an OS-level agent platform where apps become API infrastructure

    Simplifying AI

  • Only 15% of organizations have data foundations adequate for agentic AI — the other 85% are spending millions on agents that will fail on data governance, not model quality (95.2% cite organizational problems, not tooling)

    TLDR Data

  • Lovable's HI-C model (dissolved management, ex-VPs as autonomous ICs) is 5 months in and expanding — 90% of time on building vs. coordination, with senior operators voluntarily choosing autonomy over authority

    Lenny's Newsletter

  • Training efficiency compounding: 2-3x from token superposition (Nous), 360x from elastic post-training (NVIDIA Star), 17x from data curation (Datology) — custom model economics changing on a quarterly basis

    AINews

  • US-China chip deal includes 25% revenue-share extraction on H200 sales to 10+ Chinese companies — a 'controlled engagement' template that makes full-decoupling assumptions in supply chain plans look expensive

    The Download from MIT Technology Review

  • Duolingo walked back blanket AI mandate after discovering ~20% of AI-generated output is unusable 'slop' — performative adoption indistinguishable from productive use for about two quarters before numbers diverge

    TLDR Marketing

◆ Bottom line

The take.

AI achieved full autonomous network takeover this week while the defensive stack was proven transparent to the same AI that's attacking it — and the enterprise execution layer, the surface where AI agents actually commit writes, is being claimed right now by SAP, ServiceNow, and the hyperscalers in architectures that will be irreversible within 18 months. Meanwhile, compute is being pre-sold in $20B bilateral blocks that delete the spot-market assumption from most AI budgets. Three decisions are being forced this quarter: where your detection actually lives if EDR becomes transparent, who owns the execution layer your agents write to, and whether you've locked compute before the named buyers take what remains.

— Promit, reading as Leader ·

Frequently asked

What does Mythos clearing both UK AISI attack ranges actually mean for our defensive posture?
It means a frontier model executed full network takeover — not just persistence — on government-grade simulated ranges, a capability previously requiring skilled human teams over months. The defensive stack most enterprises operate was priced against that older adversary. Treat this as a threat-model replacement, not an incremental escalation, and brief the board accordingly.
If all five major EDR products share the same architecture, what should we shift detection investment toward?
Move investment above the endpoint: identity, network telemetry, behavioral analytics, and cross-system anomaly detection. TrustedSec showed AI can reverse-engineer EDR internals in days, collapsing the obscurity premium. EDR remains useful as one layer, but the compensating controls layered on top are what survive an adversary that already understands your agent.
How aggressively should we compress patch SLAs given the new exploit windows?
Target 72 hours maximum for internet-facing critical vulnerabilities and 24 hours for AI infrastructure components. PraisonAI moved from disclosure to active exploitation in four hours, meaning a standard seven-day SLA leaves systems exposed for the entire weaponization-to-mass-exploitation curve. AI-tooling assets (LiteLLM, Ollama, gateways) deserve the tightest tier because several are now on CISA's KEV list.
Does the SAP versus ServiceNow execution-layer split force us to pick a side?
Not immediately, but the run-both instinct will not survive the first quarter where agents must commit writes across both systems. SAP is strongest where the process is the transaction; ServiceNow is strongest where the process is cross-system workflow. Audit which of your processes fall into each shape and decide where authoritative state lives before agent rollouts force the answer for you.
Why does the Cerebras and Fervo IPO activity matter for our compute planning?
Because frontier compute capacity is being pre-sold in bilateral $10B+ blocks, deleting the assumption that capacity will be available at spot pricing when workloads arrive. Microsoft is hedging beyond OpenAI, xAI is leasing 45% of Colossus to Anthropic, and Google has optioned 3GW from a single energy supplier. Plans assuming three viable suppliers in 18 months should be re-modeled against one-and-a-half.

◆ Same day, different angle

Read this day as…

◆ Recent in leader

Keep reading.