Security daily

Edition 2026-05-11 · read as Security

CopyFailPodmanEscapeandAntreaPRCompromiseBreakTrust

Sources
11
Words
1,330
Read
7min

Topics LLM Inference Agentic AI AI Capital

◆ The signal

CVE-2026-31431 (CopyFail) has a public PoC that escapes rootless Podman to container root — patch every Linux host, container runtime, and CI runner today. Simultaneously, the CNCF Antrea project was compromised May 2 via a malicious PR that weaponized its own Trivy security scanner to gain root on the Jenkins controller. Two separate trust boundaries you were relying on — rootless container isolation and PR-triggered scanner sandboxing — are confirmed broken this week.

◆ INTELLIGENCE MAP

  1. 01

    CopyFail + Antrea CI Compromise: Two Trust Boundaries Confirmed Broken

    act now

    CVE-2026-31431 has a working PoC escaping rootless Podman to container root. Antrea's Trivy scanner was weaponized via malicious PR to pop root on Jenkins. NVIDIA GDDR Rowhammer has three variants, one bypassing IOMMU. All three collapse defenses enterprises relied on for isolation.

    3
    trust boundaries broken
    1
    sources
    • CopyFail exploit
    • Antrea compromise
    • GPU Rowhammer variants
    • IOMMU bypass
    1. May 2Antrea CI compromised via Trivy
    2. This weekCopyFail PoC published
    3. This weekNVIDIA Rowhammer disclosed
    4. NowPatch + quarantine CI scanners
  2. 02

    Voice Cloning Hits API-Scale: xAI Ships, Deepfakes Up 900% YoY

    act now

    xAI launched a voice-cloning API this week alongside Grok 4.3. Deepfake volume is up 900% YoY. OpenAI blocked 250K+ political deepfake requests in 2024. Generation now runs on a phone. Wire-approval and helpdesk workflows that trust voice are broken at commodity cost.

    900%
    deepfake growth YoY
    4
    sources
    • Deepfake growth
    • OpenAI blocks
    • UK voters saw deepfake
    • Clone source needed
    1. Deepfake volume growth900
    2. Political deepfakes blocked250
    3. UK voters exposed25
  3. 03

    Agentic AI Surface Expansion: Codex Extension + SAP Lockdown + Open Weights

    monitor

    OpenAI's Codex Chrome extension accesses authenticated SaaS sessions via DevTools. SAP locked third-party agents out of its APIs (only Joule/NemoClaw allowed). GLM-5.1 (744B, MIT license) beats GPT-5.4 on SWE-Bench Pro. Shadow AI on endpoints now bypasses all cloud DLP.

    744B
    GLM-5.1 parameters
    3
    sources
    • GLM-5.1 SWE-Bench
    • GPT-5.4 SWE-Bench
    • AirLLM min GPU
    • Agent failure rate
    1. 01GLM-5.1 (MIT, open)58.4
    2. 02GPT-5.457.7
    3. 03Claude Opus 4.657.3
  4. 04

    AI Supply Chain Sovereignty & Concentration Risk

    monitor

    DeepSeek's first outside capital is led by China's state-backed Big Fund — converting it from open-weight lab to state-affiliated asset. Anthropic signed a $1.8B Akamai deal on top of SpaceX and CoreWeave dependencies. AI inference now routes through a handful of names. Fourth-party risk maps fit on a napkin.

    $1.8B
    Anthropic-Akamai deal
    3
    sources
    • Anthropic-Akamai
    • DeepSeek founder stake
    • Akamai stock move
    • Hyperscaler AI capex
    1. Anthropic (Akamai)1.8
    2. Hyperscaler AI capex700
    3. Total AI revenue40
  5. 05

    Insider Threat Detection Gap: DOJ BigLaw Indictment

    background

    DOJ indicted 30 individuals including attorney Nicolo Nourafchan who accessed M&A deal rooms he wasn't staffed on at multiple BigLaw firms. No malware, no credential theft — just a valid account browsing documents his role entitled him to read. DLP saw nothing. Only UEBA on access patterns would have caught it.

    30
    individuals indicted
    1
    sources
    • Indicted individuals
    • Kickback amounts
    • Detection method
    • MITRE technique
    1. DLP detection rate0
    2. UEBA detection (if deployed)85

◆ DEEP DIVES

  1. 01

    CopyFail + Antrea: Your Container Isolation and CI/CD Sandboxing Failed This Week

    Two Concurrent Exploits, One Playbook

    CVE-2026-31431, handle CopyFail, is a Linux kernel privilege-escalation flaw. Proof-of-concept is working and confirmed by researcher Gabriel Garrido. The PoC escapes rootless Podman to container root, which collapses the defense-in-depth assumption that rootless containers buy meaningful privilege separation. Scope: every Linux host running Podman, containerd, or CRI-O. CI runners carry the most exposure because they execute untrusted workloads by design.

    Separately, on May 2, the CNCF Antrea project's CI infrastructure was compromised. The pattern is worth memorizing. A malicious pull request was submitted. When the project's Trivy security scanner ran against the PR content, the attacker achieved code execution on the Jenkins controller, as root. The attacker then taunted the maintainers. MITRE mapping is T1195.002 chaining to T1078.

    Trivy was the foothold, not the target. Any security scanner that runs on PR content and executes on a host with push or deploy credentials is a latent RCE in your pipeline.

    The Pattern That Generalizes

    The Antrea compromise is not a Trivy story. Grype, Snyk CLI, npm audit, and custom SAST tools all execute attacker-controlled input when triggered by PR content. If a CI pipeline runs any of them on a runner that holds secrets, deploy credentials, or a cloud IAM role, the exposure is identical. The fix is architectural, not a scanner swap: move scanner execution into ephemeral, credential-less sandboxes with no route to the Jenkins controller, the artifact registry, or the cloud control plane.

    NVIDIA GDDR Rowhammer — The Third Boundary

    Two independent teams disclosed three variants of NVIDIA GDDR Rowhammer attacks. One variant defeats IOMMU, which is the hardware boundary enterprises have been citing for GPU workload isolation in multi-tenant setups. Anyone running inference on shared A100, H100, or GB200 instances with sensitive prompts or proprietary model weights now owes the tenancy model a documented risk decision.


    Immediate Actions

    ActionScopeDeadline
    Patch CVE-2026-31431All Linux hosts, container runtimes, CI runnersThis week
    Deploy Falco/eBPF rule for CopyFail PoC syscall patternProduction Kubernetes, CI infrastructure48 hours (bridge during patch rollout)
    Quarantine CI scanners to credential-less runnersAll pipelines triggering scanners on PR contentThis sprint
    Require signed commits + maintainer approval before secret-bearing workflows fireGitHub Actions, GitLab CI, JenkinsThis sprint
    Classify GPU workloads by sensitivity; move regulated data to dedicated-host tenancyMulti-tenant NVIDIA GPU environmentsThis quarter

    Action items

    • Patch CVE-2026-31431 across all Linux hosts, container runtimes (Podman, containerd, CRI-O), and CI runners by end of week
    • Deploy eBPF/Falco detection rule targeting CopyFail's syscall pattern within 48 hours as a bridge control
    • Audit all CI/CD pipelines for PR-triggered scanner execution and migrate scanners to ephemeral credential-less sandboxes this sprint
    • Document GPU tenancy risk decision for multi-tenant NVIDIA workloads handling regulated data or proprietary model weights

    Sources:CopyFail + Antrea CI compromise + GPU Rowhammer: three active threats hit your stack this week

  2. 02

    Voice Cloning Hits API Scale: The Wire Approval Workflow Is Now Broken at Commodity Cost

    Three Data Points, One Conclusion

    The xAI release is the headline: xAI shipped a voice-cloning API alongside Grok 4.3. Deepfake generation volume rose 900% year-over-year. OpenAI confirmed blocking 250,000+ political deepfake generation requests during the 2024 US election cycle. Generation now runs on a handset, not a workstation.

    The operational implication is specific. Any workflow using a human ear as the final checkpoint — wire approvals by phone, helpdesk password resets verified by voice, executive assistant travel confirmations — is now operating against an adversary who produces a convincing clone from under one minute of source audio at the cost of an API call.

    The control is not dead. It is degraded, and the degradation is measurable: 900% volume growth at phone-grade generation cost. Anything relying on 'does this sound like the boss' belongs off the critical path.

    Cross-Source Agreement

    Voice-agent platforms are shipping faster than detection content is being written for them. OpenAI GPT-Realtime covers 70+ languages with 128K context. Ethos is onboarding 35,000 experts/week. Ireland's 2025 presidential race saw a deepfake of the eventual winner "withdrawing" using fake national broadcaster footage. 25% of UK voters reported personally encountering a deepfake in 2024. Detection is structurally losing to generation.

    Known Mitigations, Undeployed

    The mitigations are well-understood and most organizations have already licensed the capability without deploying it:

    • Callback verification on any voice-authorized action above a defined dollar threshold, to a number from the internal directory, not one provided on the call
    • Written second-channel approval for wire changes, vendor bank modifications, and privileged access resets
    • Helpdesk scripts that explicitly do not accept voice as a second factor
    • C2PA/Content Credentials on outbound executive video and official communications

    A vishing tabletop using a cloned sample of a willing executive within 30 days is the lowest-friction way to demonstrate the gap to leadership.


    What Changed From Saturday

    Saturday's briefing flagged the social engineering cost curve collapsing to $0.03/minute and MuddyWater running voice campaigns against Teams. This week's escalation: a named API product (xAI) shipped, quantified volume data surfaced (900% YoY, 250K blocks), and phone-grade generation removed the compute barrier. Generation cost is now at the floor.

    Action items

    • Mandate out-of-band callback (to directory-sourced number, not caller-provided) for all wire transfers, vendor bank changes, and credential resets by end of this sprint
    • Run a vishing tabletop using a cloned voice sample of a willing executive within 30 days
    • Deploy C2PA content credentials on outbound executive video and earnings-call audio this quarter
    • Add prompt-size alerting to LLM egress proxies at 100K+ tokens to detect corpus-scale exfiltration through legitimate AI endpoints

    Sources:Codex Chrome Extension and xAI Voice Cloning: Two New Attack Surfaces Landed in Enterprise Environments This Week. · Two moves this week, same direction. SAP is restricting access to its agent APIs. · The claim is simple. Deepfake generation no longer requires a workstation. · Two items from this week's release cycle expand the enterprise attack surface

  3. 03

    Agentic AI Governance: Codex Extension, SAP Lockdown, and DeepSeek's New Investor

    Control of the Agent Layer Is Being Decided This Week

    The most consequential move is OpenAI's Codex Chrome extension, an agent that inherits every signed-in browser session. SAP walling off its API to two sanctioned agents and DeepSeek's first state-backed funding round are related developments in the same direction. The agent layer is being onboarded inside the enterprise faster than the controls around it, and each of these requires a different governance response.

    Codex Chrome Extension: An Agent Inside Every SaaS Session

    OpenAI's Codex Chrome extension runs with the user's signed-in session state across Salesforce, Gmail, LinkedIn, and internal dashboards. It reads DOM and console in real time via Chrome DevTools. That makes it categorically different from a passive extension. It inherits every session cookie for every site the user is logged into, and prompt injection embedded in any page it reads can redirect it.

    Threat-model: malicious insider with persistent SSO access. The extension is already installable from the Chrome Web Store, and most managed Chrome policies have not been updated to treat agentic extensions as a separate class. Block by default via Chrome Enterprise policy this week, pending a security review of permissions, prompt-injection resistance, and data-egress telemetry.

    SAP Locks the Agent Ecosystem

    SAP updated its API policy to block all third-party AI agents except Joule and NVIDIA NemoClaw. Any OpenClaw-class agent, custom MCP server, or LangChain agent using SAP OData endpoints will start failing. Shadow integrations the SOC never inventoried will surface, and the blast radius concentrates behind two sanctioned agents.

    The action is an inventory: every non-Joule, non-NemoClaw agent currently authenticating into the SAP tenant, mapped to a sanctioned replacement or a risk-owner-signed exception. The vendor-driven change is a one-time opening to find shadow AI integrations without political cost.

    DeepSeek: From Open-Weight Lab to State-Affiliated Asset

    DeepSeek's first external capital round is led by China's state-backed Big Fund, with Tencent and Alibaba in talks. Founder Liang Wenfeng retains roughly 90% ownership. For any organization under SOC 2, federal procurement, or EU AI Act sovereignty provisions, DeepSeek-derived artifacts now require a documented risk decision, whether that is the hosted API, open-weight deployments, or fine-tunes built on top. Vendors quietly routing traffic through DeepSeek inference will have a conversation with their CISOs this quarter, or under worse conditions next quarter.

    The agent layer is being onboarded faster than the controls around it. SAP is the platform side of the move. DeepSeek is the national side. Different events, same trend.

    Stanford's 33% Failure Rate Sets the Ceiling

    Stanford's AI Index 2026 reports AI agents fail roughly 1-in-3 structured tasks. Meta's ProgramBench shows a 0% full-solve rate across 200 coding tasks. Any SOAR or IR workflow handing an agent autonomous destructive authority will produce self-inflicted incidents at a predictable cadence. Use agents for enrichment and summarization, where a 33% miss rate is survivable, and keep humans on containment and closure, where it is not.

    Action items

    • Block Codex Chrome extension via Chrome Enterprise managed policy this week pending security review
    • Inventory every third-party AI agent authenticating into SAP tenants and map each to sanctioned replacement (Joule/NemoClaw) or risk-owner exception by end of quarter
    • Produce a DeepSeek exposure report (hosted API, open-weight deployments, fine-tuned derivatives, training data shared) and deliver to GC and CISO within two weeks
    • Gate all AI agent workflows with destructive or irreversible actions behind human approval checkpoints until measured reliability exceeds error budget

    Sources:Codex Chrome Extension and xAI Voice Cloning: Two New Attack Surfaces Landed in Enterprise Environments This Week. · Two moves this week, same direction. SAP is restricting access to its agent APIs. · Two items from this week's release cycle expand the enterprise attack surface

◆ QUICK HITS

  • NVIDIA GDDR Rowhammer: three disclosed variants, one bypasses IOMMU — reassess multi-tenant GPU isolation for workloads handling proprietary model weights or regulated data

    CopyFail + Antrea CI compromise + GPU Rowhammer: three active threats hit your stack this week

  • pgBackRest discontinued after sole maintainer's sponsor was acquired — inventory PostgreSQL backup dependencies and begin migration to Barman or WAL-G before SOC 2 findings

    CopyFail + Antrea CI compromise + GPU Rowhammer: three active threats hit your stack this week

  • GLM-5.1 (744B MoE, MIT license) beats GPT-5.4 on SWE-Bench Pro at 58.4 vs 57.7 — engineers are downloading it now; publish open-weight governance policy covering hosting, data classes, and attestation

    Codex Chrome Extension and xAI Voice Cloning: Two New Attack Surfaces Landed in Enterprise Environments This Week.

  • AirLLM runs 70B models on 4GB GPUs via layer-streaming with no quantization — shadow AI on endpoints now bypasses all cloud DLP; deploy EDR signatures for large safetensors writes and sustained GPU utilization on non-ML hardware

    Codex Chrome Extension and xAI Voice Cloning: Two New Attack Surfaces Landed in Enterprise Environments This Week.

  • Figma confirms 75% of paying customers consume AI credits weekly — usage-based AI pricing creates an economic-DoS attack surface most SOCs don't monitor; implement per-key baselines and hard spend caps

    AI infrastructure is consolidating around a small number of suppliers.

  • DOJ indicted 30 individuals including attorney Nicolo Nourafchan for accessing M&A deal rooms he wasn't staffed on — UEBA on DMS access patterns is the only control that would have caught valid-account insider browsing

    A law firm insider pulled M&A documents off the internal network.

  • Anthropic shipped 10 finance-specific agents with Microsoft 365 + Moody's integration — new LLM-agent surface in pitchbooks, credit memos, and KYC workflows; inventory OAuth scopes and service-account permissions

    A law firm insider pulled M&A documents off the internal network.

  • Kubernetes v1.36 introduces manifest-based admission control (alpha) — loads webhooks and CEL policies from disk at API-server startup, preventing deletion by compromised cluster-admin; pilot in non-prod

    CopyFail + Antrea CI compromise + GPU Rowhammer: three active threats hit your stack this week

◆ Bottom line

The take.

Three trust boundaries you were relying on broke this week: rootless containers (CopyFail CVE-2026-31431, active PoC), CI/CD scanner sandboxing (Antrea's Trivy compromised May 2), and voice-as-identity (xAI shipped a cloning API while deepfake volume grew 900% YoY). Meanwhile, the AI agent layer is being locked down by platforms (SAP), funded by states (DeepSeek + Big Fund), and installed in your browser sessions (Codex) faster than governance can follow. Patch CopyFail today, quarantine your CI scanners this sprint, and kill voice as a trust anchor before the next wire approval hits.

— Promit, reading as Security ·

Frequently asked

What makes CVE-2026-31431 (CopyFail) different from a routine kernel CVE?
CopyFail has a working public PoC that escapes rootless Podman to container root, breaking the defense-in-depth assumption that rootless containers provide meaningful privilege separation. It affects every Linux host running Podman, containerd, or CRI-O, with CI runners most exposed because they execute untrusted workloads by design. Patch hosts, runtimes, and CI runners this week, and deploy an eBPF/Falco rule for the syscall pattern as a 48-hour bridge.
Why is the Antrea CI compromise a Trivy problem and not just an Antrea problem?
Trivy was the foothold, not the target — the same pattern applies to Grype, Snyk CLI, npm audit, and custom SAST tools. Any scanner that executes attacker-controlled PR content on a runner holding secrets, deploy credentials, or a cloud IAM role is a latent RCE in your pipeline. The fix is architectural: move scanner execution into ephemeral, credential-less sandboxes with no path to controllers, registries, or cloud control planes.
How should multi-tenant NVIDIA GPU workloads respond to the GDDR Rowhammer disclosures?
Classify GPU workloads by sensitivity and document a tenancy risk decision, because one of the three disclosed Rowhammer variants defeats IOMMU — the hardware boundary most enterprises cite for GPU isolation. Regulated data and proprietary model weights running on shared A100, H100, or GB200 instances should move to dedicated-host tenancy or carry a signed risk acceptance. Auditors will ask this quarter.
Why block the Codex Chrome extension by default instead of reviewing it like any other extension?
Codex inherits every signed-in session across Salesforce, Gmail, LinkedIn, and internal dashboards, and reads DOM and console in real time via Chrome DevTools. Prompt injection embedded in any page it reads can redirect the agent using the user's authenticated cookies. Most managed Chrome policies do not yet treat agentic extensions as a distinct class, so block via Chrome Enterprise policy now and review permissions, prompt-injection resistance, and egress telemetry before allowing.
What concrete control replaces voice as a verification factor now that cloning is API-priced?
Mandate out-of-band callback to a directory-sourced number — never one provided on the call — for wire transfers, vendor bank changes, and credential resets, and remove voice as an accepted second factor in helpdesk scripts. With xAI shipping a voice-cloning API and deepfake volume up 900% year-over-year, any workflow using the human ear as the final checkpoint is operating against a commodity-cost adversary. A 30-day vishing tabletop using a willing executive's cloned voice is the fastest way to surface the gap.

◆ Same day, different angle

Read this day as…

◆ Recent in security

Keep reading.