Product daily

Edition 2026-05-29 · read as Product

AnthropicEndsHarnessDiscount,Triggers30-DayCostReset

Sources
36
Words
1,750
Read
9min

Topics Agentic AI LLM Inference AI Regulation

◆ The signal

Anthropic is killing the 70-90% implicit discount on third-party harness usage starting June 15 — every developer running Claude through Cursor, Cline, or OpenCode just got a 5-10x cost increase on that workflow. OpenAI responded within hours with 2 months free Codex for enterprise switchers, creating a 30-day decision window. ServiceNow burned its entire full-year Anthropic budget by May, proving this isn't theoretical. Your AI cost model has exactly 30 days to adapt before the invoice arrives.

◆ INTELLIGENCE MAP

  1. 01

    AI Cost Governance Crisis: June 15 Pricing Reset

    act now

    Anthropic splits first-party vs third-party credit pools June 15, ending 70-90% implicit subsidies. ServiceNow exhausted its full-year budget by May. OpenAI offers 2 months free Codex with 30-day deadline. The era of subsidized AI inference is ending — model your new COGS this week.

    70-90%
    implicit discount eliminated
    6
    sources
    • ServiceNow budget burn
    • OpenAI switch offer
    • Decision window
    • Anthropic ARR
    1. TodayAudit third-party usage
    2. June 15Credit pools split
    3. June 15 + 30dOpenAI offer expires
    4. Q3New pricing in full effect
  2. 02

    Enterprise Headless Layer: SAP + ServiceNow + Salesforce Ship MCP

    monitor

    Three of the five largest enterprise vendors converged on MCP as the agent execution standard in the same week. SAP committed €100M in partner funding plus a Knowledge Graph. ServiceNow decoupled workflows from UI via Action Fabric. Buyers now ask 'can our agents call this directly?' — products without headless APIs lose at shortlist stage.

    €100M
    SAP agent partner fund
    4
    sources
    • SAP fund
    • Agent bypass rate
    • RFP window
    • MCP build time
    1. 01SAP€100M fund + Knowledge Graph
    2. 02ServiceNowAction Fabric (MCP)
    3. 03SalesforceAgentforce + WhatsApp
    4. 04NotionExternal Agents API
  3. 03

    PM Role Compression: The 90% Building Ratio

    monitor

    Lovable ships with zero PMs. Elena Verna built and deployed an enterprise pricing page alone — work that previously needed PM + designer + engineer + one week. She reports 90% of time building, almost no meetings. AI tools compress the PM-designer-engineer triangle into a single high-context operator. The role survives on judgment, not coordination.

    90%
    time spent building
    1
    sources
    • Verna build ratio
    • Traditional team
    • HI-C time to ship
    • Lovable PMs employed
    1. HI-C Model90
    2. Traditional PM25
  4. 04

    AI Security: Full Network Takeover + Harness > Model

    monitor

    Anthropic's Mythos cleared both UK AISI attack ranges — the first model to achieve autonomous full network takeover. Mozilla found 271 bugs using the same-class model with a custom harness; curl found 1 CVE with generic scanning. The delta is 270 bugs of harness engineering. Security ROI lives in orchestration, not model access.

    271
    bugs found (with harness)
    5
    sources
    • Mozilla (with harness)
    • curl (without harness)
    • Exploit time collapse
    • Identity fraud TAM
    1. Mozilla (custom harness)271
    2. curl (generic scan)1
  5. 05

    Multi-Model Routing Is Now Default Architecture

    background

    Vercel's production data across 200K+ teams confirms 59% of token volume is agentic and most large deployments route across multiple providers. Anthropic captures 61% of spend (Opus), Google captures 38% of volume (Flash). No vendor loyalty persists beyond two release cycles. Single-provider lock-in is architectural debt, not a strategy.

    59%
    agentic token volume
    8
    sources
    • Agentic workloads
    • Anthropic spend share
    • Google volume share
    • Teams measured
    1. Agentic workloads59
    2. Traditional AI41

◆ DEEP DIVES

  1. 01

    Your AI Invoice Changes June 15 — The 30-Day Decision Window Is Open

    The Pricing Mechanics That Just Broke

    A backend lead opened her Cursor tab on a Tuesday morning, ran the same Claude-powered refactor she runs every Tuesday, and did not notice anything different. She will notice on June 15, when Anthropic splits third-party Claude tool usage (Cursor, Cline, Zed, OpenCode, T3 Code) into a separate credit pool equal to the plan's dollar value, with overages at full API rates. The implicit discount developers were riding — estimated at 70-90% below API pricing — is gone. The per-developer cost assumption for any team running Claude through a non-Anthropic harness is wrong by roughly an order of magnitude.

    This is not generosity ending. Anthropic hired a CFO and is aiming at an October 2026 IPO. The S-1 needs revenue-per-user numbers that implicit subsidies cannot produce. PMs building on Claude should model at least one more pricing adjustment before October.

    The ServiceNow Warning Shot

    ServiceNow CDIO Kellie Romack reported her team's full-year Anthropic budget was consumed by May 2026, five months into a twelve-month plan. She cannot say which users or workloads burned it, because Anthropic does not ship per-user telemetry. PagerDuty and National Life Group report the same. National Life's Nimesh Mehta called Anthropic "great for consumer usage but not great for companies."

    AI costs are structurally unpredictable, and the model providers have not built the instrumentation customers need to govern them.

    OpenAI's Displacement Offer

    Sam Altman offered 2 months of free Codex to enterprise customers who switch within 30 days, timed to the frustration. Ramp's data has Anthropic at 34.4% versus OpenAI's 32.3%. OpenAI lost the business adoption lead for the first time and is counter-attacking with displacement pricing.

    The Decision Framework

    Two axes. First: is Claude usage load-bearing for production workflows, or exploratory? Second: is the harness replaceable with Anthropic-native tooling at similar quality?

    • Load-bearing + replaceable: Renegotiate with Anthropic inside the 30-day window while leverage exists
    • Load-bearing + not replaceable: Pilot Codex on the free offer this week, not next month
    • Exploratory: Stop paying metered rates for exploration. Move to whichever vendor is currently subsidizing

    The 50% rate limit increase Anthropic bundled in is a grace period, not a concession. Build cost telemetry before June 15, not after.

    Action items

    • Model the cost impact of Anthropic's new credit structure on all third-party Claude usage by end of this week
    • Pilot OpenAI Codex on one production workflow using the 2-month free offer before the 30-day window closes
    • Ship per-customer, per-feature AI cost attribution dashboards before your next AI feature launch
    • Add AI usage governance (rate limiting, usage dashboards, admin spend caps) to your enterprise roadmap

    Sources:A product manager opened three vendor pricing pages this week · A finance lead at ServiceNow opened the Anthropic invoice in May · Your AI cost model breaks June 15 · A platform PM opened her integrations dashboard on Monday · Anthropic just flipped OpenAI in enterprise · A developer opened the Claude console on a Tuesday

  2. 02

    Enterprise Buyers Now Ask One Question: 'Can Our Agents Call This Directly?'

    Three Vendors, One Week, One Standard

    SAP, ServiceNow, and Salesforce all shipped autonomous agent architectures last week, and they converged on the same execution layer: headless workflows callable over MCP. The pitch deck calls this an industry standard emerging. What is actually happening is three vendors making the same platform bet at the same time. SAP backed it with a €100M partner fund and a Knowledge Graph for agent context. ServiceNow decoupled workflow logic from UI via Action Fabric, exposing it for any third-party agent to call. Companies do not stand up hundred-million-euro funds for features. They do it for bets they intend to defend for years.

    The Procurement Shift Already Happened

    A procurement manager at a Fortune 500 asked three enterprise software vendors the same question last week: "Can our agents call this directly, or do my people have to click through your UI?" Two vendors had no answer. The third moved to the next stage. That question is showing up in RFPs 2-3 quarters before most product teams expect it, which means the roadmap conversation is already late by the time the sales team forwards the email.

    If your APIs are not agent-consumable, agents will route around you to a competitor whose APIs are.

    What "Agent-Callable" Actually Requires

    The work is smaller than the deck will suggest for most teams: 1 week of scoping, 2-4 weeks of build, assuming the underlying API is not already broken. The separate decision — whether the core UI should be restructured around agents as primary users — is a roadmap question, not a sprint question. Teams conflate the two and ship neither. Getting the sequence wrong in either direction is expensive.

    Notion's Contrasting Play

    Notion did not build its own agent. It launched an External Agents API letting Claude, Codex, Cursor, and Devin operate directly inside Notion workspaces. That is the context-layer bet: make the product the shared workspace every agent needs. The 2x2 resolves on one axis. If the product holds collaborative data but cannot out-build foundation vendors on agent capability, the Notion path is correct. If it can ship a better agent for a specific workflow, building the agent is the right call. Hedging both is what most decks recommend and what most roadmaps cannot afford.

    The Signal in the Support Queue

    The forcing function is concrete. Pull the last twenty support tickets from the top-decile accounts. Count how many assume a human in the seat versus how many assume an agent is doing the work. If that ratio moved even ten points toward agents in the last two quarters, the headless layer is a retention bet on the next renewal cycle, not a platform bet on the next board meeting. Read the tickets before picking the sprint.

    Action items

    • Audit your product's API surface for agent-consumability — can a third-party AI agent discover, authenticate, and execute core workflows without UI?
    • Scope an MCP-compatible headless layer against your existing API by end of quarter
    • Evaluate SAP's €100M Autonomous Enterprise partner fund for product fit — application deadline likely within next quarter
    • Survey your top 20 enterprise accounts on agent usage plans and add findings to next quarter's planning inputs

    Sources:A customer success lead at a mid-market SaaS company opened her own product's API documentation · 59% of AI traffic is now agentic · Your AI cost model breaks June 15 · A designer on a mid-sized SaaS team spent six weeks this spring polishing the onboarding flow

  3. 03

    The PM Who Ships Alone: Lovable's Zero-PM Model and What Survives

    The Case Study

    Elena Verna shipped Lovable's enterprise pricing page to production alone. She is the former head of growth at Amplitude, Miro, Dropbox, and SurveyMonkey, and on the day she pushed that page live there was no PM scoping requirements, no designer on mocks, no engineer on the build. A workflow that used to take three roles and a week of calendar time collapsed into one person and a few hours. In her HI-C (High-Impact Contributor) role, she reports spending ~90% of time building, with almost no meetings.

    Lovable has zero product managers. Engineers talk to users, write specs, ship code, and read the feedback themselves. The company is growing fast enough that the absence is the model, not an oversight.

    What's Actually Being Unbundled

    The PM role is four jobs in a trenchcoat: user research, prioritization, spec-writing, and cross-team coordination. AI tools fold the first three into the builder, but only when the builder is the one talking to users. The fourth job disappears when the org is small enough that coordination happens in one channel. Ravi Mehta's framing is the right one: AI does not make a PM world-class at design or engineering. It makes them average-to-good at everything at once.

    The PMs who survive this shift look less like project managers and more like mini-GMs who happen to prototype and iterate directly.

    Where the Model Breaks

    The pattern works at Lovable's size. It breaks at some larger size, and nobody knows where. Two diagnostic questions separate the cells. One: is prioritization owned by a named person, or does it emerge from whoever shouts loudest? Two: do builders talk to users every week, or does feedback arrive filtered through a deck? Lovable sits in "named owner, direct contact." That cell runs without PMs. Every other cell still needs one.

    The Talent Risk for Established Companies

    Senior builders who can get autonomy and impact density at a flat org will leave to get it. Not all of them want a VP title. The best ones will go where they can spend 90% of the week creating instead of 90% aligning. Companies that ungate information access pull disproportionate talent. Companies that protect management layers end up with coordinators and no builders.


    The Framework for Monday

    Two axes. How much of the week goes to building versus coordinating, and whether the org lets a single operator ship a customer-facing surface without a handoff chain. If an enterprise pricing page takes the team a week and an HI-C a few hours, the gap is not speed. It is structure, and it compounds across every experiment and iteration cycle.

    Action items

    • Calculate your personal build-vs-coordinate ratio this week — log hours in meetings/alignment vs hours producing customer-visible output
    • Ship one small project end-to-end using AI tools (landing page, pricing experiment, internal tool) without engaging your cross-functional team
    • Identify 1-2 senior ICs or managers on your team who might be more productive with full autonomy and fewer reports — design a pilot HI-C structure
    • Rewrite your PM career narrative around judgment and strategy rather than coordination — update positioning for interviews, reviews, and internal visibility

    Sources:A product manager at a Series B company opened Lovable's careers page three times last month

  4. 04

    AI Security: The Harness Is the Moat, Not the Model

    The Capability Jump

    Anthropic's Mythos became the first model to clear both UK AISI simulated attack ranges, achieving full autonomous network takeover. The previous generation capped at 'advanced persistence.' OpenAI's GPT-5.5-cyber cleared one of the two. Both ran ahead of the trend line of AI cyber tasks doubling every few months. The old threat model assumed attackers could get a foothold and then needed a human to escalate. That assumption is stale.

    The Mozilla vs. curl Result Is the Product Lesson

    A Mozilla security engineer ran Claude Mythos Preview inside a custom agentic harness her team built. The harness writes reproducible test cases, scales across ephemeral VMs, and wires into the existing security lifecycle. It surfaced 271 bugs, including sandbox escapes, race conditions, and use-after-free vulnerabilities that fuzzers had missed for years.

    The same Mythos model, pointed at curl's 178K lines of C with generic scanning, found exactly 1 low-severity CVE out of 5 claimed issues. Daniel Stenberg's verdict: "primarily marketing."

    The delta between 271 and 1 is not the model. It is 270 bugs worth of harness engineering.

    What This Means for Product Security

    Here is what is actually happening to users. The time between vulnerability disclosure and weaponized exploit collapsed from weeks to hours. PraisonAI's auth bypass was actively exploited within 4 hours of disclosure. Patch SLAs of 30-60 days were designed for the old exploit economics. Those economics changed this week.

    Palo Alto Networks scanning with AI models found dozens of serious vulnerabilities across 130+ products. That is the base rate for any product with real surface area. Enterprise security teams will notice before most product orgs do. The first symptom is an RFP asking for AI-powered security testing evidence in the build pipeline.

    The Build Decision

    FactorCustom Harness (Mozilla)Generic Scanning (curl)
    Bugs found2711 CVE
    CostHigher upfrontLower per-scan
    RequiresBug corpus + triage pipelineModel access only
    Result qualityReproducible test cases"Primarily marketing"

    The framework is one question. Does the team already have a corpus of prior findings and a triage pipeline running? If yes, buy model access and build the harness. If no, the sprint work is building the corpus, not evaluating models. Picking one of those is better than hedging both.

    Action items

    • Compress your critical vulnerability patch SLA to <72 hours — update your incident response playbook to assume a 4-hour exploitation window post-disclosure
    • Pilot AI-assisted security testing on your most complex codebase, investing in harness design (bug corpus, triage pipeline, reproducible test cases) over model selection
    • Add 'AI-powered security testing evidence' to your enterprise sales preparation materials before it appears in RFPs
    • Commission a red-team exercise assuming AI-powered attackers can chain exploits in minutes, not days — update threat model document accordingly

    Sources:A staff engineer opened the build logs at 11pm on a Tuesday · A security engineer watched an automated tool chain three low-severity findings · A security engineer ran her team's standard pen test this week · A compliance lead at a mid-sized AI company opened the same three regulator pages · Anthropic's $900B raise + AI cybercrime confirmation

◆ QUICK HITS

  • Update: Anthropic overtook OpenAI in business adoption (34.4% vs 32.3% per Ramp) — Anthropic quadrupled YoY while OpenAI grew 0.3%; the Ramp data measures billing relationships, not usage volume

    A head of platform at a mid-size SaaS company opened her vendor dashboard on Monday

  • GPT-5.5 Instant reduces hallucinations 52.5% vs GPT-5.3 Instant and is now the free-tier ChatGPT default — reassess features previously parked on reliability grounds

    A designer on a mid-sized SaaS team spent six weeks this spring polishing the onboarding flow

  • Microsoft's agent memory architecture stabilizes at 400-500 memories with 97.2% retention precision — first production benchmark for persistent agent features; spec yours against this number

    A head of sales loaded the target account list on Monday

  • Only 15% of organizations have the data foundation for agentic AI, yet they're spending millions anyway — add data readiness qualification gates before enterprise contracts, not after

    A head of sales loaded the target account list on Monday

  • Duolingo reversed its blanket 'evaluate all employees on AI usage' mandate after finding ~20% slop rate and performative adoption — measure output quality, not tool logins

    Duolingo's 20% AI slop rate is your quality bar

  • AI persona drift quantified: significant degradation occurs within 8 dialogue rounds due to attention decay — add drift detection to acceptance criteria for any multi-turn AI feature

    AI persona drift quantified at 8 rounds

  • Google Gemini is leaking private phone numbers from training data to users — any product on foundation models needs output-layer PII detection, not just input filtering

    A user asked Gemini a routine question and got back someone else's phone number

  • Claude Code /goal command enables fully unattended multi-turn sessions with evaluator-judge pattern (Haiku model checks completion against measurable conditions) — reference architecture for any autonomous AI workflow

    A staff engineer kicked off Anthropic's autonomous coding mode on a Tuesday afternoon

  • Abridge's wedge-to-platform playbook: 80-100M medical conversations (data that doesn't exist in EHRs), $5.3B valuation, monthly enterprise releases in a regulated vertical — the three-act ladder (save time → save money → save lives) sequences by trust earned

    A clinician finishes a patient visit and, instead of typing notes for forty minutes

  • Google's Universal Commerce Protocol embeds Affirm + Klarna BNPL directly into Gemini and Search — if you touch payments or checkout, evaluate integration feasibility this quarter

    Google's Universal Commerce Protocol is your next integration decision

◆ Bottom line

The take.

You have 30 days before Anthropic's June 15 pricing change eliminates the 70-90% inference discount your team may be unknowingly relying on — model the cost impact this week, pilot OpenAI's free Codex offer before it expires, and build per-customer cost telemetry before the next feature launch. Simultaneously, SAP, ServiceNow, and Salesforce just made 'can agents call your product directly?' the new enterprise shortlist question. The PM who survives both shifts isn't the one coordinating — it's the one who can model the economics, ship the headless API, and instrument the outcomes without waiting for three other teams to unblock them.

— Promit, reading as Product ·

Frequently asked

What exactly changes on June 15 for Claude usage through tools like Cursor and Cline?
Anthropic is splitting third-party Claude tool usage into a separate credit pool sized to your plan's dollar value, with overages billed at full API rates. The implicit 70-90% discount developers were riding through harnesses like Cursor, Cline, Zed, OpenCode, and T3 Code disappears, producing roughly a 5-10x cost increase on those workflows.
How should I decide between renegotiating with Anthropic and switching to OpenAI's Codex offer?
Use two axes: whether Claude is load-bearing or exploratory, and whether the harness is replaceable with Anthropic-native tooling. Load-bearing and replaceable means renegotiate inside the 30-day window. Load-bearing and not replaceable means pilot Codex on the free 2-month offer this week. Exploratory work should move to whichever vendor is currently subsidizing.
Why couldn't ServiceNow identify which workloads burned its Anthropic budget?
Anthropic does not ship per-user telemetry, so ServiceNow, PagerDuty, and National Life Group cannot attribute consumption to specific users or workflows. The practical implication is that per-customer, per-feature AI cost attribution dashboards have to be built on the customer side before the next AI feature ships, not after the invoice arrives.
Is another Anthropic pricing change likely before the IPO?
Probably yes. Anthropic hired a CFO and is targeting an October 2026 IPO, and the S-1 needs revenue-per-user numbers that implicit subsidies cannot produce. PMs building on Claude should model at least one more pricing adjustment before October and treat the current 50% rate-limit bump as a grace period rather than a concession.
What should I have ready before the 30-day window closes?
Three artifacts: a cost-impact model of the new credit structure across all third-party Claude usage, a Codex pilot running on at least one production workflow under the free offer, and a draft of per-customer AI cost attribution so the next sprint planning has real numbers. Without those, the invoice on June 15 lands without context or alternatives.

◆ Same day, different angle

Read this day as…

◆ Recent in product

Keep reading.