Published: 2026-06-20
Analysis & perspective

Why Better Models Can Break Your Agents: The Case for Harness Maintenance

Chapters / key moments (click to jump — plays here on the page)

Nate B Jones uses Vercel’s sales agent — which got better after the team deleted 80% of its tools — to argue the real 2026 agent story is maintenance, not building. The "harness" (the workbench around the model: sources, memory, tools, permissions, proof, stop conditions) must be continuously pruned and rebuilt, because agents break in two directions: the world around them drifts, and the model inside them improves.

Source video

"Don't build more AI agents until you watch this" by Nate B JonesWatch on YouTube →

Key Takeaways

  • Vercel studied a top sales rep’s real workflow, built an agent around it, then improved it by removing tools — "the beginner instinct is to add; the maintenance instinct is to ask what to remove."
  • Agents break when the model gets better, not just worse: a harness built to constrain a weak model can trap a stronger one, or hand a now-capable model too much reach.
  • Agents inherit the crud of the systems around them — a stale wiki or a drifted CRM field that’s merely annoying to a human becomes dangerous when a proactive agent acts on it.
  • Both OpenAI (Codex) and Anthropic (Claude Code) are winning by maintaining the harness — terminal, browser, computer use, memory, approvals, sandboxing, logs — not just shipping a smarter model.
  • Everyone has a "harness" whether they call it that or not: your project folders, memory, source docs, approval habits and verification loop. Ask what yours is, how you ship it, and what you’ll need to delete later.
  • A five-point checkup for any serious agent: what is it reading, what can it touch, what is its job now, what proof does it bring back, and is it still delivering value?

Weekly Digest — In Your Inbox

Get the week's top AI agent news, updates, and guides — every Friday.