Addy Osmani’s “Loop Engineering” hints at autonomous coding agents

Addy Osmani says coding agents may be shifting from prompt-by-prompt use to goal-driven loops that plan, split work, verify results, and repeat. His thread maps the core building blocks—and the token, security, and “exit condition” pitfalls. [https://x.com/addyosmani/status/2064127981161959567](htt…

June 10, 2026

•

Harness

TL;DR

Loop Engineering: Agents shift from “prompt, wait” to continuous plan/split/verify/repeat cycles until goals are met
Key building blocks: Scheduled automations, worktrees, skills, plugins/connectors, sub-agents, plus external memory layer
Tooling claim: Claude Code and Codex already include these components; emphasis moves to workflow design
Workflow examples: Recurring triage jobs, isolated branches, and separate verifier agents coordinating iterations
Risks noted: Token costs can climb quickly; quality control needs close oversight
Common concerns: Observability, memory files, security, and exit conditions; limits matter on paid plans

Addy Osmani’s “Loop Engineering” thread argues that coding agents may be moving past the old “prompt, wait, prompt again” routine and toward systems that keep finding work, splitting it up, checking results, and repeating until a goal is met. Osmani presents the idea as an early and somewhat uncertain direction for agent workflows, while warning that token costs can climb quickly and that quality control still needs a close eye.

The thread reduces the concept to a small set of building blocks: scheduled automations, worktrees, skills, plugins and connectors, and sub-agents, plus a separate memory layer that keeps state outside the conversation. Osmani claims that both Claude Code and Codex already ship those pieces, which makes the model less about a single tool and more about how the workflow gets designed.

He then walks through how those parts appear to fit together in real use, from recurring triage jobs to isolated branches and separate verifier agents. The thread draws a clear line between the older habit of manually driving every turn and a newer approach where the system itself keeps asking, checking, and routing work until something is finished.

The replies that follow focus on the rough edges rather than the novelty. Commenters raise familiar concerns about token budgets, observability, memory files, security, and exit conditions, while Osmani notes in follow-up posts that those trade-offs are real and that limits still matter, especially on paid plans.

For developers watching how agentic tooling is changing, the thread is a compact tour of both the promise and the friction. The full post goes deeper into the mechanics, the caveats, and the examples behind the idea.

Source: Addy Osmani on X

Continue the conversation on Slack

Did this article spark your interest? Join our community of experts and enthusiasts to dive deeper, ask questions, and share your ideas.

Join our community

AI’s next leap: long-running agents that persist beyond chat

A recent article by Addy Osmani explores “long-running agents” that can keep working across sessions without losing state. It outlines the key architectural patterns and why today’s agents still stall when context, memory, and verification break down.

May 8, 2026

1 shared tag

New paper says agentic coding scaling needs smarter reuse

Joongwon Kim and coauthors argue test-time scaling for long-horizon coding agents depends less on more sampling and more on carrying forward useful rollout information. Their summary-based RTV and PDR methods boost results on SWE-Bench Verified and Terminal-Bench v2.0.

Apr 27, 2026

1 shared tag

Anthropic open-sources Claude harness for autonomous vulnerability patching

Anthropic has just rolled out Defending Code Reference Harness, an open-source reference for using Claude to find and fix vulnerabilities. It includes guided Claude Code skills plus a sandboxed C/C++ pipeline with ASAN and gVisor, though the repo is unmaintained.

Jun 10, 2026

1 shared tag

Continue the conversation on Slack

Related Articles

AI’s next leap: long-running agents that persist beyond chat

New paper says agentic coding scaling needs smarter reuse

Anthropic open-sources Claude harness for autonomous vulnerability patching