Tag

Harness

All content about Harness, organized for fast scanning.

6 itemsUpdated Jun 10, 2026

In Brief

Recent developments in autonomous coding agents highlight a shift towards goal-driven loops that enhance their efficiency and effectiveness. Innovations include open-source tools for vulnerability patching and improved architectural frameworks that significantly increase developer productivity. Additionally, there is a growing focus on creating long-running agents capable of maintaining context and memory across sessions, addressing current limitations in agent performance.

Timeline

Last 2 months. Hover a dot to preview the title.

2 months agoToday

InsightJun 10, 2026
Addy Osmani’s “Loop Engineering” hints at autonomous coding agents
Addy Osmani says coding agents may be shifting from prompt-by-prompt use to goal-driven loops that plan, split work, verify results, and repeat. His thread maps the core building blocks—and the token, security, and “exit condition” pitfalls. [https://x.com/addyosmani/status/2064127981161959567](htt…
02
InsightJun 10, 2026
Anthropic open-sources Claude harness for autonomous vulnerability patching
Anthropic has just rolled out Defending Code Reference Harness, an open-source reference for using Claude to find and fix vulnerabilities. It includes guided Claude Code skills plus a sandboxed C/C++ pipeline with ASAN and gVisor, though the repo is unmaintained.
- Security
03
NewsMay 22, 2026
Basis reveals monorepo overhaul for AI agents, boosting velocity
Basis shared how it reorganized its monorepo to better support coding agents, built around “canonical” context and a six-layer AGENTS.md and skills architecture. The company claims the changes drove 5x higher token usage per developer and 2.5x faster weekly commits.
- Agent File
04
NewsMay 22, 2026
How to build AI agents from first principles, not frameworks
Anshuman Mishra lays out a bottom-up recipe for agent training using a tiny text-to-diagram task. The key: start with a strict environment and reward loop, use SFT to learn valid actions, then apply RL to optimize behavior—and watch for reward hacking.
- LLM
NewsMay 8, 2026
AI’s next leap: long-running agents that persist beyond chat
A recent article by Addy Osmani explores “long-running agents” that can keep working across sessions without losing state. It outlines the key architectural patterns and why today’s agents still stall when context, memory, and verification break down.
InsightApr 27, 2026
New paper says agentic coding scaling needs smarter reuse
Joongwon Kim and coauthors argue test-time scaling for long-horizon coding agents depends less on more sampling and more on carrying forward useful rollout information. Their summary-based RTV and PDR methods boost results on SWE-Bench Verified and Terminal-Bench v2.0.

Synthesized from recent coverage

In Brief

Timeline

Last 2 months. Hover a dot to preview the title.

2 months agoToday

Browse all tags

Timeline

Addy Osmani’s “Loop Engineering” hints at autonomous coding agents

Anthropic open-sources Claude harness for autonomous vulnerability patching

Basis reveals monorepo overhaul for AI agents, boosting velocity

How to build AI agents from first principles, not frameworks

AI’s next leap: long-running agents that persist beyond chat

New paper says agentic coding scaling needs smarter reuse