Boris Cherny’s X thread argues that Claude Opus is well suited to long-running work, with the Anthropic staffer offering a set of tips for keeping the model busy for hours or days and pointing to a benchmark graphic that appears to favor Opus 4.8 on multi-hour software tasks.
Cherny lists five practices for extended autonomous runs: enable auto mode for permissions so Claude does not keep asking for approval; use dynamic workflows to orchestrate hundreds or thousands of agents; nudge the model with commands such as “/goal” or “/loop”; run Claude Code in the cloud; and give the model a way to self-verify its work end to end.
For web work, Cherny states that “Claude in Chrome” is preferable to Playwright or Chromium MCP for E2E testing, calling it “more powerful and more token-efficient.” In another reply, he adds that the most important ingredient is “self-verification” paired with dynamic workflows, with prompts aimed at testing results in a browser and looking for “edge cases and ui issues.”
The thread also includes a few caveats from other commenters. One user notes that such workflows seem more manageable when acceptance criteria are clear, while another asks about costs on enterprise accounts. Cherny responds that he thinks about the problem in terms of ROI rather than absolute cost, arguing that the same manual work can amount to “weeks or even months of engineering time.”
He also dismisses the idea that the command set needs to be manually driven, writing that those controls are “not designed for people to invoke them,” and that the model should be told what needs to happen so it can invoke the right skills itself. Later, when asked about mobile workflows, he replies simply: “Just tell claude to use a workflow.”
Cherny also tells one commenter to run “/usage” to see a breakdown of the specific skills, mcps, and plugins consuming tokens. When asked about long sessions and context issues, he states that “Context rot isn’t a thing with 4.8 imo,” though that remains his view rather than an independently verified conclusion.
Source: X post by Boris Cherny



