New paper says agentic coding scaling needs smarter reuse
Joongwon Kim and coauthors argue test-time scaling for long-horizon coding agents depends less on more sampling and more on carrying forward useful rollout information. Their summary-based RTV and PDR methods boost results on SWE-Bench Verified and Terminal-Bench v2.0.