Cursor launches Composer 2.5 with efficiency claims, doubled usage

Cursor has just rolled out Composer 2.5, calling it its most powerful model yet with better long-task performance and reliability on complex instructions. The company claims up to 10x efficiency and doubled included usage for the next week, though user reactions are mixed.

cursor cover

TL;DR

  • Composer 2.5 released: Positioned as more intelligent, reliable with complex instructions, and better for long-running tasks
  • Temporary quota change: Included usage of Composer 2.5 doubled for the next week
  • Efficiency claim: Reported as up to 10× more efficient than similarly capable models
  • Training changes: Scaled training, more complex RL environments, and text feedback during RL across long rollouts
  • Model base: Built on same open-source base as Composer 2, Moonshot’s Kimi K2.5
  • Benchmarks and feedback: CursorBench claim within 1.6% of Opus; mixed reactions, including 200k context concerns

Cursor introduced Composer 2.5, which the company describes as its "most powerful model yet," claiming it is "more intelligent," better suited to sustained work on long-running tasks, and more reliable at following complex instructions. Cursor also stated that included usage of the model will be doubled for the next week.

In a follow-up post, the company claimed Composer 2.5 is "up to 10x more efficient than similarly capable models." Cursor attributed the update to scaled training, more complex RL environments, and new learning methods, including what it described as text feedback during RL to help assign credit across rollouts spanning "hundreds of thousands of tokens."

Cursor also said Composer 2.5 is built on the same open-source base as Composer 2, Moonshot’s Kimi K2.5. In the same thread, the company added that it is working with SpaceXAI on a much larger model trained from scratch with "10x more total compute," citing Colossus 2’s "million H100-equivalents" and saying the effort could mark "a major leap in model capability."

Replies to the announcement were mixed. Some users asked whether the new model was based on Kimi 2.6 or whether it was open source, while others questioned how it compares on everyday coding tasks and rate limits. One commenter pointed to Cursor’s own benchmark figures, including a claim that Composer 2.5 sits within 1.6% of Opus on CursorBench hard tasks, while another criticized the numbers as self-reported. A separate reply noted a 200k context window as a blocker.

Source: Cursor

Continue the conversation on Slack

Did this article spark your interest? Join our community of experts and enthusiasts to dive deeper, ask questions, and share your ideas.

Join our community