Cursor Composer 2.5: Training a Coding Agent with Targeted Feedback and 25x More Tasks

Cursor’s Composer 2.5, built on Moonshot’s Kimi K2.5, uses a novel self-distillation RL technique and 25x more synthetic training tasks. Priced at $0.50/M input tokens.
artificial-intelligence
software-engineering
Author

Kabui, Charles

Published

2026-05-24

Keywords

cursor-composer, coding-agent, reinforcement-learning, kimi-k25, synthetic-data