Context Window Stress Test
Context Handling
Evaluates how well tools maintain accuracy when working with large codebases that exceed typical context windows.
Methodology
Each tool was tasked with making 5 specific changes across a 50,000-line TypeScript monorepo with 200+ files. The changes required understanding cross-module dependencies, shared types, and configuration files. Each tool was scored on the number of changes made correctly, the number of broken imports, any missed updates, and whether the project compiled after the changes.
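To make the "broken imports" metric concrete, here is a minimal sketch of one way such a count could be taken. It is not the benchmark's actual scoring harness, which the page does not describe. The names `findBrokenImports`, `BrokenImport`, `normalizePath`, and `dirname` are hypothetical, and the sketch assumes relative imports only, POSIX-style repo-relative paths, and resolution to `.ts`, `.tsx`, or `index.ts`. A real check would more likely rely on the TypeScript compiler itself.

```typescript
// Hypothetical sketch: count relative imports that do not resolve to a known file.
// `files` maps repo-relative POSIX paths to source text. All names are illustrative.

interface BrokenImport {
  file: string;
  specifier: string;
}

/** Resolve "." and ".." segments in a POSIX-style path. */
function normalizePath(path: string): string {
  const out: string[] = [];
  for (const segment of path.split("/")) {
    if (segment === "" || segment === ".") continue;
    if (segment === "..") out.pop();
    else out.push(segment);
  }
  return out.join("/");
}

/** Directory part of a repo-relative file path ("" for top-level files). */
function dirname(path: string): string {
  const idx = path.lastIndexOf("/");
  return idx === -1 ? "" : path.slice(0, idx);
}

function findBrokenImports(files: Map<string, string>): BrokenImport[] {
  const broken: BrokenImport[] = [];
  files.forEach((source, file) => {
    // Simplified pattern: matches `import ... from "x"` and `export ... from "x"`.
    // Side-effect imports (`import "./x"`) and dynamic imports are not covered.
    const importPattern = /(?:import|export)\s[^'"]*?from\s+['"]([^'"]+)['"]/g;
    let match: RegExpExecArray | null;
    while ((match = importPattern.exec(source)) !== null) {
      const specifier = match[1];
      if (!specifier.startsWith(".")) continue; // package imports are out of scope here
      const base = normalizePath(dirname(file) + "/" + specifier);
      // Assumed resolution order; real projects may also use path aliases.
      const candidates = [base + ".ts", base + ".tsx", base + "/index.ts"];
      if (!candidates.some((candidate) => files.has(candidate))) {
        broken.push({ file, specifier });
      }
    }
  });
  return broken;
}
```

In practice, running the project's type checker (for example `tsc --noEmit`) reports unresolved modules along with every other compile error, so a text scan like this is mainly useful as a quick, dependency-free approximation.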
| Tool | Changes Correct (/5, higher is better) | Broken Imports (count, lower is better) | Compiles Clean (1 = yes, 0 = no) | Time (min, lower is better) |
|---|---|---|---|---|
| Claude Code | 5 | 0 | 1 | 6.5 |
| Sourcegraph Cody | 4 | 1 | 0 | 9 |
| Cursor | 4 | 1 | 1 | 8 |
| Aider | 3 | 2 | 0 | 7.5 |
| Augment Code | 4 | 1 | 0 | 10 |
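
For readers who want to filter or sort these results, the rows above can be transcribed into typed records. This is only a sketch: the `StressTestResult` interface, its field names, and the `cleanCompilers` helper are assumptions made for illustration, and the 1/0 "Compiles Clean" column is represented as a boolean.

```typescript
// Illustrative transcription of the table above; field names are assumptions.
interface StressTestResult {
  tool: string;
  changesCorrect: number; // out of 5
  brokenImports: number;
  compilesClean: boolean; // the table encodes this as 1 (yes) or 0 (no)
  timeMinutes: number;
}

const results: StressTestResult[] = [
  { tool: "Claude Code", changesCorrect: 5, brokenImports: 0, compilesClean: true, timeMinutes: 6.5 },
  { tool: "Sourcegraph Cody", changesCorrect: 4, brokenImports: 1, compilesClean: false, timeMinutes: 9 },
  { tool: "Cursor", changesCorrect: 4, brokenImports: 1, compilesClean: true, timeMinutes: 8 },
  { tool: "Aider", changesCorrect: 3, brokenImports: 2, compilesClean: false, timeMinutes: 7.5 },
  { tool: "Augment Code", changesCorrect: 4, brokenImports: 1, compilesClean: false, timeMinutes: 10 },
];

/** Names of the tools whose run left the project compiling, in table order. */
function cleanCompilers(rows: StressTestResult[]): string[] {
  return rows.filter((row) => row.compilesClean).map((row) => row.tool);
}

console.log(cleanCompilers(results)); // ["Claude Code", "Cursor"]
```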