Why Our AI Coding Workflow Split Into Three Distinct Modes — And the Hotkeys That Switch Between Them

We don't 'use AI coding.' We run three workflows: greenfield autonomy (Claude Code), long-refactor persistence (Codex), daily editing (Windsurf). Each has a terminal alias, a model policy, and a cost ceiling. Here's the full map.

Published 2026-06-13

Why Our AI Coding Workflow Split Into Three Distinct Modes — And the Hotkeys That Switch Between Them

TL;DR: One tool can’t win at autonomy, context persistence, and daily editing simultaneously. We mapped each workflow to the tool that wins its axis: cc for greenfield, cx for 3–5 hr refactors, wind for daily type-heavy work. Total: ~$135/mo for 2 seats. Full workflow →

The Context

Two-dev team, 5 repos, 6–8 hrs/day AI-assisted. Tried forcing everything into one tool (Cursor, then Windsurf, then Claude Code). Each failed at something: Cursor lost context on long refactors; Claude Code too verbose for quick edits; Codex chat UI friction for terminal work. Stopped chasing “the one tool” and built a workflow map instead.

What We Tested

WorkflowTypical DurationContext NeedAutonomy NeedTool WinnerWhy Loser Failed
Greenfield feature (new API, component, infra)45–90 minLow (fresh context)High (test-write-debug loops)Claude CodeCursor: no autonomous loops; Codex: chat UI friction
Long refactor (migration, schema change, auth rewrite)3–5 hrsCritical (dev server, DB, tunnels)Medium (guided steps)CodexCursor: loses context at 90 min; Claude Code: session-only
Daily editing (type-heavy, bug fixes, small features)15–60 minLowLowWindsurfCursor: $20 mystery limits; Claude Code: verbose for quick edits
Model-flexible heavy lift (Terraform, security review)2–4 hrsMediumMediumCline (BYOK)All others: locked model routing

The Pivot Point

April 2026: Terraform refactor across 3 envs. Needed Sonnet for architecture, DeepSeek V3 for repetitive blocks, Opus for security review. Tried in Cursor — three separate chats, each hitting limits. Tried in Codex — can’t switch models mid-session. Cline + OpenRouter did it in one 4-hr session, $4.20 API cost. Realized: workflow ≠ tool. Workflow = (task type × context need × autonomy need × model need). Built the map.

What We Use Now

Terminal aliases in shared dotfiles (synced via chezmoi):

# ~/.config/chezmoi/dot_zshrc.tmpl
alias cc='claude-code'              # Greenfield, autonomous
alias cc-opus='cc --model opus --reason'  # Flagged Opus use
alias cx='codex'                    # Long refactors, persistent
alias wind='windsurf'               # Daily editing, Cascade
alias cline='code --extension cline'      # Model-flexible heavy lift

Workflow decision tree (printed on desk):

New feature / greenfield? → cc
  └─ Needs Opus (auth/infra/payments)? → cc-opus "reason"
Refactor / migration / debug > 90 min? → cx
  └─ Needs model switching? → cline
Daily edits / bug fixes / type-heavy < 60 min? → wind
  └─ Concurrent FE/BE? → wind (Cascade 3 agents)

Cost ceiling: $135/mo team (Claude Code pool $100 + Windsurf 2×$15 + Codex in existing ChatGPT Plus). Cline BYOK variable (~$10-20/mo). Hard cap enforced by monthly trackmycodes review.

When You’d Choose Differently

  • Solo dev: Pick ONE primary (Cursor or Windsurf or Claude Code) + one backup. Complexity not worth it.
  • Team > 5: Standardize on one tool for collaboration (Cursor Business or Windsurf Teams) — workflow splitting creates onboarding friction.
  • Enterprise compliance: Copilot Business or Cursor Business — audit trails, SSO, data residency.

Tool Crucible Rating

DimensionRating (1–5)Notes
Overall4.5Map > tool; eliminates “which tool?” paralysis
Ease of Use3Requires team discipline + shared aliases
Value5Right tool per task = no overage, no context loss
SupportN/AInternal framework

This is part of our AI Coding Tool Evaluation series. See full workflow: The Three-Mode AI Coding Workflow Map (2026 Edition)

Last reviewed 2026-06-13. See our methodology and affiliate policy.