Why Our AI Coding Workflow Split Into Three Distinct Modes — And the Hotkeys That Switch Between Them

We don't 'use AI coding.' We run three workflows: greenfield autonomy (Claude Code), long-refactor persistence (Codex), daily editing (Windsurf). Each has a terminal alias, a model policy, and a cost ceiling. Here's the full map.

Published 2026-06-13

Why Our AI Coding Workflow Split Into Three Distinct Modes — And the Hotkeys That Switch Between Them

TL;DR: One tool can’t win at autonomy, context persistence, and daily editing simultaneously. We mapped each workflow to the tool that wins its axis: cc for greenfield, cx for 3–5 hr refactors, wind for daily type-heavy work. Total: ~$135/mo for 2 seats. Full workflow →

The Context

Two-dev team, 5 repos, 6–8 hrs/day AI-assisted. Tried forcing everything into one tool (Cursor, then Windsurf, then Claude Code). Each failed at something: Cursor lost context on long refactors; Claude Code too verbose for quick edits; Codex chat UI friction for terminal work. Stopped chasing “the one tool” and built a workflow map instead.

What We Tested

Workflow	Typical Duration	Context Need	Autonomy Need	Tool Winner	Why Loser Failed
Greenfield feature (new API, component, infra)	45–90 min	Low (fresh context)	High (test-write-debug loops)	Claude Code	Cursor: no autonomous loops; Codex: chat UI friction
Long refactor (migration, schema change, auth rewrite)	3–5 hrs	Critical (dev server, DB, tunnels)	Medium (guided steps)	Codex	Cursor: loses context at 90 min; Claude Code: session-only
Daily editing (type-heavy, bug fixes, small features)	15–60 min	Low	Low	Windsurf	Cursor: $20 mystery limits; Claude Code: verbose for quick edits
Model-flexible heavy lift (Terraform, security review)	2–4 hrs	Medium	Medium	Cline (BYOK)	All others: locked model routing

The Pivot Point

April 2026: Terraform refactor across 3 envs. Needed Sonnet for architecture, DeepSeek V3 for repetitive blocks, Opus for security review. Tried in Cursor — three separate chats, each hitting limits. Tried in Codex — can’t switch models mid-session. Cline + OpenRouter did it in one 4-hr session, $4.20 API cost. Realized: workflow ≠ tool. Workflow = (task type × context need × autonomy need × model need). Built the map.

What We Use Now

Terminal aliases in shared dotfiles (synced via chezmoi):

# ~/.config/chezmoi/dot_zshrc.tmpl
alias cc='claude-code'              # Greenfield, autonomous
alias cc-opus='cc --model opus --reason'  # Flagged Opus use
alias cx='codex'                    # Long refactors, persistent
alias wind='windsurf'               # Daily editing, Cascade
alias cline='code --extension cline'      # Model-flexible heavy lift

Workflow decision tree (printed on desk):

New feature / greenfield? → cc
  └─ Needs Opus (auth/infra/payments)? → cc-opus "reason"
Refactor / migration / debug > 90 min? → cx
  └─ Needs model switching? → cline
Daily edits / bug fixes / type-heavy < 60 min? → wind
  └─ Concurrent FE/BE? → wind (Cascade 3 agents)

Cost ceiling: $135/mo team (Claude Code pool $100 + Windsurf 2×$15 + Codex in existing ChatGPT Plus). Cline BYOK variable (~$10-20/mo). Hard cap enforced by monthly trackmycodes review.

When You’d Choose Differently

Solo dev: Pick ONE primary (Cursor or Windsurf or Claude Code) + one backup. Complexity not worth it.
Team > 5: Standardize on one tool for collaboration (Cursor Business or Windsurf Teams) — workflow splitting creates onboarding friction.
Enterprise compliance: Copilot Business or Cursor Business — audit trails, SSO, data residency.

Tool Crucible Rating

Dimension	Rating (1–5)	Notes
Overall	4.5	Map > tool; eliminates “which tool?” paralysis
Ease of Use	3	Requires team discipline + shared aliases
Value	5	Right tool per task = no overage, no context loss
Support	N/A	Internal framework

This is part of our AI Coding Tool Evaluation series. See full workflow: The Three-Mode AI Coding Workflow Map (2026 Edition)

Last reviewed 2026-06-13. See our methodology and affiliate policy.