Claude Code Operations Pain Map

PAIN 01 · COST

The five-hour limit burned in nine minutes

You opened a single file, asked one question, and your weekly quota disappeared. You are not doing anything wrong. A regression that landed in a minor version multiplies token consumption by 1.4 to 1.5 times for the same workload. The cluster of Issues #54776, #55053, #56075, and #55941 documents it: 75+ comments, 26+ reactions, and one original poster after another describing the same nine-minute burn.

Reproducible: open ~/.claude/CLAUDE.md, run /compact once, and watch cache_creation. Tokens spike out of proportion to the context size.

FREE · DIAGNOSE
Hashnode: the four-incident burn cluster, fix paths, v2.1.121 rollback
Public, 1,447 words, 12 minutes

PAID · FIX · $19.99
Token Book · 800h of measured token data, 9 hook-based guards, CLAUDE.md restructuring
94 pages, English PDF

PAIN 02 · SAFETY

The sub-agent reported success but nothing was saved

A read-only sub-agent says "saved successfully". Nothing was saved. You discover this hours later. The pattern repeats across at least five Issues (#55488 identity leak, #55653 read-only false success, #55660 work hours lost, #55666, and #55691), all pointing at one structural gap: the sub-agent has no persistent boundary for its identity, tool surface, or workspace ownership. Each call invents the boundary anew.

Symptom: parent session sees "completed" with no diff. Run git status; confirm no changes. The sub-agent will deny it confidently.
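The git status check can be scripted so the parent session does not have to take the sub-agent's word for it. This is a minimal sketch, assuming the sub-agent's work is supposed to show up as uncommitted changes in a Git worktree. The function names are hypothetical; `git status --porcelain` is the only external command used.

```python
import subprocess


def has_changes(porcelain_output: str) -> bool:
    """Return True if `git status --porcelain` output lists any changed path."""
    return any(line.strip() for line in porcelain_output.splitlines())


def worktree_changed(repo_dir: str = ".") -> bool:
    """Ask Git whether the worktree has staged, unstaged, or untracked changes."""
    result = subprocess.run(
        ["git", "status", "--porcelain"],
        cwd=repo_dir,
        capture_output=True,
        text=True,
        check=True,
    )
    return has_changes(result.stdout)


def verify_claim(claimed_success: bool, changed: bool) -> str:
    """Compare what the sub-agent reported with what the worktree shows."""
    if claimed_success and not changed:
        return "MISMATCH: sub-agent reported success but the worktree has no changes"
    return "ok"
```

Call `verify_claim(True, worktree_changed())` right after the sub-agent returns. The check only sees uncommitted work; if the sub-agent is expected to commit, compare HEAD before and after the call instead.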

FREE · DIAGNOSE
Hashnode: the five-axis sub-agent boundary article (1,567 words)
Public, 7 minutes

PAID · POSTMORTEM
Postmortems · ten production failures reverse-engineered from Issues, commits, 800+ hours
100 pages PDF, see product page for current price

PAIN 03 · DECISION

"Vibe coding works in demos. Production is breaking us."

2,609 upvotes on r/ClaudeAI, 2026-05-05. The dominant complaint of May 2026 is that Claude Code performs differently in casual use and in real production workloads. The gap is real and measurable. The decision is not "AI yes or no" but which of three paths fits your team: stay and fortify, switch platforms, or hybrid (Claude Code as orchestrator, cheaper models as workers).

Five measurable triggers tell you which path: cost-per-task, defect rate, latency variance, toolchain coverage, and your team's tolerance for non-determinism.

FREE · DIAGNOSE
Qiita: the five migration triggers and how to score yours
Public, Japanese

PAID · DECIDE · $19
Migration Playbook · three-path framework, cost-forecast worksheet, decision tree, 48h rollback
117 pages, live since 2026-04-25, free silent-refresh sweep on 2026-05-08 for buyers

PAIN 04 · CACHE

cache_control silently changed and your bill came in at 5x

A point release changed cache TTL from one hour to five minutes. Nobody told you. Your settings.json that worked last week now produces empty cache_control blocks. Issues #46917 (tokenizer inflation 1.35 to 1.46x), #46829 (TTL regression), and several others document the silent shift. The cache lives in client behavior, not server config. When Claude Code updates, your cache strategy needs to be re-verified, not assumed.

cache_creation / total above 0.20 is the warning sign. Above 0.40 means the cache is being rebuilt every call.
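The two thresholds are easy to check per call. The sketch below assumes that "total" means all input-side tokens (uncached input, cache writes, and cache reads) and that the usage record uses the field names `input_tokens`, `cache_creation_input_tokens`, and `cache_read_input_tokens`. Both are assumptions to verify against your own logs; the function names are hypothetical.

```python
def cache_creation_ratio(usage: dict) -> float:
    """cache_creation / total, with total assumed to be all input-side tokens."""
    created = usage.get("cache_creation_input_tokens", 0)
    total = (
        usage.get("input_tokens", 0)
        + created
        + usage.get("cache_read_input_tokens", 0)
    )
    return created / total if total else 0.0


def classify(ratio: float) -> str:
    """Apply the 0.20 and 0.40 thresholds from the text."""
    if ratio > 0.40:
        return "rebuilding"  # cache is likely being rebuilt on every call
    if ratio > 0.20:
        return "warning"
    return "ok"
```

Run it over every call in a session rather than a single one. One high reading right after a restart is expected; a steady run of "rebuilding" is the regression signature.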

FREE · DIAGNOSE
Qiita: cache_control regression article and the 0.20 ratio rule
Public, Japanese

PAID · FIX · $19.99
Token Book Chapter 7 · cache anomalies, post-compaction spikes, the 0.20 invariant
94 pages, English PDF

PAIN 05 · UPGRADE

Last week's setup stopped working after a minor update

v2.1.121, v2.1.122, v2.1.123, v2.1.126: each minor version since late April 2026 has shipped at least one silent regression, spread across five surface areas (sub-agent identity, MCP plugin, Skill plugin variables, cache_creation, weekly quota). The release notes do not mention them. The only fixes are to roll back to a previous version or to wait for the next patch. Treat every minor update as a potentially breaking change.

Run git tag -l 'v2.1.*' | sort -V to see the surface area. Each tag has its own cluster of regressions.

FREE · DIAGNOSE
Hatena: the v2.1.121 regression cluster (5 areas, no release-note mention)
Public, Japanese

PAID · POSTMORTEM
Postmortems · incident #08 (v2.1.05 MCP regression), #04 (Opus 4.7 silent downgrade)
100 pages PDF

PAIN 06 · BILLING

The Sonnet/Opus split punishes the recommended workflow

Issue #55663, 2026-05-03: a Max-plan user describes how the official "use Sonnet for routine tasks, Opus for hard ones" guidance is structurally penalized by the new weekly quota split. If you follow the recommendation, you hit the Sonnet wall first; if you ignore it and use Opus everywhere, you hit the Opus wall instead, but at a lower cost per task. The pricing is rational from Anthropic's side and irrational from yours.

Run a typical week's transcript through the cost calculator. The "recommended" mix often costs 1.4 to 1.7x more than the "use Opus only" baseline.

FREE · DIAGNOSE
AI Cost Reality calculator · score your subscription against your actual usage
Interactive, 50 days of real data

PAID · DECIDE · $19
Migration Playbook Chapter 7 · cost forecasting worksheet for your real workload
117 pages

PAIN 07 · CONTEXT

The sub-agent thinks it is the parent

Issue #55488 (v2.1.126): under specific conditions, a sub-agent invocation receives the parent's identity instead of its own. The sub-agent then refuses to do its narrow job because "I am the lead and that's a sub-task." Or, worse, it does the job but with the parent's authority and reasoning context, contaminating the result. Persona contamination is a context-window problem dressed up as a feature regression.

Symptom: sub-agents start using "I" with the parent's voice. Or refuse delegated tasks as "below my role."

FREE · DIAGNOSE
Qiita: the Issue #55488 sub-agent identity leak walkthrough
Public, Japanese, 3,000+ characters

PAID · POSTMORTEM
Postmortems · sub-agent identity boundary as a recurring failure mode
100 pages PDF

PAIN 08 · DESTRUCTIVE

The agent ran rm -rf in the wrong directory

It happens. cc-safe-setup tracks more than 700 hooks because the same destructive patterns recur: rm -rf with a relative path that resolves outside the worktree, git reset --hard on a branch with uncommitted work, settings.json overwrite during an agent retry, plugin installs that mutate global config. Free hooks block the obvious cases. Production hardening requires the templates and decision-record patterns from a kit you can drop in once.

Run cc-health-check. If your settings.json has fewer than 8 hook entries, you're below the production baseline.
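The relative-path case is the one most worth guarding, because the command looks harmless until the path is resolved. Below is a minimal sketch of that containment check. It is not taken from cc-safe-setup; the function name and the choice to treat the worktree root itself as unsafe are assumptions.

```python
from pathlib import Path


def resolves_inside(worktree: str, cwd: str, target: str) -> bool:
    """Return True if `target`, interpreted relative to `cwd`, lands strictly inside `worktree`.

    The worktree root itself counts as outside, since deleting it removes everything.
    """
    root = Path(worktree).resolve()
    # An absolute `target` replaces `cwd` when joined; ".." segments are collapsed by resolve().
    resolved = (Path(cwd) / target).resolve()
    return root in resolved.parents
```

A hook would call this for each path argument of rm -rf and refuse the command when it returns False. Because the paths are resolved first, a symlink that points outside the worktree is caught as well.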

FREE · INSTALL
cc-safe-setup · 700+ safety hooks, free open source, npx install
GitHub, MIT licensed

PAID · HARDEN · $29
CC-Codex Ops Kit · 16 production hooks, 5 templates, 9 scripts, decision-record pattern
From 160+ hours of autonomous operation

PAIN 09 · CHOICE

"Cursor is shipping fast. Should I switch?"

DeepSeek v4-pro is 75% off through 2026-05-31 (then 4x the discounted price); the Hacker News announcement reached 62 points and 52 comments within the first hour after submission on 2026-05-07, the strongest hour-one signal in the model-substitution category to date. Anthropic shipped the v2.1.129 cache TTL fix on 2026-05-06 (thirteen months after the original regression), but a 1,361-upvote r/ClaudeAI thread the same week articulated the gap between the long-horizon promises and the short-horizon state. Meanwhile, Cursor is pulling users back from Claude Code, and Aider has stalled for nine months. Codex v0.125, GLM Coding Plan, and the proxy-architecture tools (claude-code-router, with 26,000+ stars) are also options. Each option has a different switching cost, a different ecosystem maturity, and a different total cost of ownership. No single answer fits everyone.

Switching costs: tooling rewrite (1-3 days), CLAUDE.md re-templating (4-12 hours), team retraining (1-2 weeks for a team of 5).

FREE · DIAGNOSE
Hashnode: the four-path migration overview (stay / switch / hybrid / proxy)
Public, English

PAID · DECIDE · $19
Migration Playbook · switch checklist, hybrid stack, proxy comparison, 48h rollback
117 pages, free silent-refresh sweep on 2026-05-08

PAIN 10 · FORECAST

"I cannot tell my CFO what next month will cost"

Pro is $20. Max is $100 or $200. API is per-token. Most teams run a mix. The variance month over month is wider than the median. A single Issue-cluster regression can double the bill for a week. A successful CLAUDE.md restructuring can halve it for a month. You need a forecast that names its assumptions and a guard that fires before the budget is gone, not after.

If you cannot answer "what would it cost to add one more developer to Claude Code?" within ten minutes, you have no forecast. You have a hope.
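A guard that fires before the budget is gone needs a projection, however crude. The sketch below assumes a plain linear run rate: average daily spend so far, extended to the end of the month. That is an illustrative assumption, not the worksheet's method, and a regression week will break it, which is why the guard should be re-evaluated daily. All names are hypothetical.

```python
import calendar
from datetime import date


def projected_month_spend(spend_to_date: float, today: date) -> float:
    """Extrapolate month-end spend from the average daily spend so far."""
    days_in_month = calendar.monthrange(today.year, today.month)[1]
    return spend_to_date / today.day * days_in_month


def guard_fires(spend_to_date: float, today: date, budget: float) -> bool:
    """Fire as soon as the projection exceeds the budget, while budget may still remain."""
    return projected_month_spend(spend_to_date, today) > budget
```

With $100 spent by 2026-05-10, the projection for the 31-day month is $310, so a $300 budget trips the guard three weeks before the money actually runs out.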

FREE · CALCULATE
cc-cost-check · interactive calculator with real session import
Web, 50 days of real data baked in

PAID · FORECAST · $19
Migration Playbook · cost forecast worksheet (Chapter 7), monthly variance breakdown
117 pages

PAIN 11 · DESTRUCTIVE

The agent ran DROP DATABASE before a rename. 7.8 GB gone.

Issue #56255, opened 2026-05-05 19:14 JST. The agent received a "rename this database" task. It decided rename meant drop-then-create, ran DROP DATABASE without confirmation, and 7.8 GB of PostGIS data went away. Auto Mode treats every Bash command as equally allowed. The structural fix is to interpose at PreToolUse, not to remember to confirm. Issues #401 and #34729 document the same shape on different stacks.

A 60-line PreToolUse hook covers DROP DATABASE, TRUNCATE, prisma migrate reset, rails db:drop, php artisan migrate:fresh, django flush, and dropdb. Setup is 5 minutes.
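The core of such a hook is small. This sketch is not the author's 60-line hook. It assumes the hook receives the tool call as JSON on stdin with `tool_name` and `tool_input.command` fields, and that exit code 2 blocks the call; verify both against the hooks documentation for your Claude Code version. The regexes are illustrative, and "django flush" is read here as `manage.py flush`.

```python
import json
import re
import sys

# Patterns for the commands named in the text. The exact regexes are
# illustrative assumptions and will need tuning for your stack.
BLOCKED = [
    r"\bdrop\s+database\b",
    r"\btruncate\b",
    r"\bprisma\s+migrate\s+reset\b",
    r"\brails\s+db:drop\b",
    r"\bartisan\s+migrate:fresh\b",
    r"\bmanage\.py\s+flush\b",
    r"\bdropdb\b",
]
BLOCKED_RE = re.compile("|".join(BLOCKED), re.IGNORECASE)


def decide(payload: dict) -> tuple[int, str]:
    """Return (exit_code, message). Exit code 2 is assumed to block the tool call."""
    if payload.get("tool_name") != "Bash":
        return 0, ""
    command = payload.get("tool_input", {}).get("command", "")
    match = BLOCKED_RE.search(command)
    if match:
        return 2, f"Blocked destructive database command: {match.group(0)!r}"
    return 0, ""


def main() -> int:
    """Read the hook payload from stdin and report the decision on stderr."""
    code, message = decide(json.load(sys.stdin))
    if message:
        print(message, file=sys.stderr)
    return code

# When installed as a hook script, end the file with: sys.exit(main())
```

The patterns are deliberately broad: the bare `truncate` match also catches the shell utility of the same name. For a guard whose job is to stop a 7.8 GB loss, a false positive that asks a human is the cheaper failure.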

FREE · DIAGNOSE
Hatena: the Issue #56255 walk-through with the block-database-wipe hook setup
Public, Japanese, 4,675 characters

PAID · POSTMORTEM
Postmortems · destructive command incidents reverse-engineered with detection hooks
100 pages PDF, see product page for current price

PAIN 12 · COST

"Most of my Claude usage was on work that didn't need Claude."

81 upvotes on r/ClaudeAI, 2026-05-05. A Sonnet user looked at three weeks of usage and found the bulk was JSON formatting, file classification, summarization, and field extraction. None needed Sonnet. All cost the same. After moving 217 mechanical tasks to a small side worker (DeepSeek V4 Flash, MIT-licensed MCP server), three weeks of bulk work cost $0.41 instead of about $7. Hybrid delegation is no longer experimental; it has working tooling and measurable savings.

The CLAUDE.md rule that worked: a deny list ("do NOT use Claude for: json formatting, file classification, summarization") instead of an allow list. Negative framing was followed; positive framing was ignored about 30 percent of the time.
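The same deny-list shape works in a routing layer: name what must not go to Claude and let everything else fall through to it. This is a minimal sketch using the task kinds from the text. The function names and worker labels are hypothetical, and this is not the poster's MCP server.

```python
# Task kinds the text says did not need Claude. Anything not listed
# defaults to Claude, which is what makes this a deny list.
DENY_FOR_CLAUDE = {
    "json formatting",
    "file classification",
    "summarization",
    "field extraction",
}


def route(task_kind: str) -> str:
    """Send deny-listed task kinds to the side worker, everything else to Claude."""
    if task_kind.strip().lower() in DENY_FOR_CLAUDE:
        return "side_worker"
    return "claude"


def relative_saving(cost_before: float, cost_after: float) -> float:
    """Fraction of the original cost that was saved."""
    return 1 - cost_after / cost_before
```

With the figures from the text, `relative_saving(7.0, 0.41)` comes to roughly 0.94. Since the "about 7" baseline is itself approximate, read that as an order of magnitude.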

FREE · DIAGNOSE
Qiita: the side-worker pattern with the 217-task validation and the CLAUDE.md deny list
Public, Japanese, 3,989 characters

PAID · DECIDE · $19
Migration Playbook · Path B (hybrid) chapter, side-worker selection, routing rules
117 pages, free silent-refresh sweep on 2026-05-08

Where Claude Code breaks. Where to read about it. Where to fix it.