← Token Book FREE CHAPTER
"I'm not doing anything and my quota keeps disappearing" — this is the most common complaint from Claude Code users. But you are doing something. Tokens are being consumed in places you can't see.
Every time Claude Code starts a turn, it loads the internal system prompt + your CLAUDE.md + tool definitions + MCP configs. The longer your CLAUDE.md, the higher the base cost per turn. 100 lines vs 500 lines makes a massive difference.
As conversation grows, past exchanges accumulate. If the prompt cache is working, these are reused cheaply. But when the cache breaks, the same prompt costs several times more (covered in Chapter 8 — 48 diagnostic symptoms).
Every file read, file write, and Bash execution consumes tokens. The output of ls -la vs cat large-file.txt differs by orders of magnitude. Tool results have an internal 200K token cap — exceed it and you get truncation → retry → more consumption.
Each subagent maintains its own context. Convenient, but overuse causes token explosion. One user lost 101K+ tokens to a subagent retry loop (#46968).
cat of a big file can cost 50K+ tokens/compact during long sessionsGet the full Token Book
10 chapters · 48 diagnostic symptoms · Copy-paste templates
Max plan ($200/month) saves ~10% = $20/month. Pays for itself in 1 month.
Free: Token Checkup — 5-question diagnostic, takes 30 seconds