We're seeing a lot of project spring up with the intent of helping people solving context window bloat in Claude Code. Two tools caught my attention: one rewrites your shell commands through a Rust proxy, the other routes tool output through an MCP server with a SQLite backend. Both report dramatic token reductions. Both made me uncomfortable for reasons I go into in the article.

At the core of the dissonance was a governance question I keep asking in other contexts: Can you prove what happened? Runtime proxies compress what the AI sees, but you're trusting a process you can't easily inspect to decide what gets filtered. I built a Claude Code plugin that takes a different approach, one that's honestly less powerful but entirely visible and under your complete control.

The plugin analyzes your project and generates concise command variants that live in your repo as plain markdown. No binaries, no servers, no databases. Whether that tradeoff works for you depends on how much you care about the last 10% of edge cases versus knowing exactly what your AI was told to do.

Continue Reading

This article continues on my preferred publishing platforms. Choose your platform to read the full article: