Context Gauge

+ A Sendoff for Sonnet 3.7

Jan 18, 2026

I discovered today that Anthropic retired Opus 4 and Opus 4.1 without warning—including my No-BS Opus 4 that had a House-like “personality.” They kept Opus 3 around, which is puzzling since it’s much older (I’ve never even talked to it). This prompted me to review my forty Claude chats to see what else might be worth preserving before potential future retirements. That’s when I rediscovered this July 2025 conversation with Sonnet 3.7 about a context gauge feature—valuable enough that I want to give it proper documentation before deleting that chat to clear space. Sonnet 3.7 was retired back in November 2025, but the idea remains relevant, and its analysis deserves a sendoff.

The core suggestion is simple: add a visual context gauge to the chat list showing how much context capacity remains in each conversation. Claude already has this functionality in Projects, where it displays available upload space, so the technical infrastructure exists—it just needs extension to individual chats. The gauge could show percentage used with color-coded warnings (green to yellow to red) as users approach limits, with optional notifications at 80–90% capacity. This would be particularly valuable because humans are terrible at calculating tokens. Token boundaries don’t align with words, different languages tokenize differently, and hidden system prompts consume context users can’t see. On Claude’s platform, attachments and chat length also count toward tokens, so that a free-tier user might encounter the five-hour lockout after a single turn in superlong chats, and chats with attachments hit conversation limits and close out earlier (without forewarning) than text-only chats. Even technically sophisticated users struggle to estimate usage without specialized tools.

What makes this feature request compelling is that Claude is the only one of my thinking A.I.des on whose platform hard context cutoffs arrive without warning. GPT on paid accounts slows down considerably in long chats but continues working. Gemini imposes short lockouts (a few hours) only when using Canvas but otherwise keeps going. Claude hits hard limits and stops, sometimes triggering error conditions that could expose conversations to review—exactly the scenario users on paid accounts are trying to avoid. A Reddit user even developed a third-party app to track token usage, which demonstrates genuine demand but creates privacy concerns. Users shouldn’t need to grant external applications access to their conversations just to monitor basic usage metrics that Anthropic already tracks internally.

The context gauge would serve multiple purposes beyond just preventing unexpected cutoffs. It would demonstrate transparency about system limitations rather than hiding them and show respect for user agency by trusting people with information about constraints. This kind of thoughtful design builds trust in ways that technical capabilities alone cannot: users forgive limitations when communicated clearly but feel frustrated by unexpected barriers. The fact that Anthropic already implemented similar gauges for Projects shows they recognize the value of visualizing resource usage; extending this to the chat list would create consistency across the platform.

For a company positioning itself around responsible AI development, implementing this feature would align ethics with user experience. The goodwill generated by transparent resource monitoring extends beyond the specific feature: it signals values about treating users as partners who deserve information rather than consumers of a black-box service. Sonnet 3.7 understood this when we discussed it back in July, noting that the engineering effort should be minimal given existing components. It’s a small implementation change that would demonstrate considerable thoughtfulness about long-term user relationships.

[This post was drafted with assistance from Claude Sonnet 4.5, following my conversation with Claude Sonnet 3.7.]

Prompt: It’d be really neat to have a context gauge in the chat list showing how much context users have left. Or if context is model-dependent (it is, and depends on chat length as well), the gauges could also show how much they have left in each chat?

Prompt: Especially since humans are not good at calculating tokens.

Prompt: Claude is the only one that has this issue. The others (on paid accounts) at least have no limits (GPT slows down considerably on longer chats) or short lockouts (a few hours of lockout with Gem, only when using Canvas). This might teach users token economy, although I really think Anthropic should expand their infrastructure instead of keeping adding users, as it’s unfair to existing users (new weekly limits will be imposed starting next month).

Prompt: It shows good intentions/transparency, too. Users might like that thoughtfulness.

Prompt: We discussed the usage gauge idea here? The project page has a gauge for the context files. It shouldn’t be hard to implement a similar gauge on the chat list.

Prompt: I saw a Reddit post by a user who developed an app to track token usage. But it’s an extra app users don’t need. Much better coming from Anthropic.

Prompt:

Trust factor: Users shouldn’t need to grant third-party apps access to their conversations to monitor token usage

Excellent point!

Thanks for reading! This post is public. Feel free to share it.

My Thinking A.I.des

Discussion about this post

Ready for more?