
Token Savings

AgentCTX’s token savings are measured, not estimated. These benchmarks come from real agent sessions comparing raw MCP tool usage against AgentCTX-managed sessions.

| Category | Without AgentCTX | With AgentCTX | Savings |
|---|---|---|---|
| Tool descriptions (70 tools) | 9,950 tokens | 850 tokens | 91% |
| Tool invocations (150 calls) | 7,350 tokens | 1,800 tokens | 76% |
| File search + read | 16,925 tokens | 1,290 tokens | 92% |
| Documentation search | 11,900 tokens | 890 tokens | 93% |
| Total per session | 46,125 tokens | 4,830 tokens | 90% |
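
As a quick check on the arithmetic, the savings column can be recomputed from the two token columns. The sketch below only uses the figures from the table; the names `BENCHMARKS` and `savings_percent` are illustrative and not part of AgentCTX.

```python
# Token counts copied from the table: (without AgentCTX, with AgentCTX).
BENCHMARKS = {
    "Tool descriptions (70 tools)": (9_950, 850),
    "Tool invocations (150 calls)": (7_350, 1_800),
    "File search + read": (16_925, 1_290),
    "Documentation search": (11_900, 890),
}


def savings_percent(without: int, with_ctx: int) -> float:
    """Return the percentage of tokens saved relative to the baseline."""
    return 100 * (without - with_ctx) / without


# Session totals are the sums of the four category rows.
total_without = sum(without for without, _ in BENCHMARKS.values())
total_with = sum(with_ctx for _, with_ctx in BENCHMARKS.values())

for name, (without, with_ctx) in BENCHMARKS.items():
    print(f"{name}: {savings_percent(without, with_ctx):.1f}% saved")
print(f"Total per session: {savings_percent(total_without, total_with):.1f}% saved")
```

Rounded to whole percentages, these values match the Savings column.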

Without AgentCTX, every tool’s full JSON schema is injected into the context window:

  • 70 tools × ~142 tokens per schema ≈ 9,950 tokens

With AgentCTX, agents discover tools via ?t queries and only load schemas they need:

  • Gateway tool (~300 tokens) + on-demand inspection (~550 tokens) = ~850 tokens
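
The discovery pattern described above can be pictured as a catalog that returns tool names cheaply and hands out a full schema only when asked. This is a minimal sketch of that idea, not AgentCTX's implementation: `LazyToolCatalog`, `search`, `inspect`, and the placeholder schemas are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class LazyToolCatalog:
    """Hypothetical catalog: lists tool names, loads schemas only on request."""

    schemas: dict[str, dict]
    loaded: set[str] = field(default_factory=set)

    def search(self, query: str) -> list[str]:
        # Return matching names only; no schema enters the context here.
        return sorted(name for name in self.schemas if query in name)

    def inspect(self, name: str) -> dict:
        # Load one full schema and record that it was requested.
        self.loaded.add(name)
        return self.schemas[name]


# Placeholder schemas; real tool schemas would be much larger.
catalog = LazyToolCatalog({
    "github.issues.create": {"type": "object", "required": ["title"]},
    "github.issues.list": {"type": "object"},
    "fs.read": {"type": "object", "required": ["path"]},
})

matches = catalog.search("issues")        # names only
schema = catalog.inspect(matches[0])      # one schema loaded on demand
print(matches, sorted(catalog.loaded))
```

Only the inspected schema is ever placed in the context, which is where the saving over injecting all 70 schemas up front comes from.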

Without AgentCTX, each tool call carries the full MCP JSON-RPC envelope:

  • (Request + response, ~49 tokens per call) × 150 calls = 7,350 tokens

With AgentCTX, CTX statements are compact:

  • >t github.issues.create title="Fix SSE" takes ~25 tokens, versus ~49 tokens for the equivalent raw MCP call
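
To see where the overhead comes from, the same call can be written both ways and compared by length. The sketch below builds an MCP `tools/call` request as JSON-RPC 2.0 and places it next to the CTX statement from the example. It covers the request only, and it uses character counts as a rough proxy, since token counts depend on the tokenizer; the tool name and arguments are taken from the example and assumed to map directly onto MCP's `name` and `arguments` fields.

```python
import json

# Raw MCP request: a JSON-RPC 2.0 envelope around the tool name and arguments.
mcp_request = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/call",
    "params": {
        "name": "github.issues.create",
        "arguments": {"title": "Fix SSE"},
    },
}
raw_mcp = json.dumps(mcp_request)

# The equivalent CTX statement from the example above.
ctx_statement = '>t github.issues.create title="Fix SSE"'

# Character counts are only a proxy for tokens, but show the relative overhead.
print(f"raw MCP request: {len(raw_mcp)} characters")
print(f"CTX statement:   {len(ctx_statement)} characters")
```

The envelope fields (`jsonrpc`, `id`, `method`, `params`) repeat on every call, so the fixed overhead scales with the number of invocations.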

The content-addressed store (CAS) deduplicates static content:

  • File contents stored once by SHA-256 hash
  • Subsequent references use 3-token hash pointers instead of full content
  • Documentation chunks indexed and retrieved by relevance, not dumped wholesale
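
The deduplication step can be illustrated with a small content-addressed store keyed by SHA-256. This is a sketch under stated assumptions rather than AgentCTX's actual store: the `ContentStore` class is hypothetical, and it returns the full hex digest because the encoding of the 3-token pointer is not described here.

```python
import hashlib


class ContentStore:
    """Hypothetical in-memory store that keeps each distinct blob once."""

    def __init__(self) -> None:
        self._blobs: dict[str, bytes] = {}

    def put(self, content: bytes) -> str:
        # The key is derived from the content, so identical content
        # always maps to the same key and is stored only once.
        digest = hashlib.sha256(content).hexdigest()
        self._blobs.setdefault(digest, content)
        return digest

    def get(self, digest: str) -> bytes:
        return self._blobs[digest]

    def __len__(self) -> int:
        return len(self._blobs)


store = ContentStore()
first = store.put(b"def main():\n    pass\n")
second = store.put(b"def main():\n    pass\n")  # same file read again

print(first == second, len(store))
```

A later reference to the same file can then carry the key instead of the content, and the reader resolves it with `get` only when the full text is needed.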

See Methodology for full details on:

  • Test environment and models
  • Workload definitions
  • Measurement instrumentation
  • Statistical validity