Token Economics

Cost per correct answer. Why agent loops cost 10–100× more than baseline.

The Core Math

CKG
$0.000506
per correct answer
RAG
$0.013046
per correct answer
GraphRAG
$0.020098
per correct answer

CKG is 26× cheaper than RAG, 40× cheaper than GraphRAG. At enterprise scale (1M queries/month), that's $13M saved annually.

Where the Tokens Go

In a typical RAG system:

With CKG:

⚠️ Runaway Agent Loops: The Real Cost

⚠️ Disclaimer: This excludes runaway-loop incidents
The $13M annual savings assumes single-shot queries. In agentic workflows, ~70% of session tokens carry history the model no longer needs. When agents loop, context accumulates quadratically. Our baseline does NOT account for runaway costs — incidents where a single agent session burns 10–100× more tokens than expected.

Scenario: Agent Loop Cost Explosion

Without CKG: Context accumulation spiral

Iteration 1: 12K context
  → tool_result (3K tokens)
  → re-send entire context on next call

Iteration 2: 15K (original + result)
Iteration 3: 19K
Iteration 4: 24K
Iteration 5: 31K
Iteration 6: 41K
Iteration 7: 54K
Iteration 8: 71K

8 iterations = 247K tokens (not 96K baseline)
Cost: $7.41 per session (vs $0.002 expected)
With CKG: Fixed knowledge footprint

Iteration 1: 12K context + 274 tokens compiled knowledge
Iteration 2: 12K context + 274 tokens (knowledge re-arrives compiled, not history)
Iteration 3: 12K context + 274 tokens
...
Iteration 8: 12K context + 274 tokens

8 iterations = 98K tokens (stays within budget)
Cost: $0.003 per session (3,000× cheaper than RAG runaway)

The Three Cost Problems CKG Solves

  1. Token inflation (per query): RAG bloats queries with loosely-matched text. CKG compiles knowledge once, reuses it across all queries. 65× fewer tokens per query.
  2. Accumulation in loops: Agents send full conversation history on every call. CKG knowledge re-arrives at 274 tokens, not 71K, keeping costs linear not quadratic.
  3. Retry cycles: RAG failures trigger refinement loops (rerank, expand chunks, query expansion). CKG guarantees first-pass accuracy (0% hallucination rate), eliminating retry overhead.

Enterprise Impact

For a mid-market company deploying agents across 3 teams (10 agents, 2M queries/month, avg 5 iterations per session):

Learn More

Read the open CKG Compiler benchmark (PDF)
What is Retrieval Density Score (RDS)?
What is GAO? (Generative Agent Optimization)

Calculate your runaway-loop risk.

Book a token economics walkthrough