Skip to content

API Reference

TokenPak exposes a REST API on the proxy port for telemetry, session management, and admin operations.

Base URL: http://localhost:8766


Authentication

Most endpoints require no authentication for local use. Admin and team endpoints require the X-Admin-Token header:

curl -H "X-Admin-Token: your-admin-secret" http://localhost:8766/v1/...

Telemetry

GET /v1/telemetry/summary

Overall statistics summary.

Response:

{
  "total_requests": 847,
  "total_tokens_in": 2140000,
  "total_tokens_saved": 891000,
  "total_cost_usd": 10.42,
  "compression_rate": 0.416,
  "since": "2026-01-01T00:00:00Z"
}


GET /v1/telemetry/sessions

List recorded sessions with optional filters.

Query Parameters:

Param Type Description
since date Start date (ISO 8601)
until date End date
model string Filter by model name
agent string Filter by agent name
min_cost float Minimum request cost (USD)
compressed_only bool Only compressed requests
limit int Max results (default: 100)
offset int Pagination offset

Response:

{
  "sessions": [
    {
      "id": "sess_abc123",
      "timestamp": "2026-03-05T14:23:11Z",
      "model": "claude-3-5-sonnet-20241022",
      "agent": "cali",
      "tokens_in": 4231,
      "tokens_in_compressed": 2847,
      "tokens_out": 612,
      "cost_usd": 0.0041,
      "compression_rate": 0.327,
      "recipe": "python-strip-comments",
      "latency_ms": 1243
    }
  ],
  "total": 23,
  "has_more": false
}


GET /v1/telemetry/export

Export session data.

Query Parameters:

Param Type Description
format string csv or json (default: json)
since date Start date
until date End date

Response: CSV or JSON file download.


GET /v1/telemetry/team

Team-wide aggregated stats. Requires admin token.

Response:

{
  "team_stats": {
    "total_requests": 1243,
    "total_cost_usd": 45.20,
    "total_tokens_saved": 3100000,
    "active_agents": 3
  },
  "by_agent": [
    {
      "agent": "cali",
      "requests": 542,
      "cost_usd": 19.40,
      "tokens_saved": 1320000
    }
  ]
}


GET /v1/telemetry/agents/{agent_id}

Per-agent detail. Requires admin token.

Response:

{
  "agent": "cali",
  "requests_today": 14,
  "cost_today_usd": 0.42,
  "requests_month": 542,
  "cost_month_usd": 19.40,
  "budget_remaining_usd": 80.60,
  "compression_rate": 0.41,
  "top_models": [
    { "model": "claude-3-5-sonnet", "requests": 380, "cost_usd": 14.20 }
  ]
}


Session Replay

GET /v1/replay/list

List replayable sessions.

Query Parameters:

Param Type Description
last int Return last N sessions
agent string Filter by agent

POST /v1/replay/{session_id}

Replay a session with optional overrides.

Request body:

{
  "compress": false,
  "model": "gpt-4o-mini",
  "diff": true
}

Response: Replay result with optional diff vs original.


Proxy Status

GET /v1/status

Proxy health and stats.

Response:

{
  "status": "ok",
  "version": "0.1.1",
  "uptime_seconds": 86400,
  "compression": {
    "enabled": true,
    "mode": "hybrid",
    "threshold_tokens": 4500
  },
  "session": {
    "requests": 23,
    "tokens_saved": 18341,
    "cost_usd": 0.042
  }
}


GET /v1/health

Detailed health check.

Response:

{
  "proxy": "ok",
  "database": "ok",
  "index": "ok",
  "compression_pipeline": "ok",
  "version": "0.1.1"
}


Budget

GET /v1/budget/status

Current budget status.

Response:

{
  "monthly_usd": 50.0,
  "spent_usd": 23.4,
  "remaining_usd": 26.6,
  "pct_used": 46.8,
  "alert_at_pct": 80,
  "on_exceeded": "warn"
}


Models

GET /v1/models

List of distinct models seen in session history.

Response:

{
  "models": [
    "claude-3-5-sonnet-20241022",
    "gpt-4o",
    "gpt-4o-mini"
  ]
}


Routing

GET /v1/routing/rules

List active routing rules.

Response:

{
  "rules": [
    {
      "pattern": ".*test.*",
      "model": "gpt-4o-mini",
      "created": "2026-03-01T10:00:00Z"
    }
  ]
}


POST /v1/routing/test

Test which model a prompt would be routed to.

Request:

{
  "prompt": "write unit tests for auth.py"
}

Response:

{
  "model": "gpt-4o-mini",
  "matched_rule": ".*test.*",
  "fallback": "claude-3-5-sonnet-20241022"
}