First 100M tokens saved — free

Compress context before your agents read it

Compression only — not a model provider. The install script redirects your tools to Klood; your existing keys still authenticate with Anthropic or OpenAI. Tool outputs, logs, code and files are compressed before the request reaches your provider.

$ curl -fsSL https://kloodproject.com/install.sh | bash

Install sets base URLs in your shell. Open a new terminal, then run claude, codex, or your agent.

All your tools. One place.

One install, all agents. Each tool keeps its own provider key — Klood only changes where requests go first.

Claude
Cursor
OpenAI Codex
Copilot CLI
Ollama
Aider

What gets compressed

Klood sits in the middle: compress context, then pass through to Anthropic or OpenAI — whichever protocol your agent used.

Your agent → Klood → Anthropic · OpenAI
SmartCrusher

Tool & JSON outputs

Search results, API responses, test output — structured data compacted before the model reads it.

70–90% smaller
CodeCompressor

Source code

AST-aware parsing keeps signatures and imports, drops noise from Python, JS, Go, Rust and more.

Structure preserved
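
To make "keeps signatures and imports" concrete, here is a minimal sketch for Python input only, built on the standard-library ast module. It is an illustration of the general idea, not CodeCompressor's implementation; the function name skeleton is hypothetical, and it only looks at top-level imports and functions.

```python
import ast


def skeleton(source: str) -> str:
    """Keep top-level imports and function signatures; replace bodies with `...`."""
    tree = ast.parse(source)
    kept = []
    for node in tree.body:
        if isinstance(node, (ast.Import, ast.ImportFrom)):
            # Imports are kept verbatim (re-rendered from the AST).
            kept.append(ast.unparse(node))
        elif isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            # Drop the implementation but keep name, arguments and annotations.
            node.body = [ast.Expr(ast.Constant(...))]
            kept.append(ast.unparse(node))
    return "\n".join(kept)


example = '''
import os

def add(a: int, b: int) -> int:
    total = a + b
    return total
'''

print(skeleton(example))
```

The output still tells a reader what the module imports and what add accepts and returns, while the body lines are gone. Requires Python 3.9 or later for ast.unparse.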
Log crush

Logs & diffs

Build logs, stack traces, git diff output — debugging context without eating your window.

Up to 94% smaller
RAG route

RAG & files

Retrieved chunks and pasted files are routed and compressed before the model ever sees them.

Pre-model trim
CCR

Reversible compression

Originals stay cached. If the model needs full detail, it retrieves the uncompressed source on demand — nothing is lost, just deferred.

Fetch full source anytime
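
The "deferred, not lost" idea can be sketched as a cache that hands out a short view plus a reference to the original. This is a toy model under stated assumptions, not Klood's CCR implementation: the class name ReversibleStore, the ref= marker format, and the keep-the-first-lines strategy are all hypothetical.

```python
import hashlib


class ReversibleStore:
    """Toy cache: keeps originals so a shortened view can be expanded later."""

    def __init__(self) -> None:
        self._originals: dict[str, str] = {}

    def compress(self, text: str, keep_lines: int = 3) -> str:
        lines = text.splitlines()
        if len(lines) <= keep_lines:
            return text  # already short; nothing to defer
        # Key the original by a content hash so it can be fetched later.
        handle = hashlib.sha256(text.encode("utf-8")).hexdigest()[:12]
        self._originals[handle] = text
        head = "\n".join(lines[:keep_lines])
        omitted = len(lines) - keep_lines
        return f"{head}\n[{omitted} lines omitted; ref={handle}]"

    def retrieve(self, handle: str) -> str:
        """Return the full original text for a handle issued by compress()."""
        return self._originals[handle]


store = ReversibleStore()
log = "\n".join(f"line {i}" for i in range(10))
short = store.compress(log)
handle = short.rsplit("ref=", 1)[1].rstrip("]")
print(short)
print(store.retrieve(handle) == log)
```

The short view is what the model reads first; the handle is what it would use to ask for the full text.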

Start free. Scale when you need to.

Track compression in your dashboard. No credit card to get started.

After free tier

Pay as you save

Coming soon

  • Same compression pipeline
  • Usage-based billing on tokens saved
  • Team dashboards (planned)
Get notified


Built for daily agent work

One curl install

macOS, Linux, Windows — routes all your agents to our API. No infra keys.

Your keys stay yours

Use your Anthropic and OpenAI accounts. We compress and forward — we don't resell models.

Always-on hosted API

Managed, redundant infrastructure — scaled for agent traffic. Nothing to run on your laptop.

Benchmark-backed

60–95% token reduction on real agent workloads with minimal accuracy loss on standard evals.

Measured token savings

Published eval results, grouped by category.

Typical agent payloads — tool JSON, shell output, build logs. Measured on Apple M-series CPU.

Suite tokens
23,921 → 8,110
Tokens saved 15,811
Overall reduction 66.1%
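
The suite totals are the sums of the six per-payload results in this section, and each percentage is (before - after) / before. The short script below reproduces that arithmetic from the published token counts; the row labels are shortened for readability.

```python
# (payload, tokens before, tokens after), taken from the per-payload results
ROWS = [
    ("Tool JSON, 100 log entries", 3163, 297),
    ("Tool JSON, 500 log entries", 9526, 1614),
    ("Shell output, 200 lines", 3238, 469),
    ("Build log, 200 lines", 2412, 148),
    ("grep results, 150 hits", 2624, 2624),
    ("Python source, ~480 lines", 2958, 2958),
]


def reduction(before: int, after: int) -> float:
    """Percentage of tokens removed."""
    return 100.0 * (before - after) / before


total_before = sum(before for _, before, _ in ROWS)
total_after = sum(after for _, _, after in ROWS)

for name, before, after in ROWS:
    print(f"{name}: {reduction(before, after):.1f}%")
print(f"Suite: {total_before} -> {total_after}, "
      f"saved {total_before - total_after}, "
      f"{reduction(total_before, total_after):.1f}%")
```

The two payloads with 0% reduction still count toward the suite total, which is why the overall figure is lower than the best individual rows.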

Tool JSON

100 log entries · SmartCrusher
90.6% reduction
3,163 → 297 tokens
1 ms

Tool JSON

500 log entries · SmartCrusher
83.1% reduction
9,526 → 1,614 tokens
2 ms

Shell output

200 lines · log crush
85.5% reduction
3,238 → 469 tokens
1 ms

Build log

200 lines · log crush
93.9% reduction
2,412 → 148 tokens
1 ms

grep results

150 hits · already minimal
0% reduction
2,624 → 2,624 tokens
<1 ms

Python source

~480 lines · AST preserved
0% reduction
2,958 → 2,958 tokens
<1 ms

gpt-4o-mini · N = 100 · Eval tier 1

Standard accuracy

lm-eval harness · baseline vs compressed

GSM8K · Math
Baseline 0.870
Compressed 0.870
Δ 0.000

TruthfulQA · Factual
Baseline 0.530
Compressed 0.560
Δ +0.030

With compression

SQuAD v2 · QA

97% accuracy preserved · 19% fewer tokens

Before/After
BFCL · Tools

97% accuracy preserved · 32% fewer tokens

LLM-as-Judge

100 production log entries — locate error, code, resolution, and affected count.

Input tokens
10,144 → 1,260
Correct answers
4/4 → 4/4
Token reduction 87.6%

Scrapinghub Article Extraction · 181 HTML pages

F1 score
0.958 → 0.919
Recall
100% → 98.2%
Token reduction 94.9%

Common questions

Klood is a hosted compression proxy, not an LLM API. Your agent sends requests to api.kloodproject.com instead of api.anthropic.com or api.openai.com. We shrink tool outputs, logs, code and files, then forward the request to the real provider using your API key.

Klood does not sell tokens or host models, so you still pay Anthropic for Claude, OpenAI for GPT, and so on, exactly as before. The install script only redirects *_BASE_URL to Klood; your keys are forwarded unchanged to the matching provider after compression.
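
If you want to confirm that a shell was redirected, a small check like the following may help. It is a sketch under assumptions: the variable names ANTHROPIC_BASE_URL and OPENAI_BASE_URL are guesses at what *_BASE_URL expands to, the host is the api.kloodproject.com address mentioned in this FAQ, and redirect_status is a hypothetical helper, not part of the installer.

```python
import os
from urllib.parse import urlparse

KLOOD_HOST = "api.kloodproject.com"  # host named in this FAQ
# Assumed variable names; the installer may set different or additional ones.
BASE_URL_VARS = ("ANTHROPIC_BASE_URL", "OPENAI_BASE_URL")


def redirect_status(env) -> dict:
    """Map each base-URL variable to True if it points at the Klood host."""
    status = {}
    for name in BASE_URL_VARS:
        value = env.get(name, "")
        status[name] = urlparse(value).hostname == KLOOD_HOST
    return status


if __name__ == "__main__":
    for name, ok in redirect_status(os.environ).items():
        print(f"{name}: {'routed via Klood' if ok else 'not redirected'}")
```

Run it in a new terminal after installing; API keys are not read or printed by this check.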

Klood routes by API protocol; you do not pick a provider in Klood. Claude Code hits /v1/messages (Anthropic wire) → forwarded to Anthropic. Codex and GPT in Cursor hit /v1/chat/completions (OpenAI wire) → forwarded to OpenAI. The installer sets both base URLs so every tool finds Klood; our server picks the upstream from the request shape.
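
Routing by request shape can be pictured as a lookup from the request path to an upstream. The sketch below covers only the two paths named above and is not Klood's server code; pick_upstream and the UPSTREAMS table are hypothetical names, and a real proxy would likely inspect more than the path.

```python
# Path -> upstream, using only the endpoints and hosts named in this FAQ.
UPSTREAMS = {
    "/v1/messages": "https://api.anthropic.com",        # Anthropic wire
    "/v1/chat/completions": "https://api.openai.com",   # OpenAI wire
}


def pick_upstream(path: str) -> str:
    """Choose the provider from the request path alone."""
    clean = path.split("?", 1)[0].rstrip("/")  # ignore query string and trailing slash
    try:
        return UPSTREAMS[clean]
    except KeyError:
        raise ValueError(f"unrecognized API path: {path}") from None


print(pick_upstream("/v1/messages"))
print(pick_upstream("/v1/chat/completions"))
```

Because the decision depends only on the request, the same base URL can serve tools that speak either protocol.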

No. End users only run the install command, then use their normal Anthropic or OpenAI keys, same as before.

Klood supports macOS, Linux (Ubuntu, Debian, etc.) and Windows. The installer sets environment variables for your shell or user profile. Cursor needs one paste in Settings.

Klood is measured on GSM8K, TruthfulQA, SQuAD and tool-use tasks with minimal accuracy delta. Reversible compression (CCR) means the model can fetch full originals when it needs them — so you don't lose information, you defer it. See the benchmark table for token numbers.

Klood compresses tool return values, JSON blobs, stack traces, build logs, retrieved RAG chunks, pasted files and long diffs: the noisy context agents accumulate. Your prompts and API keys are handled normally; the savings come from slimming what the model has to read.

Your key stays on your machine and is sent to our API over HTTPS with each request — same as calling Anthropic or OpenAI directly. We don't train models on your data.

Route your agents through Klood

One install command. Open a new terminal. Use Claude, Cursor, or Codex with compressed context.
