About

Token Compressor is a two-stage pipeline that compresses prompts before they reach an LLM. The first stage uses a local model (llama3.2:1b via Ollama) to rewrite prompts to their semantic minimum while preserving all conditionals and negations. The second stage validates compression quality by computing cosine similarity between original and compressed embeddings using nomic-embed-text, falling back to the original if similarity drops below a configurable threshold.

The system requires Ollama running locally and exposes a compress_prompt tool. It achieves 40-60% token reduction across English and Swedish prompts while maintaining semantic fidelity through the embedding validation gate.

Is this your project?

Claim this listing to manage your page, access analytics, and unlock upgrades. Verification takes 60 seconds.

Compare

Token Compressor vs Arize Phoenix Token Compressor vs Superglue Token Compressor vs Nx Console

List Your Project

Join the directory Ai agents read. Free forever.

Submit Your Project

Token Compressor

About

Is this your project?

Share This Project

Embed Badge

Compare

Similar Projects

Arize Phoenix

Superglue

Nx Console

List Your Project