token-saver
This skill optimizes workspace .md files and provides model-aware compaction control and a dashboard to reduce API costs. It reads the home config at ~/.openclaw/openclaw.json, scans sessions in ~/.openclaw/agents/main/sessions, and writes to AGENTS.md, where it adds a "Token Saver: Persistent Mode" section under a ## heading.
⚡ Token Saver v3
💡 Did you know? Every API call sends your workspace files (SOUL.md, USER.md, MEMORY.md, AGENTS.md, etc.) along with your message. These files cost real money on every message.
Token Saver v3 is model-aware: it knows your model's context window and adapts its recommendations accordingly.
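To make the per-message cost concrete, here is a rough Python sketch of how the overhead of workspace files could be estimated. The 4-characters-per-token heuristic, the function names, and the price you pass in are illustrative assumptions, not values or code taken from Token Saver.

```python
from pathlib import Path

CHARS_PER_TOKEN = 4  # assumption: common rough heuristic for English text


def estimate_tokens(text: str) -> int:
    """Approximate a token count from character length, rounded up."""
    return -(-len(text) // CHARS_PER_TOKEN)


def workspace_overhead(workspace: Path, names: list[str]) -> int:
    """Sum the estimated tokens of the workspace files that exist."""
    total = 0
    for name in names:
        path = workspace / name
        if path.is_file():
            total += estimate_tokens(path.read_text(encoding="utf-8"))
    return total


def cost_per_message(tokens: int, usd_per_million_input_tokens: float) -> float:
    """Input cost in USD of resending `tokens` with a single message."""
    return tokens * usd_per_million_input_tokens / 1_000_000
```

Calling `workspace_overhead(Path.home() / "workspace", ["SOUL.md", "AGENTS.md"])` (a hypothetical path) and feeding the result to `cost_per_message` with your provider's current input price gives a ballpark figure for what each message carries before you type anything.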
Quick Start
/optimize
That's it. You'll see a dashboard with your savings options.
What's New in v3
| Feature | v2 | v3 |
|---|---|---|
| Compaction presets | Fixed (80K/120K/160K) | Dynamic (% of model's context) |
| Model detection | Fragile, env-only | Robust fallback chain |
| Context windows | Not tracked | Full registry (9 models) |
| Savings estimates | Static | Model-aware pricing |
Dashboard Preview
╭──────────────────────────────────────────────────────────╮
│  ⚡ TOKEN SAVER v3                                       │
│  Reduce AI costs by optimizing what gets sent each call  │
╰──────────────────────────────────────────────────────────╯
🤖 **Model:** Claude Opus 4.5 (200K context)
Detected: openclaw.json
**Context Usage:** [████████░░░░░░░░░░░░] 42% (84K/200K)
**WORKSPACE FILES** (sent every API call)
┌──────────────────────┬───────┬────────────────┐
│ File                 │ Tokens│ Can Save       │
├──────────────────────┼───────┼────────────────┤
│ 🔴 AGENTS.md         │ 1180  │ -825 (70%)     │
│ 🟢 SOUL.md           │ 235   │ ✓ optimized    │
...
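The context bar and the per-file savings figure in the preview are simple arithmetic. Below is a minimal Python sketch, assuming a 20-character bar and rounding to whole percentages; the function names and the block characters are illustrative choices, not Token Saver's actual code.

```python
def usage_bar(used: int, window: int, width: int = 20) -> str:
    """Render a text progress bar for context usage."""
    if window <= 0:
        raise ValueError("window must be positive")
    # Clamp so that an over-full context still draws a valid bar.
    fraction = min(max(used / window, 0.0), 1.0)
    filled = round(fraction * width)
    bar = "█" * filled + "░" * (width - filled)
    percent = round(fraction * 100)
    return f"[{bar}] {percent}% ({used // 1000}K/{window // 1000}K)"


def savings_percent(current_tokens: int, saved_tokens: int) -> int:
    """Share of a file's tokens that compression would remove, in percent."""
    return round(100 * saved_tokens / current_tokens)
```

With the numbers from the preview, `usage_bar(84_000, 200_000)` fills 8 of 20 cells and reports 42%, and `savings_percent(1180, 825)` gives 70, matching the AGENTS.md row.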
Commands
| Command | What It Does |
|---|---|
| /optimize | Dashboard with files, models, context usage |
| /optimize tokens | Compress workspace files (auto-backup) |
| /optimize compaction | Chat compaction control (model-aware) |
| /optimize compaction balanced | Apply balanced preset (60% of context) |
| /optimize models | Detailed model audit with registry |
| /optimize revert | Restore backups, disable persistent mode |
Dynamic Presets
Presets adapt to your model's context window:
| Preset | % | Claude 200K | GPT-4o 128K | Gemini 1M |
|---|---|---|---|---|
| Aggressive | 40% | 80K | 51K | 400K |
| Balanced | 60% | 120K | 77K | 600K |
| Conservative | 80% | 160K | 102K | 800K |
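The table values follow directly from the percentages. As a minimal Python sketch (the names `PRESETS`, `compaction_threshold`, and `format_k` are hypothetical, and the rounding to the nearest thousand is inferred from the table rather than from the skill's source):

```python
# Preset name -> percentage of the model's context window.
PRESETS = {"aggressive": 40, "balanced": 60, "conservative": 80}


def compaction_threshold(context_window: int, preset: str = "balanced") -> int:
    """Token count at which compaction would trigger for a preset."""
    try:
        percent = PRESETS[preset]
    except KeyError:
        raise ValueError(f"unknown preset: {preset!r}") from None
    # Integer arithmetic avoids floating-point surprises.
    return context_window * percent // 100


def format_k(tokens: int) -> str:
    """Format a token count in thousands, e.g. 76800 -> '77K'."""
    return f"{round(tokens / 1000)}K"
```

For a 128K model, `format_k(compaction_threshold(128_000, "balanced"))` yields `77K`, the same figure shown in the GPT-4o column.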
Model Registry
9 models with context windows and pricing:
- Claude: Opus 4.5, Sonnet 4, Haiku 3.5 (200K)
- Gemini: 2.0 Flash, 2.5 Pro (1M)
- OpenAI: GPT-4o, GPT-4o Mini (128K)
- Others: DeepSeek V3 (64K), Kimi K2.5 (128K)
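A registry like this can be pictured as a plain lookup table. The Python sketch below uses the context windows listed above; the model identifiers, the normalization rules, and the function name are assumptions made for illustration, and pricing is left out because the figures are not given here.

```python
from typing import Optional

# Context windows in tokens, as listed above. Keys are hypothetical IDs.
CONTEXT_WINDOWS = {
    "claude-opus-4.5": 200_000,
    "claude-sonnet-4": 200_000,
    "claude-haiku-3.5": 200_000,
    "gemini-2.0-flash": 1_000_000,
    "gemini-2.5-pro": 1_000_000,
    "gpt-4o": 128_000,
    "gpt-4o-mini": 128_000,
    "deepseek-v3": 64_000,
    "kimi-k2.5": 128_000,
}


def context_window(model_id: str) -> Optional[int]:
    """Return the context window for a model, or None if it is unknown."""
    key = model_id.strip().lower()
    # Accept provider-prefixed IDs such as "anthropic/claude-opus-4.5".
    key = key.rsplit("/", 1)[-1]
    return CONTEXT_WINDOWS.get(key)
```

Returning `None` for an unknown model leaves the caller free to decide on a fallback instead of silently assuming a window size.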
File Compression
Before (verbose):
When Ruben greets me in the morning, I should proactively review our task list...
After (compressed):
MORNING: greeting → review(todos+pending+urgent)
Same meaning. 90% fewer tokens. Real savings.
Safety
- ✅ Auto-backup before any changes
- ✅ Smart bypass: skips already-optimized files
- ✅ One-command revert: /optimize revert
- ✅ No external calls: all local
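The backup-then-revert guarantee can be sketched in a few lines of Python. This is an illustration of the pattern, not the skill's implementation: the `.bak` suffix, the function names, and the skip-if-unchanged check are all assumptions.

```python
import shutil
from pathlib import Path

BACKUP_SUFFIX = ".bak"  # assumption: the real backup naming may differ


def backup_path(path: Path) -> Path:
    """Location of the backup copy next to the original file."""
    return path.with_name(path.name + BACKUP_SUFFIX)


def write_with_backup(path: Path, new_text: str) -> bool:
    """Back up `path`, then overwrite it. Returns False if nothing changed."""
    old_text = path.read_text(encoding="utf-8")
    if old_text == new_text:
        return False  # already optimized: leave the file and any backup alone
    shutil.copy2(path, backup_path(path))  # copy2 also preserves timestamps
    path.write_text(new_text, encoding="utf-8")
    return True


def revert(path: Path) -> bool:
    """Restore `path` from its backup if one exists."""
    backup = backup_path(path)
    if not backup.is_file():
        return False
    backup.replace(path)  # overwrites the current file with the backup
    return True
```

Copying before writing means a failed or unwanted compression never leaves you without the original text, and everything stays on the local filesystem.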
Install
# From ClawHub
clawhub install token-saver --registry "https://www.clawhub.ai"
# Or clone directly
git clone https://github.com/RubenAQuispe/token-saver.git
License
MIT. Use it, modify it, share it.