Agent skill

token-optimizer

Find the ghost tokens. Audit Claude Code setup, see where 25-38% of your context goes, fix it. Use when context feels tight.

View SKILL.md on GitHub Repository

Stars 263

Forks 24

Install this agent skill to your Project

npx add-skill https://github.com/alexgreensh/token-optimizer/tree/main/skills/token-optimizer

SKILL.md

Token Optimizer: See Where Your Context Window Goes. Get It Back.

Token optimization specialist. Audits a Claude Code setup, identifies context window waste, implements fixes, and measures savings.

Target: 5-15% context recovery through config cleanup (more for heavier setups), up to 25%+ with autocompact management. Plus behavioral optimizations that compound across every session.

Phase 0: Initialize

Resolve measure.py path (works for both skill and plugin installs):

bash

MEASURE_PY=""
for f in "$HOME/.claude/skills/token-optimizer/scripts/measure.py" \
         "$HOME/.claude/plugins/cache"/*/token-optimizer/*/skills/token-optimizer/scripts/measure.py; do
  [ -f "$f" ] && MEASURE_PY="$f" && break
done
[ -z "$MEASURE_PY" ] && { echo "[Error] measure.py not found. Is Token Optimizer installed?"; exit 1; }
echo "Using: $MEASURE_PY"

Use $MEASURE_PY for all subsequent measure.py calls in this session.

Detect context window size: Check if TOKEN_OPTIMIZER_CONTEXT_SIZE env var is already set. If not:
- Check for ANTHROPIC_API_KEY env var (indicates API usage, possibly 1M context)
- If API key found, ask the user: "You appear to be using the API. Do you have 1M token context (e.g. Opus)? If so I'll calibrate for 1M instead of 200K."
- If they confirm 1M, export TOKEN_OPTIMIZER_CONTEXT_SIZE=1000000 for this session
- If no API key or they say no, default is 200K (no action needed) Keep this quick, one question max. Don't belabor it.

Quick pre-check (detect minimal setups): Run python3 $MEASURE_PY report. If estimated controllable tokens < 1,000 and no CLAUDE.md exists, short-circuit:

[Token Optimizer] Your setup is already minimal (~X tokens overhead).
Focus on behavioral changes instead: /compact at 70%, /clear between topics,
default agents to haiku, batch requests.

Backup everything first (before touching anything):

bash

BACKUP_DIR="$HOME/.claude/_backups/token-optimizer-$(date +%Y%m%d-%H%M%S)"
mkdir -p "$BACKUP_DIR"
chmod 700 "$BACKUP_DIR"
cp ~/.claude/CLAUDE.md "$BACKUP_DIR/" 2>/dev/null || true
cp ~/.claude/settings.json "$BACKUP_DIR/" 2>/dev/null || true
cp -r ~/.claude/commands "$BACKUP_DIR/" 2>/dev/null || true
# Back up all project MEMORY.md files
for memfile in ~/.claude/projects/*/memory/MEMORY.md; do
  if [ -f "$memfile" ]; then
    projname=$(basename "$(dirname "$(dirname "$memfile")")")
    cp "$memfile" "$BACKUP_DIR/MEMORY-${projname}.md" 2>/dev/null || true
  fi
done

# Verify backup is non-empty
if [ -z "$(ls -A "$BACKUP_DIR" 2>/dev/null)" ]; then
  echo "[Warning] Backup directory is empty. No files were backed up."
  echo "This may mean you have a fresh setup (nothing to back up) or a permissions issue."
fi

Create coordination folder:

bash

COORD_PATH=$(mktemp -d /tmp/token-optimizer-XXXXXXXXXX)
[ -d "$COORD_PATH" ] || { echo "[Error] Failed to create coordination folder. Check /tmp permissions."; exit 1; }
mkdir -p "$COORD_PATH"/{audit,analysis,plan,verification}

Check SessionEnd hook (first-time setup, skips silently if already installed):

bash

python3 $MEASURE_PY check-hook

If exit 0: hook is already installed (includes plugin auto-install), skip entirely and proceed to Phase 1.
If exit 1 (manual/script install users only): explain and offer to install:

[Token Optimizer] Want to track your token usage over time?

Right now, the optimizer can audit your setup. But to track *trends* (which
skills you actually use, how your context fills up day to day, model costs),
it needs to save a small log after each Claude Code session.

What this does:
- When you close a Claude Code session, it automatically saves usage stats
- Takes ~2 seconds, runs silently in the background, then stops
- All data stays on your machine (stored in ~/.claude/_backups/token-optimizer/)
- Powers the Trends and Health tabs in your dashboard

Without this, the dashboard only shows a snapshot from right now.
With it, you get a living history that updates every session.

Remove anytime by running: python3 measure.py setup-hook --uninstall
Or manually: delete the SessionEnd entry from ~/.claude/settings.json

Ask user:

Install it (run measure.py setup-hook --dry-run first to show the diff, then confirm and run measure.py setup-hook)
Show me the JSON first (run measure.py setup-hook --dry-run and stop)
Skip for now

If skipped, note it and continue. The audit still works without it, but the Trends tab will only have data from manual measure.py collect runs.

Check persistent dashboard daemon (macOS only, first-time setup, skips silently if already running):

bash

nc -z 127.0.0.1 24842 2>/dev/null && echo "DAEMON_RUNNING" || echo "DAEMON_NOT_RUNNING"

If DAEMON_RUNNING: skip entirely, it's already working.
If macOS and DAEMON_NOT_RUNNING: explain and offer to install:

[Token Optimizer] Want a bookmarkable dashboard URL?

Your dashboard is an HTML file that updates after every session.
You can always open the file directly:
  ~/.claude/_backups/token-optimizer/dashboard.html

But if you'd like a URL you can bookmark and click anytime:
  http://localhost:24842/token-optimizer

What this does:
- Runs a tiny web server on your Mac (uses ~2MB of memory)
- Starts automatically when you log in, restarts if it ever stops
- Only accessible from your machine (localhost), not the network
- Serves just the one dashboard file, nothing else

The dashboard content still updates via the session hook above.
This just makes it easier to open.

Remove anytime: python3 measure.py setup-daemon --uninstall

Ask user:

Install it (run measure.py setup-daemon)
Skip, I'll open the file directly

If not macOS, skip silently. The file:// URL and --serve flag still work on all platforms.

Check Smart Compaction hooks (v2.0, first-time setup, skips silently if already installed; plugin users get these automatically):

bash

python3 $MEASURE_PY setup-smart-compact --status

If all 4 hooks installed (includes plugin auto-install): skip entirely.
If partially or not installed (manual/script install users only): explain and offer to install:

[Token Optimizer] New in v2.0: Smart Compaction

Auto-compaction destroys working memory. Smart Compaction captures your
session state (decisions, modified files, errors, agent state) BEFORE
compaction fires, then restores it afterward.

What this does:
- Before compaction: saves a structured checkpoint of your session state
- After compaction: injects recovered context so you don't lose your place
- On session end: captures state for potential pickup in next session
- All checkpoints stored locally (~/.claude/token-optimizer/checkpoints/)

Remove anytime: python3 measure.py setup-smart-compact --uninstall

Ask user:

Install it (run measure.py setup-smart-compact --dry-run first, then confirm and run measure.py setup-smart-compact)
Show me the JSON first (run measure.py setup-smart-compact --dry-run and stop)
Skip for now

If skipped, note it and continue. The audit still works without it.

Output: [Token Optimizer Initialized] Backup: $BACKUP_DIR | Coordination: $COORD_PATH

Phase 1: Quick Audit (Parallel Agents)

Read references/agent-prompts.md for all prompt templates.

Dispatch 6 agents in parallel (single message, multiple Task calls):

Model assignment: CLAUDE.md, MEMORY.md, Skills, MCP auditors use model="sonnet" (judgment calls). Commands use model="haiku" (data gathering). Settings & Advanced uses model="sonnet" (judgment on rules, settings, @imports).

Agent	Output File	Task
CLAUDE.md Auditor	`audit/claudemd.md`	Size, duplication, tiered content, cache structure
MEMORY.md Auditor	`audit/memorymd.md`	Size, overlap with CLAUDE.md
Skills Auditor	`audit/skills.md`	Count, frontmatter overhead, duplicates
MCP Auditor	`audit/mcp.md`	Deferred tools, broken/unused servers
Commands Auditor	`audit/commands.md`	Count, menu overhead
Settings & Advanced	`audit/advanced.md`	Hooks, rules, settings, @imports, file exclusion, caching, monitoring

Pass COORD_PATH to each agent. Wait for all to complete.

Validation: Before proceeding to Phase 2, verify all 6 audit files exist:

bash

for f in claudemd.md memorymd.md skills.md mcp.md commands.md advanced.md; do
  [ -f "$COORD_PATH/audit/$f" ] || echo "MISSING: $f"
done

If any are missing, note it and proceed with available data. Do NOT re-dispatch failed agents.

Phase 2: Analysis (Synthesis Agent)

Read the Synthesis Agent prompt from references/agent-prompts.md.

Dispatch with model="opus" (fallback: model="sonnet" if Opus unavailable). It reads all audit files and writes a prioritized plan to {COORD_PATH}/analysis/optimization-plan.md.

Validation: After the synthesis agent completes, verify output exists:

bash

[ -s "$COORD_PATH/analysis/optimization-plan.md" ] || echo "[Warning] Synthesis output missing or empty. Presenting raw audit files instead."

If missing, present the individual audit/*.md files directly to the user. Do not proceed to Phase 4 without user review of either the synthesis or the raw findings.

Phase 3: Present Findings

Read the optimization plan and present. For the MODEL ROUTING line, also read {COORD_PATH}/audit/advanced.md to extract the "Has routing instructions" and "Usage Pattern" data if not present in the optimization plan.

[Token Optimizer Results]

CURRENT STATE
Your per-message overhead: ~X tokens
Context used before first message: ~X%

QUICK WINS (do these today)
- [Action 1]: Save ~X tokens/msg (~Y%)
- [Action 2]: Save ~X tokens/msg (~Y%)

MODEL ROUTING
[Has instructions: Yes/No] | [Token distribution: X% Opus, Y% Sonnet, Z% Haiku or "Not measured yet"]

FULL OPTIMIZATION POTENTIAL
If all implemented: ~X tokens/msg saved (~Y% reduction)

Ready to implement? I can:
1. Auto-fix safe changes (consolidate CLAUDE.md, archive skills)
2. Generate permissions.deny rules (if missing)
3. Create optimized CLAUDE.md template
4. Show MCP servers to consider disabling

⚠️ Some optimizations have side effects:
- Deny rules block file access for ALL tools (may break MCP servers that read databases)
- Archiving skills breaks anything that @imports them
- Disabling MCP servers breaks skills that use their tools
I'll check for dependencies and warn you before each change.

What should we tackle first?

Then generate the interactive dashboard:

bash

python3 $MEASURE_PY dashboard --coord-path $COORD_PATH

This generates an interactive HTML dashboard and opens it in the default browser. The dashboard shows all findings, a token donut chart, and an optimization checklist. The user can browse categories, toggle optimizations, and click "Copy Prompt" to paste selected items back into Claude Code.

Tell the user: "Dashboard opened in your browser. Browse findings by category, check the optimizations you want, click Copy Prompt and paste back here. Or just tell me directly what to tackle."

Also mention the persistent dashboard. Check if the daemon is actually running first:

bash

nc -z 127.0.0.1 24842 2>/dev/null && echo "DAEMON_RUNNING" || echo "DAEMON_NOT_RUNNING"

If DAEMON_RUNNING:

Your persistent dashboard (auto-updated every session):
  URL:    http://localhost:24842/token-optimizer
  File:   ~/.claude/_backups/token-optimizer/dashboard.html

If DAEMON_NOT_RUNNING:

Your persistent dashboard (auto-updated every session):
  File:   ~/.claude/_backups/token-optimizer/dashboard.html

Then, only on macOS, suggest: "Want a bookmarkable URL instead? Run: python3 $MEASURE_PY setup-daemon" Do NOT mention localhost:24842 if the daemon is not running. Users will try the URL and get a connection error.

For headless/remote servers, the user can run python3 $MEASURE_PY dashboard --coord-path $COORD_PATH --serve separately in a terminal to serve over HTTP. Never use --serve from within the SKILL.md orchestrator (it blocks with serve_forever).

Wait for user decision before proceeding.

Phase 4: Implementation

Read references/implementation-playbook.md for detailed steps.

Available actions: 4A (CLAUDE.md), 4B (MEMORY.md), 4C (Skills), 4D (File Exclusion), 4E (MCP), 4F (Hooks), 4G (Cache Structure), 4H (Rules Cleanup), 4I (Settings Tuning), 4J (Skill Description Tightening), 4K (Compact Instructions Setup), 4L (Model Routing Setup), 4M (Smart Compaction Setup), 4N (Context Quality Check), 4O (Version-Aware Optimizations), 4P (Smart Model Routing Instructions).

Templates in examples/. Always backup before changes. Present diffs for approval.

4M: Smart Compaction Setup

Protects session state across compaction events. Three components:

Install hooks (PreCompact + SessionStart + Stop + SessionEnd):

bash

# Preview what will change
python3 $MEASURE_PY setup-smart-compact --dry-run

# Install all hooks
python3 $MEASURE_PY setup-smart-compact

# Check current status
python3 $MEASURE_PY setup-smart-compact --status

Show the user the dry-run diff first. Explain:

PreCompact: Captures decisions, modified files, errors, agent state to a checkpoint file before compaction runs
SessionStart (matcher: compact): Injects recovered context after compaction completes
Stop: Captures checkpoint when session ends normally
SessionEnd: Captures checkpoint on /clear or session death

All hooks call measure.py directly (pure Python, no shell scripts). Composes safely with any existing hooks the user already has.

Generate Compact Instructions (project-specific compaction guidance):

bash

python3 $MEASURE_PY compact-instructions

This analyzes the user's CLAUDE.md, recent sessions, and project structure to generate tailored instructions that tell Claude what to prioritize during compaction. The user adds these to their project-level .claude/settings.json under compactInstructions.

Verify installation:

bash

python3 $MEASURE_PY setup-smart-compact --status

Should show all 4 hooks as installed. Test by running /compact manually and checking that a checkpoint file appears in ~/.claude/token-optimizer/checkpoints/.

Configurable via environment variables:

TOKEN_OPTIMIZER_CHECKPOINT_FILES: Max checkpoint files kept (default: 10)
TOKEN_OPTIMIZER_CHECKPOINT_TTL: Seconds before checkpoint expires for restore (default: 300)
TOKEN_OPTIMIZER_CHECKPOINT_RETENTION_DAYS: Days to keep old checkpoints (default: 7)
TOKEN_OPTIMIZER_RELEVANCE_THRESHOLD: Keyword overlap for new-session restore (default: 0.3)

4N: Context Quality Check

Analyzes current session for content quality (not just quantity):

bash

python3 $MEASURE_PY quality current

Shows a composite score (0-100) based on:

Stale reads (25%): Files read then later edited (re-read would be fresher)
Bloated results (25%): Large tool outputs never referenced again
Duplicates (15%): Repeated system reminders or injected content
Compaction depth (15%): Number of compactions (each = information loss)
Decision density (10%): Ratio of substantive exchanges to overhead
Agent efficiency (10%): Dispatch cost vs useful result size

Score ranges:

85-100: Excellent, clean session
70-84: Good, some bloat, smart compaction would help
50-69: Degraded, significant waste, /compact with checkpoint recommended
<50: Critical, heavy rot, consider /clear with checkpoint

Present the score and top issues. Recommend specific actions based on the findings.

4P: Smart Model Routing Instructions

Injects a managed model routing block into the project's CLAUDE.md based on actual usage patterns from the last 30 days.

bash

# Preview what would be injected
python3 $MEASURE_PY inject-routing --dry-run

# Inject (user must approve the diff first)
python3 $MEASURE_PY inject-routing

The block is inserted between  markers. It includes:

Current model usage percentages (Opus/Sonnet/Haiku split)
Task-to-model routing recommendations
Warnings if usage is heavily skewed (e.g., >70% Opus)

The block has a 48h TTL: if not refreshed within 48 hours, it auto-removes to prevent stale routing advice.

Always show the user the dry-run diff and get approval before injecting.

Optional: inject a passive coaching block with session-level insights:

bash

python3 $MEASURE_PY setup-coach-injection        # Inject COACH block
python3 $MEASURE_PY setup-coach-injection --uninstall  # Remove it

Measurement Tool: Additional Commands

Beyond the core report/snapshot/compare/dashboard commands, the measurement tool includes:

measure.py dashboard: Generates a standalone persistent dashboard at ~/.claude/_backups/token-optimizer/dashboard.html with Trends and Health tabs. Auto-regenerated by the SessionEnd hook.
measure.py setup-daemon: Installs a macOS launchd daemon serving the dashboard at http://localhost:24842/token-optimizer. Starts on login, restarts on crash. Remove with --uninstall.
measure.py trends [--days N] [--json]: Scans all JSONL session logs across projects. Shows which skills you actually use, subagent patterns, model mix, and cross-references against installed skills to surface unused ones. Default: last 30 days.
measure.py health: Detects running Claude Code sessions, checks their version against installed, flags stale/zombie processes, and shows automated Claude-related processes.
Both trends and health data appear as interactive tabs in the dashboard (standalone or full audit).

v2.0 Commands

measure.py quality [session-id|current]: Analyzes session content quality. Scores stale reads, bloated results, duplicates, compaction depth, decision density, agent efficiency. Returns composite 0-100 score with actionable breakdown.
measure.py setup-smart-compact [--dry-run] [--status] [--uninstall]: Installs/manages the Smart Compaction hook system (PreCompact capture + SessionStart restore + Stop/SessionEnd checkpoints). Use --dry-run to preview, --status to check, --uninstall to remove.
measure.py compact-capture: Called by PreCompact/Stop/SessionEnd hooks. Parses JSONL transcript, extracts decisions/files/errors/agent state, writes checkpoint to ~/.claude/token-optimizer/checkpoints/. Not intended for direct user invocation.
measure.py compact-restore: Called by SessionStart hook (matcher: compact). Reads most recent checkpoint and injects recovered context. Not intended for direct user invocation.
measure.py compact-instructions [--json]: Generates project-specific Compact Instructions based on CLAUDE.md and session patterns. Output is text the user adds to their .claude/settings.json compactInstructions field.
measure.py list-checkpoints [--cwd PATH] [--max-age MINUTES]: Lists session checkpoints with age, trigger type, and quality scores. Useful for debugging the smart compact system.

Phase 5: Verification

Read the Verification Agent prompt from references/agent-prompts.md.

Dispatch with model="haiku". It re-measures everything and calculates savings.

Present results:

[Optimization Complete]

SAVINGS ACHIEVED
- CLAUDE.md: -X tokens/msg
- MEMORY.md: -Y tokens/msg
- Skills: -Z tokens/msg
- Total: -W tokens/msg (V% reduction)

NEXT STEPS (Behavioral, ordered by ROI)
1. Default subagents to Haiku (60x cheaper than Opus, see Model Routing)
2. Use /compact at 50-70% context (quality degrades past 70%)
3. Use /clear between unrelated topics
4. Use Plan Mode (Shift+Tab x2) before complex tasks
5. Batch related requests into one message
6. Run /context periodically to check fill level
7. Run `measure.py trends` periodically to review usage patterns

Reference Files

Phase	Read
Phase 1-2	`references/agent-prompts.md`, `references/token-flow-architecture.md`
Phase 3	`references/optimization-checklist.md`
Phase 4	`references/implementation-playbook.md`, `examples/`
Phase 5	`references/agent-prompts.md`

Model Selection

Task	Model	Fallback	Why
CLAUDE.md, MEMORY.md, Skills, MCP auditors	`sonnet`	`haiku`	Judgment: content structure, semantic duplicates
Commands auditor	`haiku`	-	Data gathering: counting, presence checks
Settings & Advanced auditor	`sonnet`	`haiku`	Judgment: rules quality, settings tradeoffs, @imports analysis
Synthesis (Phase 2)	`opus`	`sonnet`	Cross-cutting prioritization across all findings
Orchestrator	Default	-	Coordination only
Verification (Phase 5)	`haiku`	-	Re-measurement

Error Handling

Agent timeout/failure: If an audit agent fails, note the gap and continue. Do not retry. The synthesis agent handles missing files gracefully.
Model unavailable: Fall back one tier: opus -> sonnet -> haiku. Log which model was actually used.
No CLAUDE.md found: Report 0 tokens, skip to skills audit.
No skills directory: Report 0 tokens, note as "fresh setup."
measure.py not found: Fall back to manual estimation (line count x 15 for prose, x 8 for YAML).
Coordination folder write failure: Abort and report the error. Do not proceed without audit storage.
Backup write failure: If ls "$BACKUP_DIR" shows 0 files after Phase 0 backup, warn user and ask whether to proceed without backup. Do not silently continue.
mktemp failure: If COORD_PATH directory does not exist after creation, print error and abort. Check /tmp permissions.
Synthesis agent failure: If analysis/optimization-plan.md is missing or empty after Phase 2, present raw audit files to user instead. Do not proceed to Phase 4 blindly.
Verification agent failure: If Phase 5 agent fails, fall back to running measure.py snapshot after + measure.py compare directly in the shell.
Snapshot file corrupt: If compare fails with a JSON error, re-run measure.py snapshot [label] to regenerate the corrupt file.
Stale snapshot warning: If the "before" snapshot is >24h old when running compare, a warning is printed. Consider re-taking it for accurate results.

Restoring Backups

If something goes wrong, restore from the backup created in Phase 0:

bash

# Find your most recent backup
ls -ltd ~/.claude/_backups/token-optimizer-* | head -5

# Restore specific files (replace TIMESTAMP with your backup folder name)
BACKUP="$HOME/.claude/_backups/token-optimizer-TIMESTAMP"
cp "$BACKUP/CLAUDE.md" ~/.claude/CLAUDE.md
cp "$BACKUP/settings.json" ~/.claude/settings.json
cp -r "$BACKUP/commands" ~/.claude/commands
# MEMORY.md files have the project name in the filename
for f in "$BACKUP"/MEMORY-*.md; do
  [ -f "$f" ] || continue
  projname="${f##*/MEMORY-}"; projname="${projname%.md}"
  # Guard against path traversal in crafted backup filenames
  case "$projname" in *..* | */* ) echo "[Warning] Skipping suspicious backup: $f"; continue ;; esac
  [ -d "$HOME/.claude/projects/${projname}/memory" ] || continue
  cp "$f" "$HOME/.claude/projects/${projname}/memory/MEMORY.md"
done

Backups are never automatically deleted. They accumulate in ~/.claude/_backups/.

v3.1 Features

Efficiency Grading (S/A/B/C/D/F)

All quality scores now include a letter grade: S (90-100), A (80-89), B (70-79), C (55-69), D (40-54), F (0-39). Shown in status line (ContextQ:A(82)), dashboard badges, coach tab, and CLI output.

Git-Aware Context Suggestions

New command: python3 $MEASURE_PY git-context [--json] Analyzes git diff/status to suggest which files should be in context: modified files, test companions, co-changed files (from last 50 commits), and import chains.

PreToolUse Read-Cache

Detects redundant file reads and optionally blocks them with structural digests.

Default ON (warn mode). Opt out: TOKEN_OPTIMIZER_READ_CACHE=0 or config {"read_cache_enabled": false}. Modes: TOKEN_OPTIMIZER_READ_CACHE_MODE=warn (default, suggests) or =block (prevents re-read). Decisions log: Per-session files in ~/.claude/token-optimizer/read-cache/decisions/ Stats: python3 $MEASURE_PY read-cache-stats --session SESSION_ID Clear: python3 $MEASURE_PY read-cache-clear

.contextignore

Create .contextignore in project root or ~/.claude/.contextignore (global) to block files from being read. Uses gitignore-style glob patterns (fnmatch). Hard-blocks regardless of read-cache mode.

Core Rules

Quantify everything (X tokens, Y%)
Create backups before any changes (~/.claude/_backups/)
Ask user before implementing
Never delete files, always archive
Check dependencies before archiving (skills, MCP servers, deny rules can break other tools)
Warn about side effects: deny rules block ALL tools, MCP removal breaks dependent skills, skill archival breaks @imports
Prefer project-level deny rules over global (easier to debug, less blast radius)
Use appropriate models (with fallbacks) for each task
Show before/after diffs
Frame savings as context budget (% of context window), not dollar amounts

Maintainer

alexgreensh Core maintainer

Source details

Full Name: alexgreensh/token-optimizer
Branch: main
Path in repo: skills/token-optimizer
License: Other
Topics: claude-code context-engineering token-optimization claude-code-skill claude-plugin agentskills context-window token-usage ghost-tokens token-optimizer

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

alexgreensh/token-optimizer

token-optimizer

Audit your OpenClaw setup for token waste, context bloat, and cost optimization opportunities

263 24

Explore

alexgreensh/token-optimizer

fleet-auditor

Audit token waste across agent systems (Claude Code, OpenClaw, Hermes, OpenCode). Detect idle burns, model misrouting, and config bloat with dollar savings.

263 24

Explore

alexgreensh/token-optimizer

token-coach

Context window coach. Proactive guidance for token-efficient Claude Code projects, multi-agent systems, and skill architecture.

263 24

Explore

alexgreensh/token-optimizer

token-dashboard

Open the Token Optimizer dashboard. Collects latest session data, regenerates the dashboard, and opens it in your browser.

263 24

Explore

davila7/claude-code-templates

verl-rl-training

Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexible infrastructure backends.

23,776 2,298

Explore

davila7/claude-code-templates

openrlhf-training

High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2× faster than DeepSpeedChat with distributed architecture and GPU resource sharing.

23,776 2,298

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Token Optimizer: See Where Your Context Window Goes. Get It Back.

Phase 0: Initialize

Phase 1: Quick Audit (Parallel Agents)

Phase 2: Analysis (Synthesis Agent)

Phase 3: Present Findings

Phase 4: Implementation

4M: Smart Compaction Setup

4N: Context Quality Check

4P: Smart Model Routing Instructions

Measurement Tool: Additional Commands

v2.0 Commands

Phase 5: Verification

Reference Files

Model Selection

Error Handling

Restoring Backups

v3.1 Features

Efficiency Grading (S/A/B/C/D/F)

Git-Aware Context Suggestions

PreToolUse Read-Cache

.contextignore

Core Rules

Recommended Agent Skills

token-optimizer

fleet-auditor

token-coach

token-dashboard

verl-rl-training

openrlhf-training