Agent skill
treat
Prune bloated session with a prescription. Removes progress ticks, stale reads, duplicate content, and more.
Install this agent skill to your Project
npx add-skill https://github.com/Ruya-AI/cozempic/tree/main/plugin/skills/treat
SKILL.md
Apply a pruning prescription to the current session. Default is standard if no argument given.
Steps
-
Diagnose first — show the user what they're working with:
bashcozempic current --diagnose -
Dry-run the treatment — show savings without applying:
bashcozempic treat current -rx $ARGUMENTSIf no argument was provided, use
standard:bashcozempic treat current -rx standard -
Show results — present the dry-run output including token savings (the
Tokens:line). Always surface both byte and token savings. -
Ask confirmation — use AskUserQuestion to confirm before applying.
-
Apply on confirmation:
bashcozempic treat current -rx $ARGUMENTS --execute -
Tell the user: "Treatment applied. A backup was created automatically. To resume with the pruned session, exit and run
claude --resume."
Prescriptions
| Rx | Strategies | Typical Savings |
|---|---|---|
gentle |
progress-collapse, file-history-dedup, metadata-strip | 40-55% |
standard |
gentle + thinking-blocks, tool-output-trim, stale-reads, system-reminder-dedup | 50-70% |
aggressive |
standard + error-retry-collapse, document-dedup, mega-block-trim, envelope-strip | 70-95% |
Safety
- Always dry-run first — never execute without showing the user what will change
- Backups are automatic (timestamped .bak files)
- Never touches uuid/parentUuid — conversation DAG stays intact
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
guard
Protect Claude Code sessions from context overflow by running a background daemon that monitors session size and auto-prunes before compaction hits. Use when the user says "guard", "protect session", "context getting long", "prevent compaction", "session management", or is running agent teams that need continuous context protection.
reload
Treat the current session and auto-resume in a new terminal window.
diagnose
Analyze Claude Code session bloat — shows token count, context usage %, and bloat breakdown. Use when the user asks about session size, context usage, or when you notice the context window is getting full.
doctor
Run health checks on Claude Code configuration and sessions. Use when troubleshooting Claude Code issues.
verl-rl-training
Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexible infrastructure backends.
openrlhf-training
High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2× faster than DeepSpeedChat with distributed architecture and GPU resource sharing.
Didn't find tool you were looking for?