Agent skill
ironbee-analyze
Run IronBee session analysis with semantic interpretation of verification metrics, issues, and fixes
Install this agent skill to your Project
npx add-skill https://github.com/ironbee-ai/ironbee-cli/tree/main/src/clients/cursor/commands/ironbee-analyze
SKILL.md
IronBee Session Analysis
Run ironbee analyze --json --detailed via terminal to get raw session metrics, then provide a semantic analysis.
Steps
- Run
ironbee analyze --json --detailed(all sessions) orironbee analyze <session-id> --json(specific session) via terminal - Parse the JSON output
- Analyze and interpret the results:
Time Analysis
- Is the coding:verification:fix ratio healthy? (High fix% = inefficient coding)
- Is time to first verification reasonable? (Too long = agent coding without testing)
Verification Quality
- What is the first-pass success rate? Why might it be low?
- Are enough pages and checks being tested? (Low numbers = superficial testing)
Code Changes
- Which files are hot files? Are they expected to change frequently?
- Which files are problematic? What might be causing repeated fixes?
- Is there edit churn? (Same file in multiple fix cycles = root cause not resolved)
Fix Effectiveness
- Are fixes working? (Low fix success rate = agent not understanding the problem)
- Is re-fail rate high? What patterns do you see in the issues?
Semantic Analysis (LLM-powered)
- Group similar issues from verdict data into categories (e.g., "contrast issues", "event handler bugs", "API errors")
- Identify recurring patterns across sessions
- Suggest root causes for frequently failing files
Scoring
- Interpret the efficiency, quality, and confidence scores
- Compare with project averages if cross-session data is available
Output Format
Provide a structured report with:
- Executive summary (1-2 sentences)
- Key findings (bullet points)
- Recommendations (actionable next steps)
- Risk areas (files/patterns to watch)
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
ironbee-verify
Trigger browser verification of code changes. Args: (default), full, visual, functional
verl-rl-training
Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with flexible infrastructure backends.
openrlhf-training
High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2× faster than DeepSpeedChat with distributed architecture and GPU resource sharing.
gguf-quantization
GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible quantization from 2-8 bit without GPU requirements.
Claude Code Guide
Master guide for using Claude Code effectively. Includes configuration templates, prompting strategies "Thinking" keywords, debugging techniques, and best practices for interacting with the agent.
qdrant-vector-search
High-performance vector similarity search engine for RAG and semantic search. Use when building production RAG systems requiring fast nearest neighbor search, hybrid search with filtering, or scalable vector storage with Rust-powered performance.
Didn't find tool you were looking for?