Sponsored by

Find leads on Reddit on auto pilot

Agent skills
analyze-results

Agent skill

analyze-results

Analyze ML experiment results, compute statistics, generate comparison tables and insights. Use when user says "analyze results", "compare", or needs to interpret experimental data.

View SKILL.md on GitHub Repository

Stars 6,306

Forks 582

Install this agent skill to your Project

npx add-skill https://github.com/wanshuiyin/Auto-claude-code-research-in-sleep/tree/main/skills/analyze-results

SKILL.md

Analyze Experiment Results

Analyze: $ARGUMENTS

Workflow

Step 1: Locate Results

Find all relevant JSON/CSV result files:

Check figures/, results/, or project-specific output directories
Parse JSON results into structured data

Step 2: Build Comparison Table

Organize results by:

Independent variables: model type, hyperparameters, data config
Dependent variables: primary metric (e.g., perplexity, accuracy, loss), secondary metrics
Delta vs baseline: always compute relative improvement

Step 3: Statistical Analysis

If multiple seeds: report mean +/- std, check reproducibility
If sweeping a parameter: identify trends (monotonic, U-shaped, plateau)
Flag outliers or suspicious results

Step 4: Generate Insights

For each finding, structure as:

Observation: what the data shows (with numbers)
Interpretation: why this might be happening
Implication: what this means for the research question
Next step: what experiment would test the interpretation

Step 5: Update Documentation

If findings are significant:

Propose updates to project notes or experiment reports
Draft a concise finding statement (1-2 sentences)

Output Format

Always include:

Raw data table
Key findings (numbered, concise)
Suggested next experiments (if any)

Maintainer

wanshuiyin Core maintainer

Source details

Full Name: wanshuiyin/Auto-claude-code-research-in-sleep
Branch: main
Path in repo: skills/analyze-results
License: MIT License
Topics: claude-code claude claude-code-skills mcp mcp-server llm codex gpt openai ai-tools machine-learning ai-research autonomous-agent deep-learning paper-review research-automation paper-writing aris idea-generation ml-research

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

wanshuiyin/Auto-claude-code-research-in-sleep

ablation-planner

Use when main results pass result-to-claim (claim_supported=yes or partial) and ablation studies are needed for paper submission. Codex designs ablations from a reviewer's perspective, CC reviews feasibility and implements.

wanshuiyin/Auto-claude-code-research-in-sleep

paper-plan

Generate a structured paper outline from review conclusions and experiment results. Use when user says "写大纲", "paper outline", "plan the paper", "论文规划", or wants to create a paper plan before writing.

wanshuiyin/Auto-claude-code-research-in-sleep

idea-discovery-robot

Workflow 1 adaptation for robotics and embodied AI. Orchestrates robotics-aware literature survey, idea generation, novelty check, and critical review to go from a broad robotics direction to benchmark-grounded, simulation-first ideas. Use when user says "robotics idea discovery", "机器人找idea", "embodied AI idea", "机器人方向探索", "sim2real 选题", or wants ideas for manipulation, locomotion, navigation, drones, humanoids, or general robot learning.

wanshuiyin/Auto-claude-code-research-in-sleep

training-check

Periodically check WandB metrics during training to catch problems early (NaN, loss divergence, idle GPUs). Avoids wasting GPU hours on broken runs. Use when training is running and you want automated health checks.

wanshuiyin/Auto-claude-code-research-in-sleep

paper-plan

Generate a structured paper outline from review conclusions and experiment results. Use when user says "写大纲", "paper outline", "plan the paper", "论文规划", or wants to create a paper plan before writing.

wanshuiyin/Auto-claude-code-research-in-sleep

idea-discovery-robot

Workflow 1 adaptation for robotics and embodied AI. Orchestrates robotics-aware literature survey, idea generation, novelty check, and critical review to go from a broad robotics direction to benchmark-grounded, simulation-first ideas. Use when user says \"robotics idea discovery\", \"机器人找idea\", \"embodied AI idea\", \"机器人方向探索\", \"sim2real 选题\", or wants ideas for manipulation, locomotion, navigation, drones, humanoids, or general robot learning.

Didn't find tool you were looking for?