Agent skill

gen-test-plan

Analyze repo, detect stack, trace changes to user-facing entry points, generate YAML test plan

View SKILL.md on GitHub Repository

Stars 44

Forks 5

Install this agent skill to your Project

npx add-skill https://github.com/existential-birds/beagle/tree/main/plugins/beagle-testing/skills/gen-test-plan

SKILL.md

Generate Test Plan

Analyze the repository's tech stack, branch changes vs default, and generate an executable YAML test plan focused on user-facing impact.

Arguments

--base <branch>: Base branch to diff against (default: main)
Path: Target directory (default: current working directory)

Step 1: Gather Repository Context

bash

# Get current branch
git rev-parse --abbrev-ref HEAD

# Get default base branch (try origin/main, then origin/master)
git rev-parse --verify origin/main >/dev/null 2>&1 && echo "main" || echo "master"

# Get changed files vs base
git diff --name-only $(git merge-base HEAD origin/main)..HEAD

# Get commit messages for context
git log --oneline $(git merge-base HEAD origin/main)..HEAD

Capture:

current_branch: Branch name
base_branch: Default branch to compare against
changed_files: List of modified files
commit_messages: What the PR is about

Step 2: Detect Tech Stack

See references/stack-discovery.md for stack detection commands, entrypoint discovery, port discovery, and trace rules.

Step 3: Discover User-Facing Entry Points

Grep for route definitions based on detected stack:

Python (FastAPI/Flask):

bash

grep -rn "@app\.\(get\|post\|put\|delete\|patch\)" --include="*.py" | head -20
grep -rn "@router\.\(get\|post\|put\|delete\|patch\)" --include="*.py" | head -20

Node.js (Express/Fastify):

bash

grep -rn "app\.\(get\|post\|put\|delete\)" --include="*.ts" --include="*.js" | head -20
grep -rn "router\.\(get\|post\|put\|delete\)" --include="*.ts" --include="*.js" | head -20

React Router:

bash

grep -rn "createBrowserRouter\|<Route\|path=" --include="*.tsx" --include="*.jsx" | head -20

Go (net/http, gin, chi):

bash

grep -rn "http.HandleFunc\|r.GET\|r.POST\|router.Get\|router.Post" --include="*.go" | head -20

Build a map of:

API endpoints: method + path + file:line
UI routes: path + component + file:line

Step 4: Trace Changes to Entry Points

For each changed file, determine if it affects user-facing functionality:

Direct entry point change — File contains route definitions
Import chain analysis — Find what imports the changed file and trace up to entry points
Architecture-aware tracing — Read the project's CLAUDE.md, README, or architecture docs to understand data flow and module relationships, rather than relying solely on grep
Document the trace path in test context

Import Chain Analysis by Ecosystem

bash

# Python — from/import
grep -rn "from.*<module>\|import.*<module>" --include="*.py"

# TypeScript/JavaScript — import/require
grep -rn "from.*<module>\|require.*<module>" --include="*.ts" --include="*.tsx" --include="*.js" --include="*.jsx"

# Elixir — alias/import/use
grep -rn "alias.*<Module>\|import.*<Module>\|use.*<Module>" --include="*.ex" --include="*.exs"

# Go — package references
grep -rn "<package>\." --include="*.go"

If the ecosystem is not covered above, or grep results are inconclusive, read the project's CLAUDE.md, README, or architecture docs to understand the module graph and trace the data flow from changed files to user-facing entry points.

Classify Affected Entry Points

After identifying all affected entry points, classify each one:

Category	Description	Examples	Priority
Core functionality	Entry points where the feature does its actual work for the end user	Chat endpoint, API action, data processing pipeline, generation flow	High — test first
Configuration/admin	Entry points where the feature is set up, toggled, or configured	Settings page, admin dashboard, preference toggles, dropdown selections	Lower — test after core

Classification rules:

Ask: "If a user wanted to use this feature (not configure it), which entry point would they interact with?" — that's core functionality
A settings page that adds a new dropdown option is configuration; the endpoint that actually uses that option is core functionality
The same changed file (e.g., a new provider module) may affect both a settings page and a functional endpoint — both must be traced

Requirement: At least one test must target a core functionality entry point before generating configuration/admin tests. If no core functionality entry point can be identified, explicitly document why and flag this for manual review.

Output: For each affected entry point, document:

Which changed files affect it
The import/dependency chain
Classification: Core functionality or Configuration/admin
Why this entry point needs testing

Step 5: Generate Test Cases

See references/test-case-generation.md for the detailed API/browser templates, prioritization rules, and test-case guidelines.

Step 6: Write YAML Test Plan

Create the test plan file:

bash

mkdir -p docs/testing

Write to docs/testing/test-plan.yaml:

yaml

version: 1
metadata:
  branch: <current_branch>
  base: <base_branch>
  generated: <ISO timestamp>
  changes_summary: |
    <Summary of what this PR changes based on commit messages and diff>

setup:
  stack:
    - type: <node|python|go|docker>
      package_manager: <pnpm|npm|yarn|uv|poetry|none>
  commands:
    - <install command>
    - <run command>
  health_checks:
    - url: http://localhost:<port>/health
      timeout: 30
    - url: http://localhost:<frontend_port>
      timeout: 30

tests:
  # API test example:
  - id: TC-01
    name: <API test name>
    context: |
      <Why this test exists, which changes affect it>
    steps:
      - action: curl
        method: GET
        url: http://localhost:<port>/<path>
    expected: |
      <Expected behavior in natural language>

  # Browser test example (always use agent-browser CLI commands):
  - id: TC-02
    name: <UI test name>
    context: |
      <Why this test exists, which changes affect it>
    steps:
      - run: agent-browser open http://localhost:<port>/<path>
      - run: agent-browser snapshot -i
      - run: agent-browser click @<ref>
      - run: agent-browser snapshot -i
      - run: agent-browser screenshot evidence/tc-02.png
    expected: |
      <Expected behavior in natural language>
    evidence:
      screenshot: evidence/tc-02.png

Step 7: Report Summary

After generating the test plan:

markdown

## Test Plan Generated

**File:** `docs/testing/test-plan.yaml`
**Branch:** <current_branch> → <base_branch>

### Detected Stack

| Component | Type | Port |
|-----------|------|------|
| <component> | <type> | <port> |

### Tests Generated

| ID | Name | Type | Affected By |
|----|------|------|-------------|
| TC-01 | <name> | curl/browser | <files> |

### Entry Point Coverage

- **Covered:** <N> entry points with tests
- **Unchanged:** <M> entry points not affected by this PR

### Next Steps

1. Review the generated test plan at `docs/testing/test-plan.yaml`
2. Adjust test values and expectations as needed
3. Run tests with:

/beagle-testing:run-test-plan

Step 8: Verification

Before completing:

bash

# Verify file was created
ls -la docs/testing/test-plan.yaml

# Validate YAML syntax
python3 -c "import yaml; yaml.safe_load(open('docs/testing/test-plan.yaml'))" && echo "Valid YAML"

# Check required fields
grep -E "^version:|^metadata:|^setup:|^tests:" docs/testing/test-plan.yaml

Verification Checklist:

Test plan file created at docs/testing/test-plan.yaml
YAML is syntactically valid
At least one test case generated
Setup commands match detected stack
Health checks point to valid endpoints
Each test has id, name, steps, and expected fields
Behavioral coverage: At least one test exercises the primary behavioral change described in changes_summary. Re-read the changes_summary and commit messages — if they describe a capability (e.g., "adds Claude Code as a new LLM provider") but no test invokes that capability (e.g., sends a message through the provider), the plan fails verification. Add the missing core functionality test before completing.
No config-only plans: If all tests target configuration/admin entry points and zero tests target core functionality entry points, the plan is incomplete. Go back to Step 4, identify the core functionality entry points, and add tests for them.

Rules

Always create docs/testing/ directory if it doesn't exist
Generate at least one test per affected entry point
Include context explaining why each test matters (trace from changes)
Use natural language for expected field (agent will interpret)
Default to conservative port detection (8000 for API, 5173/3000 for frontend)
Browser automation steps MUST use agent-browser CLI commands (e.g., agent-browser open, agent-browser snapshot -i, agent-browser click @ref) — never use abstract action syntax
Always agent-browser snapshot -i before interacting with elements and after navigation/DOM changes
Use agent-browser screenshot <path> to capture evidence for browser tests
Use ${ENV_VAR} syntax for secrets, never hardcode credentials
If no user-facing changes detected, explain why and suggest manual verification

Maintainer

existential-birds Core maintainer

Source details

Full Name: existential-birds/beagle
Branch: main
Path in repo: plugins/beagle-testing/skills/gen-test-plan
License: Apache License 2.0
Topics: claude-code agent-skills typescript ai-agents developer-tools python react golang code-review rust langgraph swift elixir documentation swiftui fastapi axum phoenix pydantic-ai tokio

Featured Tools

Join Our Newsletter

Reviews PostgreSQL code for indexing strategies, JSONB operations, connection pooling, and transaction safety. Use when reviewing SQL queries, database schemas, JSONB usage, or connection management.

44 5

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Generate Test Plan

Arguments

Step 1: Gather Repository Context

Step 2: Detect Tech Stack

Step 3: Discover User-Facing Entry Points

Step 4: Trace Changes to Entry Points

Import Chain Analysis by Ecosystem

Classify Affected Entry Points

Step 5: Generate Test Cases

Step 6: Write YAML Test Plan

Step 7: Report Summary

Step 8: Verification

Rules

Recommended Agent Skills

review-python

review-verification-protocol

sqlalchemy-code-review

fastapi-code-review

pytest-code-review

postgres-code-review