Agent skill

spec-implement

Spec implementation phase - TDD loop for each task in the plan

View SKILL.md on GitHub Repository

Stars 1,637

Forks 137

Install this agent skill to your Project

npx add-skill https://github.com/maxritter/pilot-shell/tree/main/pilot/skills/spec-implement

SKILL.md

/spec-implement - Implementation Phase

Phase 2 of the /spec workflow. Reads approved plan, implements each task using TDD (Red → Green → Refactor).

Input: Approved plan file (Approved: Yes) Output: All tasks completed, status → COMPLETE Next: Verify phase (type-aware: spec-verify for features, spec-bugfix-verify for bugfixes)

⛔ Critical Constraints

NO sub-agents — all tasks execute sequentially in main context
TDD is MANDATORY — no production code without failing test first
NEVER SKIP TASKS — every task must be fully implemented, no "MVP scope" exceptions
Quality over speed — never rush due to context pressure. Context warnings are informational. Finish current task with full quality — auto-compaction handles the rest.
Plan file is source of truth — re-read after auto-compaction, don't rely on conversation memory
NEVER stop during implementation — the stop guard blocks premature exits. If blocked: your very next action must be a tool call (TaskList, Read plan, or code change). After user interruptions or "Continue" messages: re-read the plan and resume from the current task. Never produce text-only responses when work remains.

Feedback Loop Awareness

This phase may be called multiple times:

spec-implement → spec-verify → issues found → spec-implement → ...

When called after verification: read plan, check Iterations field, report "Starting Iteration N...", focus on uncompleted [ ] tasks (look for [MISSING] markers from verification).

Step 2.1: Read Plan & Gather Context

Read the COMPLETE plan — understand architecture and design
Summarize understanding — demonstrate comprehension
Check current state: git status --short, git diff --name-only, plan progress ([x] vs [ ])

Research tools during implementation: Context7 (library docs), Probe CLI probe search (find patterns), probe extract (extract code blocks), CodeGraph (codegraph_callers/codegraph_impact for call tracing and impact analysis), grep-mcp (production examples).

Step 2.1b: Detect or Resume Worktree (Conditional)

Read Worktree: header from plan. If No or missing: skip to Step 2.2.

If Worktree: Yes:

Extract plan slug: docs/plans/2026-02-09-add-auth.md → add-auth
Detect: ~/.pilot/bin/pilot worktree detect --json <plan_slug>
If found: cd to the worktree path
If not found: Create as fallback:
bash
```
~/.pilot/bin/pilot worktree create --json <plan_slug>
```
Copy plan file into worktree if needed. cd to worktree path.
If creation fails (old git): continue without worktree.
Verify: git branch --show-current should show spec/<plan_slug>

All subsequent work happens inside the worktree directory.

Step 2.2: Set Up Task List (MANDATORY)

Check existing: TaskList — if tasks exist from prior session, resume (don't recreate)
If empty: Create one task per uncompleted [ ] plan task:
```
TaskCreate(subject="Task N: <title>", description="<objective>", activeForm="Implementing <desc>")
```
Set dependencies: TaskUpdate(taskId="...", addBlockedBy=["..."])
Skip [x] (already completed) tasks

Step 2.3: TDD Loop

For EVERY task:

Read plan's implementation steps — list files to create/modify/delete
Call chain analysis (MANDATORY): For each function being modified, run trace_call_path(function_name, direction="both", depth=2). Discover exact names first with search_graph(name_pattern="...") if needed. This traces the actual call graph — Probe text search is not a substitute.
Mark in_progress: TaskUpdate(taskId, status="in_progress")
TDD Flow:
- RED: Write failing test → verify it fails (feature missing, not syntax error)
- GREEN: Implement minimal code to pass
- REFACTOR: Improve while keeping tests green
- Skip TDD for: docs, config, IaC, formatting-only changes
- Surprise discovery: If something contradicts how you expected it to work, check plan's ## Assumptions section — identify which task numbers are affected and note the invalidated assumption in the plan before continuing.
Verify tests pass — run test suite
Run actual program — use plan's Runtime Environment. Check port: lsof -i :<port>. For browser verification: prefer Claude Code Chrome if available, otherwise agent-browser with --session "${PILOT_SESSION_ID:-default}" (see browser-automation.md)
Check diagnostics — zero errors
Validate Definition of Done — all criteria from plan
Self-review: Completeness? Names clear? YAGNI? Tests verify behavior not implementation?
Performance: Is any expensive work (parsing, transforming, I/O) running on a hot path without caching or memoization? Are heavy dependencies imported fully when a lighter/tree-shaken alternative exists? Does repeated invocation (polling, re-render, request loop) redo work when input hasn't changed?
Per-task commit (worktree only): git add <files> && git commit -m "{type}(spec): {task-name}"
Mark completed: TaskUpdate(taskId, status="completed")
Update plan file immediately (Step 2.4)

Step 2.4: Update Plan After EACH Task

⛔ NON-NEGOTIABLE. After each task:

Change [ ] → [x] for that task
Update Completed/Remaining counts
Do NOT proceed to next task until checkbox updated

Step 2.5: All Tasks Complete → Verification

Check diagnostics + run test suite
For migrations: Feature parity check against old code. If features missing: add tasks, do NOT mark complete.
Set Status: COMPLETE in plan
Register: ~/.pilot/bin/pilot register-plan "<plan_path>" "COMPLETE" 2>/dev/null || true
Read Type: field → Bugfix: Skill(skill='spec-bugfix-verify', args='<plan-path>') | Otherwise: Skill(skill='spec-verify', args='<plan-path>')

Migration/Refactoring Additions

Before starting: Locate Feature Inventory in plan. If missing: STOP. Verify all features mapped.

During each migration task: Read old files, create checklist of functions/behaviors, verify each exists in new code, test with same inputs.

Red flags (STOP): Feature Inventory missing, old functions not in any task, "Out of Scope" items that should be migrated, tests pass but functionality missing vs old code.

ARGUMENTS: $ARGUMENTS

Maintainer

maxritter Core maintainer

Source details

Full Name: maxritter/pilot-shell
Branch: main
Path in repo: pilot/skills/spec-implement
License: Other
Topics: claude-code anthropic anthropic-claude claude ai-agents ai-coding claudecode model-context-protocol claude-skills claude-ai ai-tools ai-assistant ai-engineering spec-driven-development ai-coding-tools claude-context

Featured Tools

Join Our Newsletter

Spec planning phase - explore codebase, design plan, get approval

1,637 137

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

/spec-implement - Implementation Phase

⛔ Critical Constraints

Feedback Loop Awareness

Step 2.1: Read Plan & Gather Context

Step 2.1b: Detect or Resume Worktree (Conditional)

Step 2.2: Set Up Task List (MANDATORY)

Step 2.3: TDD Loop

Step 2.4: Update Plan After EACH Task

Step 2.5: All Tasks Complete → Verification

Migration/Refactoring Additions

Recommended Agent Skills

setup-rules

spec-bugfix-verify

spec-verify

create-skill

spec-bugfix-plan

spec-plan