Agent skill
evaluate-sdlc-layers
Evaluate and iterate on the SDLC Layer Separation Architecture implementation. Runs validation checks (cross-references, doc completeness, layer metadata, integration points), produces a findings report, and supports iterative fixes. Use when validating first-pass implementation, before claiming layer work is complete, or when improving layer docs/schema.
Install this agent skill to your Project
npx add-skill https://github.com/Jamie-BitFlight/claude_skills/tree/main/.claude/skills/evaluate-sdlc-layers
SKILL.md
Evaluate SDLC Layers
Systematically evaluate the SDLC Layer Separation Architecture implementation and support iterative improvement. Treats the implementation as first-pass until validated.
Arguments
--dry-run— Run all checks, produce report only. Do not apply fixes.--fix— After evaluation, apply safe fixes for broken references, missing metadata, or obvious gaps. Report what was changed.- (no args) — Evaluate and produce report; offer to fix or delegate fixes.
Evaluation Checklist
Run each check and record PASS / FAIL / SKIP with evidence.
1. Cross-Reference Validation
For each linked path in plugins/development-harness/docs/sdlc-layers/ and related docs:
-
sam-definition.md— exists at.claude/skills/work-backlog-item/references/sam-definition.md -
plugins/development-harness/CLAUDE.md— exists -
stateless-agent-methodology/research/arl/PROVENANCE.md— exists (sibling repo or configured path) - Layer 0 docs →
TASK_FILE_FORMAT.md— exists atplugins/development-harness/docs/TASK_FILE_FORMAT.md - Layer 1 →
language-manifest-schema.md,role-resolution-protocol.md— exist in development-harness - Layer 2 →
plugins/development-harness/docs/layer-2/— exists with README, schema, pilot profiles
Evidence: List each path checked and result (exists / 404 / wrong content).
2. Doc Completeness
- Layer 0: All 9 docs present (README, sam-pipeline, arl-touchpoints, artifact-conventions, rt-ica-gate, verification-protocol, task-file-format, evidence-discipline, orchestrator-discipline)
- Layer 1: All 6 docs present (README, layer-1-overview, language-manifest-template, linting-discovery-protocol, workflow-pattern-taxonomy, harness-role-mapping)
- Layer 2: README, layer-2-overview, stack-profile-schema, stack-profile-template; pilot profiles python-fastapi, python-cli
- ARL: arl-meta-layer.md, arl-human-probing-design.md
Evidence: Glob or Read results for each expected file.
3. Knowledge-Explorer Layer Filter
-
uv run research/knowledge-explorer.py list --layer 0— returns entries withlayer: "0" -
uv run research/knowledge-explorer.py list --layer 1— returns entries withlayer: "1" -
uv run research/knowledge-explorer.py list --layer 2— returns entries withlayer: "2" - Entries without layer metadata are excluded when
--layeris used (expected)
Evidence: Paste command output for each.
4. Research Entry Layer Metadata
-
evaluation-testing/harness-engineering-openai.md— haslayer: "0" -
api-frameworks/fastapi.md,api-frameworks/tornado.md— havelayer: "2",language,stack -
developer-tools/copier-astral.md— haslayer: "1"(or2if stack-scaffold) -
research/README.md— has "Layer Mapping" section
Evidence: Grep for layer: in frontmatter of each.
5. Integration Points
-
work-backlog-itemSKILL — documents--language,--stack; references layer docs -
groom-backlog-itemSKILL — documents ARL human-probing integration; references arl-human-probing-design -
language-manifest-schema.md— has "Inherits from Layer 0";typecheck: (none); Conventions schema -
role-resolution-protocol.md— has "Layer 0 gates apply before role resolution" -
plugins/development-harness/CLAUDE.md— references layer model
Evidence: Grep or Read for key phrases.
6. Consistency with Plan
- Plan deliverables (from attached plan) — compare File and Directory Changes table to actual files
- Dependency order — Layer 0 → Layer 1 → Layer 2 → Research → SAM/ARL → ARL probing → work-backlog-item
Evidence: List any plan items not yet implemented or diverged.
Output Format
Produce a structured report:
## SDLC Layer Evaluation Report
Date: {YYYY-MM-DD}
### Summary
- Cross-Reference: {PASS|FAIL|PARTIAL} — {brief}
- Doc Completeness: {PASS|FAIL|PARTIAL}
- Knowledge-Explorer: {PASS|FAIL|PARTIAL}
- Research Metadata: {PASS|FAIL|PARTIAL}
- Integration Points: {PASS|FAIL|PARTIAL}
- Plan Consistency: {PASS|FAIL|PARTIAL}
### Findings
1. [Category] {finding} — {suggested fix}
2. ...
### Recommended Actions
- [ ] {action 1}
- [ ] {action 2}
Iteration
After evaluation:
- If
--fix: Apply safe fixes (broken paths, missing frontmatter fields, obvious typos). Report each change. - If no
--fix: Present findings; offer to create backlog items or apply fixes. - Re-run: After fixes, re-run evaluation to confirm improvements.
Experiments
Flow experiments and learnings live in sam-flow-experiments. Clone via SSH: git clone git@github.com:Jamie-BitFlight/sam-flow-experiments.git. When iterating, consider running experiments against concept fixtures to validate changes.
References
- plugins/development-harness/docs/sdlc-layers/
- verify skill — evidence discipline
- groom-backlog-item — orchestration pattern
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
ccc
This skill should be used when code search is needed (whether explicitly requested or as part of completing a task), when indexing the codebase after changes, or when the user asks about ccc, cocoindex-code, or the codebase index. Trigger phrases include 'search the codebase', 'find code related to', 'update the index', 'ccc', 'cocoindex-code'.
agent-browser
Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.
delegate
Quick delegation template for sub-agent prompts. Use when assigning work to a sub-agent, before invoking the Agent tool, or when preparing prompts for specialized agents. Provides the WHERE-WHAT-WHY framework. For comprehensive delegation guidance, activate the agent-orchestration how-to-delegate skill.
swarm-spawning
Spawn agents and teammates in Claude Code swarms. Use when choosing between subagents vs teammates, selecting agent types (Explore, Plan, general-purpose, plugin agents), configuring spawn backends (in-process, tmux, iterm2), or setting environment variables for spawned agents.
knowledge-explorer
Manage the research/ knowledge base (KB) of tool and library research entries. Use when browsing KB topics, adding new research entries, updating existing entries with dated revisions, fetching GitHub repo metadata into a draft KB entry, or migrating old-format entries to skill-spec frontmatter. Triggers on tasks like "what do we have on X", "add this to the KB", "update the KB entry for Y", "fetch github info for owner/repo", or "migrate old entries".
design-anti-patterns
Enforce anti-AI UI design rules based on the Uncodixfy methodology. Use when generating HTML, CSS, React, Vue, Svelte, or any frontend UI code. Prevents "Codex UI" — the generic AI aesthetic of soft gradients, floating panels, oversized rounded corners, glassmorphism, hero sections in dashboards, and decorative copy. Applies constraints from Linear/Raycast/Stripe/GitHub design philosophy: functional, honest, human-designed interfaces. Triggers on: UI generation, dashboard building, frontend component creation, CSS styling, landing page design, or any task producing visual interface code.
Didn't find tool you were looking for?