Agent skills
approach-evaluation

Agent skill

approach-evaluation

Research industry standards and best practices, identify viable approaches for a given technical or architectural problem, and produce a structured factual comparison against project-specific constraints. Reports options — does not decide.

View SKILL.md on GitHub Repository

Stars 123

Forks 27

Install this agent skill to your Project

npx add-skill https://github.com/Fr-e-d/GAAI-framework/tree/main/.gaai/core/skills/cross/approach-evaluation

Metadata

Additional technical details for this skill

id: SKILL-APPROACH-EVALUATION-001
track: cross-cutting
author: gaai-framework
status: stable
version: 1.0
category: cross
updated at: 1772064000

SKILL.md

Approach Evaluation

Purpose / When to Activate

Activate when the invoking agent identifies a technical or architectural decision point where:

Multiple viable implementation approaches exist and the best choice is non-obvious
A technology, library, or service is being introduced for the first time in the project
No established convention exists in conventions.md for the problem domain
The problem touches a domain with well-known industry standards that should be considered
A prior approach failed or showed limitations (post-mortem driven re-evaluation)

Do NOT activate when:

A convention already exists in conventions.md for this exact problem
The approach is explicitly defined in the Story or a prior decision
The Story is Tier 1 / MicroDelivery with obvious implementation
The evaluation would delay delivery without reducing meaningful uncertainty

This skill researches and compares — it does not decide. The invoking agent (Planning Sub-Agent or Discovery Agent) reads the output and makes the decision.

Process

Phase 1 — Problem Framing

State the problem precisely: what capability is needed, what constraints apply
Read contexts/memory/index.md. Resolve and load:
- The project category file → extract tech stack, architectural boundaries, known constraints
- The patterns category file → extract established patterns and conventions
- The decisions category file → extract prior decisions on related topics Do not assume specific file paths — resolve from index.
Define evaluation criteria specific to this problem. Always include:
- Stack compatibility — does it work with the project's tech stack? (read from project context file, not hardcoded here)
- Constraint alignment — does it respect the architectural boundaries described in the project context?
- Operational fit — maintainability given team size and constraints described in project context
- Maturity — production readiness, community support, documentation quality
Add problem-specific criteria as needed (performance, cost, security, scalability, etc.)

Phase 2 — Industry Research

Research current industry standards and best practices for the problem:
- Use web search for current state-of-the-art and community consensus
- Use Context7 or documentation tools for library/framework specifics
- Check for established patterns in similar projects or architectures
Identify 2-3 viable approaches — not one, not ten
- Each approach must be genuinely viable (not a strawman)
- Include the "obvious" approach (what the LLM would default to) even if it may not be best
- Include at least one alternative that challenges the default
For each approach, gather factual evidence:
- How it works (brief mechanism description)
- Where it is used successfully (real examples, not hypothetical)
- Known limitations or failure modes
- Compatibility with edge compute / serverless environments (if relevant)

Phase 3 — Structured Comparison

Evaluate each approach against every criterion from Phase 1
Use factual evidence only — no "this feels better" reasoning
Flag any criterion where information is uncertain or unavailable
Note any approach that would require violating an existing convention or decision

Phase 4 — Trade-off Surfacing

For each approach, state explicitly:
- What you gain by choosing it
- What you lose or accept as a trade-off
- What it implies for future decisions (lock-in, reversibility)
If one approach is clearly dominated (worse on all criteria), note it but do not eliminate it — the agent decides

Outputs

markdown

# Approach Evaluation — {Story ID or Decision Context}: {Problem Title}

## Problem Statement

{What needs to be solved, in one paragraph}

## Evaluation Criteria

| # | Criterion | Weight | Source |
|---|-----------|--------|--------|
| C1 | {criterion} | must-have / important / nice-to-have | {project context reference} |
| C2 | {criterion} | must-have / important / nice-to-have | {project context reference} |

## Approaches Identified

### Approach A — {Name}

**Mechanism:** {how it works — 2-3 sentences}
**Evidence:** {where it's used, maturity signals}
**Limitations:** {known failure modes or constraints}

### Approach B — {Name}

**Mechanism:** {how it works — 2-3 sentences}
**Evidence:** {where it's used, maturity signals}
**Limitations:** {known failure modes or constraints}

### Approach C — {Name} (if applicable)

**Mechanism:** {how it works — 2-3 sentences}
**Evidence:** {where it's used, maturity signals}
**Limitations:** {known failure modes or constraints}

## Comparison Matrix

| Criterion | Approach A | Approach B | Approach C |
|-----------|-----------|-----------|-----------|
| C1: {name} | {factual assessment} | {factual assessment} | {factual assessment} |
| C2: {name} | {factual assessment} | {factual assessment} | {factual assessment} |

## Trade-offs

### Approach A
- **Gains:** {what you get}
- **Costs:** {what you accept}
- **Lock-in:** {reversibility assessment}

### Approach B
- **Gains:** {what you get}
- **Costs:** {what you accept}
- **Lock-in:** {reversibility assessment}

## Open Questions

- {Any criterion where evidence is uncertain or missing}
- {Any constraint that needs human clarification}

## Sources

- {URL or reference for each factual claim}

Saves to contexts/artefacts/evaluations/{id}.approach-evaluation.md.

Quality Checks

Every criterion has a clear source (project context, not invented)
Every assessment in the comparison matrix is factual, not opinion
No approach is dismissed without evidence
No approach is favored without evidence
Trade-offs are explicit and symmetric (gains AND costs for each)
Sources are provided for industry claims
The evaluation does not contain a recommendation or decision
Uncertain information is flagged as uncertain, not presented as fact

Non-Goals

This skill must NOT:

Recommend or decide — the agent decides after reading the evaluation
Invent criteria not grounded in project context
Hallucinate library capabilities or industry practices — cite sources
Evaluate more than 3 approaches (focus drives quality)
Produce vague assessments ("this is generally good") — every claim must be specific and evidence-backed
Skip the "obvious" approach — even if the default seems suboptimal, it must be evaluated fairly

The best approach is the one that survives honest comparison — not the one that arrives first.

Maintainer

Fr-e-d Core maintainer

Source details

Full Name: Fr-e-d/GAAI-framework
Branch: main
Path in repo: .gaai/core/skills/cross/approach-evaluation
License: Other
Topics: claude-code ai-agents ai-coding codex-cli cursor gemini-cli agentic-coding context-engineering vibe-coding opencode autonomous-agents devtools windsurf ai-governance ai-developer-tools ai-memory-system

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

Fr-e-d/GAAI-framework

ci-watch-and-fix

Watch GitHub Actions CI after PR creation, detect failures, extract logs, apply minimal fixes, and re-push — keeping the delivery session alive until CI resolves or escalating after 3 cycles. Activate immediately after gh pr create and before marking the story done.

123 27

Explore

Fr-e-d/GAAI-framework

qa-review

Validate that implemented code fully satisfies Story acceptance criteria, respects rules, and introduces no regressions. This is the hard quality gate — no pass means no delivery. Activate after implementation is complete.

123 27

Explore

Fr-e-d/GAAI-framework

compose-team

Assemble the context bundles for each sub-agent based on evaluate-story output. Produces spawn-ready packages for Planning, Implementation, QA, or MicroDelivery sub-agents. Activate after evaluate-story, before spawning any sub-agent.

123 27

Explore

Fr-e-d/GAAI-framework

coordinate-handoffs

Validate sub-agent handoff artefacts, sequence phase transitions, and manage retry and escalation logic. Activate after each sub-agent terminates to determine next action.

123 27

Explore

Fr-e-d/GAAI-framework

implement

Generate correct, minimal, maintainable code that satisfies a validated Story's acceptance criteria against an execution plan. Activate when a Story is validated, a plan exists, and all prerequisites are unambiguous.

123 27

Explore

Fr-e-d/GAAI-framework

delivery-high-level-plan

Transform validated Stories into a clear, minimal, governed execution plan. Used by the Planning Sub-Agent as the first planning pass before prepare-execution-plan for Tier 2/3, or as the sole planning output for simple Stories.

123 27

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

Metadata

SKILL.md

Approach Evaluation

Purpose / When to Activate

Process

Phase 1 — Problem Framing

Phase 2 — Industry Research

Phase 3 — Structured Comparison

Phase 4 — Trade-off Surfacing

Outputs

Quality Checks

Non-Goals

Recommended Agent Skills

ci-watch-and-fix

qa-review

compose-team

coordinate-handoffs

implement

delivery-high-level-plan