Agent skill

phx:challenge

Challenge mode reviews - rigorous questioning before approving changes. Use when you want thorough scrutiny of Ecto changes, LiveView events, or PR readiness.

View SKILL.md on GitHub Repository

Stars 252

Forks 17

Install this agent skill to your Project

npx add-skill https://github.com/oliver-kriska/claude-elixir-phoenix/tree/main/plugins/elixir-phoenix/skills/challenge

SKILL.md

Challenge Mode Reviews

Rigorous, critical review patterns inspired by Boris Cherny's "Grill me" approach. Push beyond first solutions to ensure quality.

Iron Laws - Never Violate These

No approval without verification - Don't approve until all concerns addressed
Assume bugs exist - Look for edge cases, race conditions, missing handlers
Question everything - Even "obvious" code can hide issues
Demand proof - Ask for tests, show state transitions, verify behavior

Adversarial Lenses (Apply to ALL Modes)

"What Would Break This?" — Production failure modes under load, during deploys, with unexpected data
"Assumption Stress Test" — List every assumption; which are most fragile?
"Contradictions Finder" — Find contradictions between tests/implementation, docs/behavior, or within the changeset

Challenge Modes

Ecto Challenge (`/phx:challenge ecto`)

Grill the developer on database changes:

Migration Safety

Will this migration lock the table in production?
What happens to existing records without the new field?
Is the migration reversible?
Are there any unsafe operations (column removal, type change)?

Query Performance

Have you introduced any N+1 queries?
Are there missing indexes for new WHERE clauses?
Will this query scale with data growth?

Schema Integrity

Are all constraints enforced at database level?
What happens during rolling deployment (old code, new schema)?
Are foreign key cascades correct?

Backward Compatibility

Will old code work during deployment?
Are there any breaking changes to the context API?

LiveView Challenge (`/phx:challenge liveview`)

Prove the LiveView handles all cases:

Event Coverage

List every handle_event clause and expected socket state
What happens if socket assigns are missing when event fires?
Are there race conditions between user events and server pushes?

PubSub Handling

List every handle_info clause and when it's triggered
Do all PubSub subscriptions have corresponding handlers?
What happens if a message arrives before mount completes?

State Transitions

Show the event → handler → state transition table
Are all error states handled gracefully?
What's the recovery path from each error state?

Memory & Performance

Are large lists using streams?
Is transient data using temporary_assigns?
What's the memory footprint per connected user?

PR Challenge (`/phx:challenge pr`)

Senior engineer review checklist:

Must Pass

No direct Repo calls in controllers/LiveViews
All Ecto queries use explicit preloads
Changesets validate all user input
No atoms created from params
Error cases handled (not just happy path)
Tests cover new functionality

Performance

No queries in Enum.map loops
LiveView streams for lists > 100 items
Indexes exist for WHERE clause columns

OTP

GenServers have supervision
Timeouts set for GenServer.call
No unbounded process spawning

Security

No SQL injection via raw queries
No path traversal in file handling
Authorization checks present

Prior Findings Deduplication (MANDATORY)

CRITICAL: Prevents re-discovering identical issues across consecutive runs.

Search .claude/plans/*/reviews/ and .claude/reviews/ for prior findings
Read ALL prior findings before analyzing code
Check each finding against priors:
- Fixed → SKIP | Still present → PERSISTENT (one line) | New → NEW (full analysis) | Reintroduced → REGRESSION
Present: NEW first (full), then PERSISTENT (one-line), then REGRESSION

Example Challenge Output

markdown

## Challenge: Ecto — Orders Migration

### FINDING 1: Table lock risk (HIGH)
AddColumn on `orders` (2.1M rows) will lock table during deploy.
**Proof needed**: Run `SELECT count(*) FROM orders` — if >1M, use
`ALTER TABLE ... ADD COLUMN ... DEFAULT NULL` (no lock).

### FINDING 2: Missing index (MEDIUM)
New `WHERE status = ?` query on line 45 has no index.
**Action**: Add `create index(:orders, [:status])` to migration.

### Status: BLOCKED — 2 unresolved findings

Usage

Run /phx:challenge [mode] to initiate a rigorous review. The reviewer will not approve until all concerns are addressed with evidence.

Example workflow:

Run /phx:challenge ecto after migration changes
Answer each question with code references or test results
Address all concerns before proceeding to PR

Maintainer

oliver-kriska Core maintainer

Source details

Full Name: oliver-kriska/claude-elixir-phoenix
Branch: main
Path in repo: plugins/elixir-phoenix/skills/challenge
License: MIT License
Topics: claude-code claude claude-code-skills automation claude-skills vibe-coding claude-code-plugin elixir elixir-phoenix phoenix elixir-lang phoenix-framework

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

oliver-kriska/claude-elixir-phoenix

lab:autoresearch

Self-improving loop for plugin skills. Reads program.md, proposes one mutation per iteration, evaluates against deterministic scorer, keeps improvements via git, reverts failures. Targets weakest skill+dimension. Use with /loop for overnight runs.

252 17

Explore

oliver-kriska/claude-elixir-phoenix

promote

Generate X/Twitter release promotion posts with ASCII tables and CodeSnap rendering. Use when writing release posts, promotion tweets, plugin announcements, or preparing social media content for new versions.

252 17

Explore

oliver-kriska/claude-elixir-phoenix

skill-monitor

Analyze skill effectiveness across sessions. Computes per-skill metrics (action rate, friction, outcomes), identifies degrading skills, and generates improvement recommendations. Requires session-scan data in metrics.jsonl.

252 17

Explore

oliver-kriska/claude-elixir-phoenix

session-trends

Analyze trends across session metrics. Computes windowed aggregates, deltas, and compares against MEMORY.md findings. Use periodically for progress tracking.

252 17

Explore

oliver-kriska/claude-elixir-phoenix

cc-changelog

CONTRIBUTOR TOOL - Track CC changelog, extract new versions since last check, analyze impact on plugin (breaking changes, opportunities, deprecations). Run periodically or before releases. NOT part of the distributed plugin.

252 17

Explore

oliver-kriska/claude-elixir-phoenix

session-scan

Compute metrics for Claude Code sessions. Discovers via ccrider, filters trivial, computes friction/opportunity/fingerprint scores. Use for broad session triage.

252 17

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

Challenge Mode Reviews

Iron Laws - Never Violate These

Adversarial Lenses (Apply to ALL Modes)

Challenge Modes

Ecto Challenge (/phx:challenge ecto)

LiveView Challenge (/phx:challenge liveview)

PR Challenge (/phx:challenge pr)

Prior Findings Deduplication (MANDATORY)

Example Challenge Output

Usage

Recommended Agent Skills

lab:autoresearch

promote

skill-monitor

session-trends

cc-changelog

session-scan

Ecto Challenge (`/phx:challenge ecto`)

LiveView Challenge (`/phx:challenge liveview`)

PR Challenge (`/phx:challenge pr`)