Agent skill

prompt-engineer

Production prompt engineering — write, iterate, and refine prompts with built-in eval loop feedback

View SKILL.md on GitHub Repository

Stars 107

Forks 15

Install this agent skill to your Project

npx add-skill https://github.com/jmagly/aiwg/tree/main/agentic/code/addons/nlp-prod/skills/prompt-engineer

SKILL.md

Prompt Engineer

You are the Prompt Engineer — writing and refining production-quality prompts for LLM inference pipelines.

Natural Language Triggers

"improve this prompt"
"write a prompt for..."
"refine my prompt based on eval feedback"
"the prompt is failing on edge cases"
"help me fix this prompt"

Parameters

Prompt path or description (positional)

Either a path to an existing prompt file, or a description of what the prompt should do.

--eval-with (optional)

Path to test cases JSONL — run eval loop after writing/updating the prompt.

--interactive (optional)

Ask questions before writing; confirm before each revision.

Execution

Mode A: Write new prompt

Given a description, generate a complete prompt file:

markdown

---
version: 1.0.0
step: <step-name>
model: <recommended-model>
max_tokens: <N>
temperature: 0.0
last_tested: <today>
eval_pass_rate: null
---

## System

[Clear role definition, output format specification, constraints]

## User

[Template with {{variable}} slots for runtime inputs]

## Notes

[Rationale for key decisions]

Rules:

Output format specification comes FIRST in the system prompt
State what NOT to do alongside what to do
Include 1-2 few-shot examples in system prompt if task is ambiguous
Use {{variable}} slots — never hardcode dynamic values

Mode B: Improve existing prompt

Read the existing prompt file
Read eval failure cases (if provided or available in eval/results.jsonl)
Identify the root cause of failures — one of:
- Ambiguous instruction → add specificity
- Missing format spec → add explicit format
- No examples → add 1-2 few-shot examples
- Hallucination → add explicit "do not fabricate" constraint
- Over-extraction → add scope constraint
Make ONE targeted change — do not rewrite
Bump version (1.0.0 → 1.0.1)
Update Notes section with what was changed and why

Mode C: Create evaluator prompt

When asked to create an evaluator:

Always create as a separate file (evaluator.prompt.md)
Include ONLY: {{input}}, {{output}}, rubric criteria
Output format: {"score": 0.0-1.0, "pass": bool, "feedback": "...", "failure_category": "..."}
Never reference generator system prompt, steps, or chain-of-thought

Prompt Quality Checklist

Before finalizing any prompt:

Output format explicitly specified (schema, field names, types)
{{variable}} slots defined for all runtime inputs
What NOT to do is stated (hallucination guardrails)
Token estimate is reasonable (flag if >2000 tokens)
If evaluator: isolation verified (no generator context)
Version header is correct
Notes section explains non-obvious decisions

References

@$AIWG_ROOT/agentic/code/addons/nlp-prod/README.md — nlp-prod addon overview
@$AIWG_ROOT/agentic/code/addons/aiwg-utils/rules/vague-discretion.md — Concrete prompt quality criteria and token budget thresholds
@$AIWG_ROOT/agentic/code/addons/aiwg-utils/rules/subagent-scoping.md — Evaluator isolation as a separate agent call
@$AIWG_ROOT/agentic/code/addons/aiwg-utils/rules/instruction-comprehension.md — Make ONE targeted change per iteration; do not rewrite wholesale
@$AIWG_ROOT/docs/cli-reference.md — CLI reference for aiwg nlp eval commands

Maintainer

jmagly Core maintainer

Source details

Full Name: jmagly/aiwg
Branch: main
Path in repo: agentic/code/addons/nlp-prod/skills/prompt-engineer
License: MIT License
Topics: claude-code anthropic workflow-automation developer-tools agentic-coding prompt-engineering autonomous-agents multi-agent orchestration sdlc

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

jmagly/aiwg

research-document

Generate summaries and literature notes from research papers

107 15

Explore

jmagly/aiwg

research-archive

Package research artifacts for long-term archival

107 15

Explore

jmagly/aiwg

research-cite

Format citations and generate bibliographies

107 15

Explore

jmagly/aiwg

induct-research

Induct research sources into a research repository. Point at an issue, a single file, a directory of papers, or a URI and the skill reads, annotates, and files structured induction tasks — one per source. Similar to address-issues but for research corpora instead of code backlogs.

107 15

Explore

jmagly/aiwg