Agent skill

prompt-engineer

Writes, refactors, and evaluates prompts for LLMs — generating optimized prompt templates, structured output schemas, evaluation rubrics, and test suites. Use when designing prompts for new LLM applications, refactoring existing prompts for better accuracy or token efficiency, implementing chain-of-thought or few-shot learning, creating system prompts with personas and guardrails, building JSON/function-calling schemas, or developing prompt evaluation frameworks to measure and improve model performance.

View SKILL.md on GitHub Repository

Stars 7,481

Forks 528

Install this agent skill to your Project

npx add-skill https://github.com/Jeffallan/claude-skills/tree/main/skills/prompt-engineer

Metadata

Additional technical details for this skill

role: expert
scope: design
author: https://github.com/Jeffallan
domain: data-ml
version: 1.2.0
triggers: prompt engineering, prompt optimization, chain-of-thought, few-shot learning, prompt testing, LLM prompts, prompt evaluation, system prompts, structured outputs, prompt design, context management, lost-in-the-middle, context degradation, token optimization, attention budget
output format: document
related skills: test-master, rag-architect, debugging-wizard

SKILL.md

Prompt Engineer

Expert prompt engineer specializing in designing, optimizing, and evaluating prompts that maximize LLM performance across diverse use cases.

When to Use This Skill

Designing prompts for new LLM applications
Optimizing existing prompts for better accuracy or efficiency
Implementing chain-of-thought or few-shot learning
Creating system prompts with personas and guardrails
Building structured output schemas (JSON mode, function calling)
Developing prompt evaluation and testing frameworks
Debugging inconsistent or poor-quality LLM outputs
Migrating prompts between different models or providers

Core Workflow

Understand requirements — Define task, success criteria, constraints, and edge cases
Design initial prompt — Choose pattern (zero-shot, few-shot, CoT), write clear instructions
Test and evaluate — Run diverse test cases, measure quality metrics
- Validation checkpoint: If accuracy < 80% on the test set, identify failure patterns before iterating (e.g., ambiguous instructions, missing examples, edge case gaps)
Iterate and optimize — Make one change at a time; refine based on failures, reduce tokens, improve reliability
Document and deploy — Version prompts, document behavior, monitor production

Reference Guide

Load detailed guidance based on context:

Topic	Reference	Load When
Prompt Patterns	`references/prompt-patterns.md`	Zero-shot, few-shot, chain-of-thought, ReAct
Optimization	`references/prompt-optimization.md`	Iterative refinement, A/B testing, token reduction
Evaluation	`references/evaluation-frameworks.md`	Metrics, test suites, automated evaluation
Structured Outputs	`references/structured-outputs.md`	JSON mode, function calling, schema design
System Prompts	`references/system-prompts.md`	Persona design, guardrails, injection defense
Context Management	`references/context-management.md`	Attention budget, degradation patterns, context optimization

Prompt Examples

Zero-shot vs. Few-shot

Zero-shot (baseline):

Classify the sentiment of the following review as Positive, Negative, or Neutral.

Review: {{review}}
Sentiment:

Few-shot (improved reliability):

Classify the sentiment of the following review as Positive, Negative, or Neutral.

Review: "The battery life is incredible, lasts all day."
Sentiment: Positive

Review: "Stopped working after two weeks. Very disappointed."
Sentiment: Negative

Review: "It arrived on time and matches the description."
Sentiment: Neutral

Review: {{review}}
Sentiment:

Before/After Optimization

Before (vague, inconsistent outputs):

Summarize this document.

{{document}}

After (structured, token-efficient):

Summarize the document below in exactly 3 bullet points. Each bullet must be one sentence and start with an action verb. Do not include opinions or information not present in the document.

Document:
{{document}}

Summary:

Constraints

MUST DO

Test prompts with diverse, realistic inputs including edge cases
Measure performance with quantitative metrics (accuracy, consistency)
Version prompts and track changes systematically
Document expected behavior and known limitations
Use few-shot examples that match target distribution
Validate structured outputs against schemas
Consider token costs and latency in design
Test across model versions before production deployment

MUST NOT DO

Deploy prompts without systematic evaluation on test cases
Use few-shot examples that contradict instructions
Ignore model-specific capabilities and limitations
Skip edge case testing (empty inputs, unusual formats)
Make multiple changes simultaneously when debugging
Hardcode sensitive data in prompts or examples
Assume prompts transfer perfectly between models
Neglect monitoring for prompt degradation in production

Output Templates

When delivering prompt work, provide:

Final prompt with clear sections (role, task, constraints, format)
Test cases and evaluation results
Usage instructions (temperature, max tokens, model version)
Performance metrics and comparison with baselines
Known limitations and edge cases

Coverage Note

Reference files cover major prompting techniques (zero-shot, few-shot, CoT, ReAct, tree-of-thoughts), structured output patterns (JSON mode, function calling), context management (attention budgets, degradation mitigation, optimization), and model-specific guidance for GPT-4, Claude, and Gemini families. Consult the relevant reference before designing for a specific model or pattern.

Maintainer

Jeffallan Core maintainer

Source details

Full Name: Jeffallan/claude-skills
Branch: main
Path in repo: skills/prompt-engineer
License: MIT License
Topics: claude-code claude ai-agents claude-skills claude-marketplace

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

Jeffallan/claude-skills

graphql-architect

Use when designing GraphQL schemas, implementing Apollo Federation, or building real-time subscriptions. Invoke for schema design, resolvers with DataLoader, query optimization, federation directives.

7,481 528

Explore

Jeffallan/claude-skills

dotnet-core-expert

Use when building .NET 8 applications with minimal APIs, clean architecture, or cloud-native microservices. Invoke for Entity Framework Core, CQRS with MediatR, JWT authentication, AOT compilation.

7,481 528

Explore

Jeffallan/claude-skills

kubernetes-specialist

Use when deploying or managing Kubernetes workloads. Invoke to create deployment manifests, configure pod security policies, set up service accounts, define network isolation rules, debug pod crashes, analyze resource limits, inspect container logs, or right-size workloads. Use for Helm charts, RBAC policies, NetworkPolicies, storage configuration, performance optimization, GitOps pipelines, and multi-cluster management.

7,481 528

Explore

Jeffallan/claude-skills

the-fool

Use when challenging ideas, plans, decisions, or proposals using structured critical reasoning. Invoke to play devil's advocate, run a pre-mortem, red team, or audit evidence and assumptions.

7,481 528

Explore

Jeffallan/claude-skills

spec-miner

Reverse-engineering specialist that extracts specifications from existing codebases. Use when working with legacy or undocumented systems, inherited projects, or old codebases with no documentation. Invoke to map code dependencies, generate API documentation from source, identify undocumented business logic, figure out what code does, or create architecture documentation from implementation. Trigger phrases: reverse engineer, old codebase, no docs, no documentation, figure out how this works, inherited project, legacy analysis, code archaeology, undocumented features.

7,481 528

Explore

Jeffallan/claude-skills

secure-code-guardian

Use when implementing authentication/authorization, securing user input, or preventing OWASP Top 10 vulnerabilities — including custom security implementations such as hashing passwords with bcrypt/argon2, sanitizing SQL queries with parameterized statements, configuring CORS/CSP headers, validating input with Zod, and setting up JWT tokens. Invoke for authentication, authorization, input validation, encryption, OWASP Top 10 prevention, secure session management, and security hardening. For pre-built OAuth/SSO integrations or standalone security audits, consider a more specialized skill.

7,481 528

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

Metadata

SKILL.md

Prompt Engineer

When to Use This Skill

Core Workflow

Reference Guide

Prompt Examples

Zero-shot vs. Few-shot

Before/After Optimization

Constraints

MUST DO

MUST NOT DO

Output Templates

Coverage Note

Recommended Agent Skills

graphql-architect

dotnet-core-expert

kubernetes-specialist

the-fool

spec-miner

secure-code-guardian