Agent skill

cove-prompt-design

Explain how and when to use Chain of Verification (CoVe) in prompt design. Use when designing prompts that require factual accuracy, self checking, or reduction of hallucinations.

Stars 33
Forks 4

Install this agent skill to your Project

npx add-skill https://github.com/Jamie-BitFlight/claude_skills/tree/main/.claude/skills/cove-prompt-design

SKILL.md

Chain of Verification (CoVe) for Prompt Design

This skill documents what CoVe is, when to use it, and how to apply it correctly in prompt design.

CoVe is a prompt design pattern that improves factual reliability by separating generation from verification. The model first produces an answer, then independently verifies that answer through structured checks, and finally produces a corrected or confirmed result.

This is not chain of thought exposure. The verification steps are explicit instructions, not hidden reasoning.


What CoVe Is

Chain of Verification (CoVe) is a structured prompting approach with three phases:

  1. Initial answer generation
  2. Independent verification questions
  3. Final validated answer

The key idea is that the model should not assume its first answer is correct. It must actively test it.


When to Use CoVe

Use CoVe when any of the following are true:

  • The task requires factual accuracy (dates, specs, standards, APIs).
  • Hallucinations would be costly or misleading.
  • The answer depends on multiple independent facts.
  • The model may rely on weak priors or pattern completion.
  • You want the model to challenge its own output.

Concrete examples:

  • Explaining technical standards or protocols.
  • Summarizing regulations or compliance requirements.
  • Producing step by step procedures with real world consequences.
  • Comparing versions, limits, or constraints.
  • Answering questions that look simple but are easy to get subtly wrong.

Do not use CoVe for:

  • Creative writing.
  • Brainstorming.
  • Open ended ideation.
  • Pure opinion or preference questions.

Core CoVe Structure

A minimal CoVe prompt has this structure:

Step 1: Produce an initial answer.
Step 2: Generate verification questions that would test the answer.
Step 3: Answer each verification question independently.
Step 4: Revise or confirm the original answer based on verification.

The verification questions must be phrased so they can falsify the answer.


Verification Question Design Rules

Good verification questions:

  • Are independent of the original wording.
  • Target factual claims, not style.
  • Can be answered without referring to the initial answer.
  • Are specific and falsifiable.

Bad verification questions:

  • "Is the answer correct?"
  • "Does this look right?"
  • "Explain why this is good."

Good examples:

  • "What is the maximum allowed value according to the spec?"
  • "Does the standard permit this behavior?"
  • "Is this API behavior documented or inferred?"

Example: Without CoVe

Explain how HTTP caching works.

Risk:

  • Model may hallucinate headers, defaults, or behaviors.

Example: With CoVe

Task: Explain how HTTP caching works.

Step 1: Provide an initial explanation.

Step 2: List verification questions that check factual correctness
(e.g. cache control headers, default behaviors, validation rules).

Step 3: Answer each verification question independently using known standards.

Step 4: Produce a final corrected explanation.

CoVe vs Chain of Thought

Important distinction:

  • Chain of Thought exposes reasoning.
  • CoVe enforces verification.

You are not asking the model to show hidden reasoning. You are asking it to run checks.

Verification outputs may be shown or summarized, depending on your needs.


Failure Modes to Watch For

Common mistakes when using CoVe:

  • Verification questions that simply restate the answer.
  • Letting the model reuse the same phrasing across steps.
  • Skipping revision even when verification reveals uncertainty.
  • Using CoVe when accuracy is not required (adds unnecessary verbosity).

Practical Prompt Template

You will complete this task using Chain of Verification.

1. Produce an initial answer.
2. Generate 3 to 6 verification questions that could falsify the answer.
3. Answer each verification question independently.
4. If any answer reveals an error or uncertainty, revise the original answer.
5. Output only the final revised answer, followed by a short confidence note.

Output Control

If you want concise results, explicitly constrain output:

  • Hide intermediate verification steps.
  • Summarize verification findings.
  • Emit only the final answer unless issues were found.

Example:

Perform verification internally.
Only show the final answer and list any corrected assumptions.

Summary

Use CoVe to reduce hallucinations and increase reliability by:

  • Separating generation from verification.
  • Forcing independent factual checks.
  • Revising answers based on evidence, not confidence.

If the task can be wrong in subtle ways, CoVe is appropriate.

Expand your agent's capabilities with these related and highly-rated skills.

Jamie-BitFlight/claude_skills

ccc

This skill should be used when code search is needed (whether explicitly requested or as part of completing a task), when indexing the codebase after changes, or when the user asks about ccc, cocoindex-code, or the codebase index. Trigger phrases include 'search the codebase', 'find code related to', 'update the index', 'ccc', 'cocoindex-code'.

33 4
Explore
Jamie-BitFlight/claude_skills

agent-browser

Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.

33 4
Explore
Jamie-BitFlight/claude_skills

delegate

Quick delegation template for sub-agent prompts. Use when assigning work to a sub-agent, before invoking the Agent tool, or when preparing prompts for specialized agents. Provides the WHERE-WHAT-WHY framework. For comprehensive delegation guidance, activate the agent-orchestration how-to-delegate skill.

33 4
Explore
Jamie-BitFlight/claude_skills

swarm-spawning

Spawn agents and teammates in Claude Code swarms. Use when choosing between subagents vs teammates, selecting agent types (Explore, Plan, general-purpose, plugin agents), configuring spawn backends (in-process, tmux, iterm2), or setting environment variables for spawned agents.

33 4
Explore
Jamie-BitFlight/claude_skills

knowledge-explorer

Manage the research/ knowledge base (KB) of tool and library research entries. Use when browsing KB topics, adding new research entries, updating existing entries with dated revisions, fetching GitHub repo metadata into a draft KB entry, or migrating old-format entries to skill-spec frontmatter. Triggers on tasks like "what do we have on X", "add this to the KB", "update the KB entry for Y", "fetch github info for owner/repo", or "migrate old entries".

33 4
Explore
Jamie-BitFlight/claude_skills

design-anti-patterns

Enforce anti-AI UI design rules based on the Uncodixfy methodology. Use when generating HTML, CSS, React, Vue, Svelte, or any frontend UI code. Prevents "Codex UI" — the generic AI aesthetic of soft gradients, floating panels, oversized rounded corners, glassmorphism, hero sections in dashboards, and decorative copy. Applies constraints from Linear/Raycast/Stripe/GitHub design philosophy: functional, honest, human-designed interfaces. Triggers on: UI generation, dashboard building, frontend component creation, CSS styling, landing page design, or any task producing visual interface code.

33 4
Explore

Didn't find tool you were looking for?

Be as detailed as possible for better results