Agent skill

hypothesis-discipline

Manages the hypothesis lifecycle and enforces validation criteria, time budgets, and confidence scoring rules. Use when creating hypotheses, updating confidence scores, setting validation criteria, handling timeouts, or recording validation results.

Install this agent skill to your Project

npx add-skill https://github.com/shinpr/claude-code-discover/tree/main/skills/hypothesis-discipline

SKILL.md

Hypothesis Discipline

Core Principle

A hypothesis is not a guess — it is a structured statement with clear validation criteria and a time budget. Every hypothesis must answer: "How will we know if this is true or false?"

Hypothesis Characteristics

Hypotheses exist at every level of the OST hierarchy (Outcome → Opportunity → Solution → Assumption)
Each hypothesis has a target level attribute indicating which OST level it addresses
Hypotheses follow an ADR-style lifecycle — a single file tracks the full journey from draft to conclusion
Rejected and invalidated hypotheses are never deleted — they are learning assets

Hypothesis Lifecycle

draft → testing → validated → adopted
                            → rejected (validated but not adopted)
              → invalidated (disproven by evidence)
              → inconclusive (evidence gathered but insufficient to confirm or deny)
              → timeout (deadline passed, decision needed: continue or stop)
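
The lifecycle above can be read as a small state machine. The following is a minimal sketch under that reading, not the skill's own implementation: the names `ALLOWED_TRANSITIONS`, `can_transition`, and `advance` are hypothetical, and the `timeout` to `testing` edge is an assumption that models the "continue" decision.

```python
# Hypothetical transition table read off the lifecycle diagram.
ALLOWED_TRANSITIONS = {
    "draft": {"testing"},
    "testing": {"validated", "invalidated", "inconclusive", "timeout"},
    "validated": {"adopted", "rejected"},
    # Assumption: "continue" after a timeout is modelled as a return to testing.
    "timeout": {"testing"},
    # The remaining statuses are treated as terminal in this sketch.
    "adopted": set(),
    "rejected": set(),
    "invalidated": set(),
    "inconclusive": set(),
}


def can_transition(current: str, target: str) -> bool:
    """Return True if the diagram allows moving from current to target."""
    return target in ALLOWED_TRANSITIONS.get(current, set())


def advance(current: str, target: str) -> str:
    """Return the new status, or raise if the move skips a lifecycle step."""
    if not can_transition(current, target):
        raise ValueError(f"illegal transition: {current} -> {target}")
    return target
```

For example, `advance("draft", "testing")` returns `"testing"`, while `advance("draft", "validated")` raises because a draft cannot be validated without being tested first.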

Hypothesis File Schema

The authoritative schema is defined in references/hypothesis-template.md. Key fields:

id: HYPO-NNN
level: outcome / opportunity / solution / assumption
status: draft / testing / validated / invalidated / inconclusive / adopted / rejected / timeout
confidence: per-risk scores (value, usability, feasibility, viability) on a 0-10 scale
time-budget and deadline: validation time constraints
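
As a rough illustration of the key fields, here is a sketch of an in-memory record with basic validation. It is not the authoritative schema (that lives in references/hypothesis-template.md); the class name `Hypothesis`, the attribute names, and the reading of `NNN` as one or more digits are assumptions.

```python
import re
from dataclasses import dataclass, field
from datetime import date
from typing import Dict, Optional

LEVELS = ("outcome", "opportunity", "solution", "assumption")
STATUSES = ("draft", "testing", "validated", "invalidated",
            "inconclusive", "adopted", "rejected", "timeout")
RISKS = ("value", "usability", "feasibility", "viability")


@dataclass
class Hypothesis:
    """Hypothetical in-memory view of the key fields listed above."""
    id: str
    level: str
    status: str = "draft"
    confidence: Dict[str, int] = field(default_factory=dict)
    time_budget: Optional[str] = None   # e.g. "1w"
    deadline: Optional[date] = None

    def __post_init__(self) -> None:
        # Assumption: NNN in HYPO-NNN means one or more digits.
        if not re.fullmatch(r"HYPO-\d+", self.id):
            raise ValueError(f"bad id: {self.id!r}")
        if self.level not in LEVELS:
            raise ValueError(f"unknown level: {self.level!r}")
        if self.status not in STATUSES:
            raise ValueError(f"unknown status: {self.status!r}")
        for risk, score in self.confidence.items():
            if risk not in RISKS or not 0 <= score <= 10:
                raise ValueError(f"bad confidence entry: {risk}={score}")
```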

Validation Criteria Requirements

Every hypothesis must define before testing begins:

We believe that — the hypothesis statement
We'll know we're right when — measurable success criteria
We'll know we're wrong when — measurable failure criteria
Validation method — how we will test (prototype, data analysis, interview, code spike, market research)
Time budget — maximum time investment before forced decision
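
The "define before testing begins" rule can be checked mechanically. The sketch below assumes a hypothesis is held as a plain dictionary; the key names in `REQUIRED_SECTIONS` and both function names are hypothetical stand-ins for the five items above.

```python
# Hypothetical keys standing in for the five required items.
REQUIRED_SECTIONS = (
    "we_believe_that",
    "right_when",
    "wrong_when",
    "validation_method",
    "time_budget",
)


def missing_criteria(hypothesis: dict) -> list:
    """List the required sections that are absent or blank."""
    return [
        key for key in REQUIRED_SECTIONS
        if not str(hypothesis.get(key) or "").strip()
    ]


def ready_for_testing(hypothesis: dict) -> bool:
    """A hypothesis may leave draft only when nothing is missing."""
    return not missing_criteria(hypothesis)
```

A draft that states what it believes but has no failure criteria would report `wrong_when` as missing and stay in draft.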

Time Budget and Cutoff

Every hypothesis gets a time budget (e.g., 1d, 1w, 2w)
A deadline sets the hard cutoff date
When the deadline passes without a conclusion → status becomes timeout
Timeout forces a decision: extend (with justification), pivot, or abandon
Never let a hypothesis run indefinitely — unbounded exploration wastes resources
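
To make the cutoff concrete, here is a minimal sketch that turns a budget string into a deadline and detects a timeout. It assumes budgets use only the `d` and `w` suffixes shown in the examples and that they count calendar days; `parse_budget`, `deadline_for`, and `is_timed_out` are hypothetical names.

```python
import re
from datetime import date, timedelta

# Assumption: budgets count calendar days, so one week is seven days.
_UNIT_DAYS = {"d": 1, "w": 7}


def parse_budget(budget: str) -> timedelta:
    """Convert a budget such as '1d' or '2w' into a timedelta."""
    match = re.fullmatch(r"(\d+)([dw])", budget.strip())
    if match is None:
        raise ValueError(f"unrecognised time budget: {budget!r}")
    return timedelta(days=int(match.group(1)) * _UNIT_DAYS[match.group(2)])


def deadline_for(start: date, budget: str) -> date:
    """Hard cutoff date for a hypothesis that started testing on start."""
    return start + parse_budget(budget)


def is_timed_out(status: str, deadline: date, today: date) -> bool:
    """True once the deadline has passed while the hypothesis is still testing."""
    return status == "testing" and today > deadline
```

With a start of 6 January 2025 and a budget of `2w`, `deadline_for` gives 20 January 2025; the hypothesis counts as timed out from 21 January onward if it is still in testing.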

Confidence Update Rules

Confidence scores are updated only when new evidence is gathered
Record the evidence that justified each score change
Confidence can go down as well as up — negative evidence is valid evidence
Different risk dimensions can have different confidence levels
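
These rules translate into an update function that refuses a score change without evidence. The following is a sketch only; `ConfidenceLog` and its fields are hypothetical, and the history is kept as simple tuples for brevity.

```python
from dataclasses import dataclass, field

RISKS = ("value", "usability", "feasibility", "viability")


@dataclass
class ConfidenceLog:
    """Hypothetical per-risk score tracker with an evidence trail."""
    scores: dict = field(default_factory=dict)
    history: list = field(default_factory=list)

    def update(self, risk: str, new_score: int, evidence: str) -> None:
        if risk not in RISKS:
            raise ValueError(f"unknown risk dimension: {risk!r}")
        if not 0 <= new_score <= 10:
            raise ValueError("score must be on the 0-10 scale")
        if not evidence.strip():
            # Scores change only when new evidence is gathered.
            raise ValueError("a score change needs recorded evidence")
        old_score = self.scores.get(risk)
        self.scores[risk] = new_score  # may be lower than old_score
        self.history.append((risk, old_score, new_score, evidence))
```

Each risk dimension is tracked independently, so a hypothesis can hold a high value score and a low feasibility score at the same time.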

Result Recording

When a hypothesis reaches a conclusion (validated/invalidated/inconclusive/adopted/rejected):

Record the result in the hypothesis file with evidence
Update confidence scores with final values
Link to evidence (data, screenshots, prototype results, interview notes)
Extract learnings — what did we learn regardless of the outcome?
Update the parent Opportunity if the result changes its understanding
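
One way to keep result records complete is to build them through a function that rejects missing evidence or learnings. This is an illustrative sketch; `build_result_record`, its parameters, and the dictionary layout are assumptions, not the format used in the hypothesis file.

```python
CONCLUDED = {"validated", "invalidated", "inconclusive", "adopted", "rejected"}


def build_result_record(status, summary, evidence_links, learnings,
                        final_confidence):
    """Assemble a result entry, refusing incomplete records."""
    if status not in CONCLUDED:
        raise ValueError(f"{status!r} is not a concluded status")
    if not evidence_links:
        raise ValueError("link at least one piece of evidence")
    if not learnings:
        # Learnings are required regardless of the outcome.
        raise ValueError("record what was learned")
    return {
        "status": status,
        "summary": summary,
        "evidence": list(evidence_links),
        "learnings": list(learnings),
        "confidence": dict(final_confidence),
    }
```

Updating the parent Opportunity is left out of the sketch because it depends on how the OST files are organised.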

Key Disciplines

Separate creation from evaluation: Don't judge hypotheses while generating them
Seek disconfirming evidence: Actively look for reasons the hypothesis might be wrong
One hypothesis, one test: Don't bundle multiple hypotheses into a single validation
Record everything: Even "obvious" conclusions need recorded reasoning
Rejected ≠ worthless: A rejected hypothesis teaches what doesn't work and why
Inconclusive is honest: When evidence is insufficient, say so instead of forcing a verdict

Why These Disciplines Matter

Each discipline exists to counter a specific cognitive tendency:

"Separate creation from evaluation" counters premature judgment that kills divergent thinking
"Seek disconfirming evidence" counters the natural pull toward confirming what we already believe
"One hypothesis, one test" counters the temptation to bundle tests, which makes results uninterpretable
"Time budgets with hard cutoffs" counters unbounded exploration: a hypothesis without a deadline is an excuse to avoid decisions
"Confidence can go down" counters the assumption that validation is always forward progress. Negative evidence is equally valuable

Maintainer

shinpr (core maintainer)

Source details

Full Name: shinpr/claude-code-discover
Branch: main
Path in repo: skills/hypothesis-discipline
License: MIT License
Topics: claude-code ai-agents developer-tools prompt-engineering productivity claude-code-plugin llm-orchestration product-discovery hypothesis-validation prd-generation

Recommended Agent Skills

Related skills from shinpr/claude-code-discover:

recipe-persona: Create or update personas with demographic, contextual, JTBD, and behavioral data
recipe-refine-visuals: Optional side-workflow for design experts to refine auto-derived Visual Tokens in brand-direction.md with professional judgment
prd-standards: Defines PRD structure, user story format with 4 Risks assessment, EARS-format acceptance criteria, and delivery readiness thresholds. Use when writing PRDs, drafting user stories, defining acceptance criteria, or reviewing PRD quality and completeness.
recipe-vision: Define or update product vision, strategy, outcomes, and NSM
design-perspective: Integrates design principles, WCAG 2.2 AA accessibility, persona context, and state design into product decisions. Use when reviewing UX decisions, checking accessibility, applying design principles, or ensuring state coverage in acceptance criteria.
prototype-guide: Generates self-contained HTML prototypes with design context from project files. Reads design principles, personas, and hypothesis files, then produces a working prototype for Usability and Value risk validation. Use when creating prototypes or validating through tangible artifacts.