hypothesis-discipline
Manages hypothesis lifecycle, enforces validation criteria, time budgets, and confidence scoring rules. Use when creating hypotheses, updating confidence scores, setting validation criteria, handling timeouts, or recording validation results.
Install this agent skill to your Project
npx add-skill https://github.com/shinpr/claude-code-discover/tree/main/skills/hypothesis-discipline
Hypothesis Discipline
Core Principle
A hypothesis is not a guess — it is a structured statement with clear validation criteria and a time budget. Every hypothesis must answer: "How will we know if this is true or false?"
Hypothesis Characteristics
- Hypotheses exist at every level of the OST hierarchy (Outcome → Opportunity → Solution → Assumption)
- Each hypothesis has a target level attribute indicating which OST level it addresses
- Hypotheses follow an ADR-style lifecycle — a single file tracks the full journey from draft to conclusion
- Rejected and invalidated hypotheses are never deleted — they are learning assets
Hypothesis Lifecycle
draft → testing → validated → adopted
                            → rejected (validated but not adopted)
                → invalidated (disproven by evidence)
                → inconclusive (evidence gathered but insufficient to confirm or deny)
                → timeout (deadline passed, decision needed: continue or stop)
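The lifecycle can be read as a small state machine. The following is a minimal sketch in Python, not part of the skill itself. It assumes that `invalidated`, `inconclusive`, and `timeout` branch from `testing`, that `adopted` and `rejected` follow `validated`, and that extending a timed-out hypothesis returns it to `testing`. The names `ALLOWED` and `can_transition` are hypothetical.

```python
# Hypothetical transition table. The branch points are an assumed reading
# of the lifecycle diagram, not a schema defined by the skill.
ALLOWED = {
    "draft": {"testing"},
    "testing": {"validated", "invalidated", "inconclusive", "timeout"},
    "validated": {"adopted", "rejected"},
    "timeout": {"testing"},  # assumption: "extend" resumes testing
    "invalidated": set(),
    "inconclusive": set(),
    "adopted": set(),
    "rejected": set(),
}


def can_transition(current: str, target: str) -> bool:
    """Return True if ALLOWED permits moving from `current` to `target`."""
    return target in ALLOWED.get(current, set())


print(can_transition("draft", "testing"))  # True
print(can_transition("draft", "adopted"))  # False
```

Keeping terminal states in the table with empty sets makes it explicit that rejected and invalidated hypotheses stay on record and are never moved elsewhere.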
Hypothesis File Schema
The authoritative schema is defined in references/hypothesis-template.md. Key fields:
- id: HYPO-NNN
- level: outcome / opportunity / solution / assumption
- status: draft / testing / validated / invalidated / inconclusive / adopted / rejected / timeout
- confidence: per-risk scores (value, usability, feasibility, viability) on 0-10 scale
- time-budget and deadline: validation time constraints
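As an illustration only, the key fields could be modelled as a Python dataclass with basic validation. This sketch covers just the fields listed here; the authoritative schema remains references/hypothesis-template.md. The class name `Hypothesis`, the use of `date` for the deadline, integer scores, and the example values are all assumptions.

```python
from dataclasses import dataclass, field
from datetime import date

LEVELS = ("outcome", "opportunity", "solution", "assumption")
STATUSES = (
    "draft", "testing", "validated", "invalidated",
    "inconclusive", "adopted", "rejected", "timeout",
)
RISKS = ("value", "usability", "feasibility", "viability")


@dataclass
class Hypothesis:
    """Hypothetical in-memory view of the key fields of a hypothesis file."""
    id: str            # e.g. "HYPO-001"
    level: str         # one of LEVELS
    status: str        # one of STATUSES
    time_budget: str   # e.g. "1w"
    deadline: date     # hard cutoff date
    confidence: dict[str, int] = field(default_factory=dict)  # risk -> 0..10

    def __post_init__(self) -> None:
        if self.level not in LEVELS:
            raise ValueError(f"unknown level: {self.level!r}")
        if self.status not in STATUSES:
            raise ValueError(f"unknown status: {self.status!r}")
        for risk, score in self.confidence.items():
            if risk not in RISKS:
                raise ValueError(f"unknown risk dimension: {risk!r}")
            if not 0 <= score <= 10:
                raise ValueError(f"score for {risk} must be 0-10, got {score}")


# Illustrative values only.
example = Hypothesis(
    id="HYPO-001",
    level="solution",
    status="draft",
    time_budget="1w",
    deadline=date(2025, 1, 8),
    confidence={"value": 5, "feasibility": 7},
)
print(example.status)  # draft
```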
Validation Criteria Requirements
Every hypothesis must define before testing begins:
- We believe that — the hypothesis statement
- We'll know we're right when — measurable success criteria
- We'll know we're wrong when — measurable failure criteria
- Validation method — how we will test (prototype, data analysis, interview, code spike, market research)
- Time budget — maximum time investment before forced decision
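The "define before testing begins" rule can be enforced with a simple completeness check. The sketch below assumes the five criteria are stored as string fields in a dictionary; the key names and the helpers `missing_criteria` and `ready_for_testing` are hypothetical, chosen to mirror the list above.

```python
# Hypothetical field keys mirroring the five required criteria.
REQUIRED = (
    "we_believe_that",
    "right_when",
    "wrong_when",
    "validation_method",
    "time_budget",
)


def missing_criteria(fields: dict[str, str]) -> list[str]:
    """Return the required keys that are absent or blank."""
    return [key for key in REQUIRED if not fields.get(key, "").strip()]


def ready_for_testing(fields: dict[str, str]) -> bool:
    """A hypothesis may leave draft only when nothing is missing."""
    return not missing_criteria(fields)


# Illustrative draft with no failure criteria yet.
draft = {
    "we_believe_that": "Users abandon signup because of the email step",
    "right_when": "Completion rate rises after removing the step",
    "validation_method": "prototype",
    "time_budget": "1w",
}
print(missing_criteria(draft))   # ['wrong_when']
print(ready_for_testing(draft))  # False
```

Treating blank strings as missing keeps a placeholder entry from passing the check.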
Time Budget and Cutoff
- Every hypothesis gets a time budget (e.g., 1d, 1w, 2w)
- A deadline sets the hard cutoff date
- When deadline passes without conclusion → status becomes timeout
- Timeout forces a decision: extend (with justification), pivot, or abandon
- Never let a hypothesis run indefinitely — unbounded exploration wastes resources
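The budget and cutoff rules above could be sketched as follows. This is an assumption-laden illustration: it treats budgets as calendar days (`d`) or seven-day weeks (`w`), derives the deadline from a start date plus the budget, and reads "deadline passes" as "today is later than the deadline". The helper names `parse_budget`, `deadline_for`, and `next_status` are hypothetical.

```python
import re
from datetime import date, timedelta

_UNIT_DAYS = {"d": 1, "w": 7}  # assumption: calendar days, 7-day weeks


def parse_budget(budget: str) -> timedelta:
    """Convert a budget such as '1d' or '2w' into a timedelta."""
    match = re.fullmatch(r"(\d+)([dw])", budget.strip())
    if match is None:
        raise ValueError(f"unsupported time budget: {budget!r}")
    amount, unit = match.groups()
    return timedelta(days=int(amount) * _UNIT_DAYS[unit])


def deadline_for(start: date, budget: str) -> date:
    """Assumed rule: deadline = start date + time budget."""
    return start + parse_budget(budget)


def next_status(status: str, deadline: date, today: date) -> str:
    """A hypothesis still in testing after its deadline becomes 'timeout'."""
    if status == "testing" and today > deadline:
        return "timeout"
    return status


deadline = deadline_for(date(2025, 1, 1), "1w")
print(deadline)                                           # 2025-01-08
print(next_status("testing", deadline, date(2025, 1, 9)))  # timeout
```

Only `testing` is checked because concluded hypotheses should keep their final status even after the deadline date.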
Confidence Update Rules
- Confidence scores are updated only when new evidence is gathered
- Record the evidence that justified each score change
- Confidence can go down as well as up — negative evidence is valid evidence
- Different risk dimensions can have different confidence levels
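One way to make these rules hard to skip is to route every score change through a function that refuses to run without evidence. The sketch below is a hypothetical design, not the skill's implementation: `Confidence`, `ScoreChange`, and `update` are invented names, scores are assumed to be integers, and the evidence strings are placeholders.

```python
from dataclasses import dataclass, field
from typing import Optional

RISKS = ("value", "usability", "feasibility", "viability")


@dataclass
class ScoreChange:
    risk: str
    old: Optional[int]  # None when the dimension had no score yet
    new: int
    evidence: str


@dataclass
class Confidence:
    scores: dict[str, int] = field(default_factory=dict)
    history: list[ScoreChange] = field(default_factory=list)

    def update(self, risk: str, new: int, evidence: str) -> None:
        """Change one dimension's score and record the evidence behind it."""
        if risk not in RISKS:
            raise ValueError(f"unknown risk dimension: {risk!r}")
        if not 0 <= new <= 10:
            raise ValueError("scores use a 0-10 scale")
        if not evidence.strip():
            raise ValueError("a score change needs recorded evidence")
        self.history.append(ScoreChange(risk, self.scores.get(risk), new, evidence))
        self.scores[risk] = new


conf = Confidence()
conf.update("value", 6, "interview notes, round 1")      # placeholder evidence
conf.update("value", 4, "usage data contradicts round 1")  # scores may drop
conf.update("feasibility", 8, "code spike result")
print(conf.scores)  # {'value': 4, 'feasibility': 8}
```

Each dimension is tracked independently, and the history preserves both upward and downward moves along with the evidence for each.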
Result Recording
When a hypothesis reaches a conclusion (validated/invalidated/inconclusive/adopted/rejected):
- Record the result in the hypothesis file with evidence
- Update confidence scores with final values
- Link to evidence (data, screenshots, prototype results, interview notes)
- Extract learnings — what did we learn regardless of the outcome?
- Update the parent Opportunity if the result changes its understanding
Key Disciplines
- Separate creation from evaluation: Don't judge hypotheses while generating them
- Seek disconfirming evidence: Actively look for reasons the hypothesis might be wrong
- One hypothesis, one test: Don't bundle multiple hypotheses into a single validation
- Record everything: Even "obvious" conclusions need recorded reasoning
- Rejected ≠ worthless: A rejected hypothesis teaches what doesn't work and why
- Inconclusive is honest: When evidence is insufficient, say so instead of forcing a verdict
Why These Disciplines Matter
Each discipline exists to counter a specific cognitive tendency:
- Separate creation from evaluation counters premature judgment that kills divergent thinking
- Seek disconfirming evidence counters the natural pull toward confirming what we already believe
- One hypothesis, one test counters the temptation to bundle tests, which makes results uninterpretable
- Time budgets with hard cutoffs counter unbounded exploration — a hypothesis without a deadline is an excuse to avoid decisions
- Confidence can go down counters the assumption that validation is always forward progress. Negative evidence is equally valuable