Agent skills
method-development

Agent skill

method-development

Design and iterate on new research methods with structured checkpoints, baselines, and validation.

View SKILL.md on GitHub Repository

Stars 163

Forks 31

Install this agent skill to your Project

npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/development/method-development

SKILL.md

STANDARD OPERATING PROCEDURE

Purpose

Develop, refine, and validate novel methods anchored to baselines and constraints.
Apply constraint hygiene and explicit ceilings to prevent premature claims.
Keep structure-first artifacts current for handoff and reproducibility.

Trigger Conditions

Positive: creating or adapting algorithms/pipelines; designing ablations; exploring new research ideas.
Negative: pure replication (use baseline-replication) or publication packaging (research-publication).

Guardrails

HARD / SOFT / INFERRED constraint buckets (compute, data, metrics, ethics) with sources.
Two-pass refinement on designs: structure vs. baselines, then epistemic/risks.
Require baseline parity before claiming improvements; document variance sources.
Confidence ceilings enforced per claim.

Inputs

Problem statement and success metrics.
Baselines to beat and constraints (data, compute, deadlines).
Risk tolerances and evaluation protocols.

Workflow

Problem Framing: Capture objectives, constraints, and baselines; confirm INFERRED assumptions.
Design Options: Propose candidates with expected impact; map to constraints.
Experiment Plan: Define ablations, datasets, metrics, and stopping rules.
Run & Observe: Execute experiments, log configs/seeds; compare to baselines.
Validate & Iterate: Analyze results, run adversarial checks, and refine or stop.
Package: Summarize findings, risks, and next steps; store artifacts and update references/examples.

Validation & Quality Gates

Baseline beat or variance explained; claims tied to evidence with ceilings.
Ablations cover key hypotheses; failures documented.
Reproducibility assets stored (configs, logs, seeds).

Response Template

**Objective & Constraints**
- HARD / SOFT / INFERRED.

**Design Candidates**
- Option → rationale → expected impact.

**Experiment Status**
- Runs, metrics vs. baseline, issues.

**Next Steps**
- Iterate, stop, or expand.

Confidence: 0.80 (ceiling: research 0.85) - based on current evidence and validation checks.

Confidence: 0.80 (ceiling: research 0.85) - reflects validated comparisons to baselines and logged experiments.

Maintainer

majiayu000 Core maintainer

Source details

Full Name: majiayu000/claude-skill-registry
Branch: main
Path in repo: skills/development/method-development
License: MIT License

Featured Tools

Join Our Newsletter

Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.

Recommended Agent Skills

Expand your agent's capabilities with these related and highly-rated skills.

majiayu000/claude-skill-registry

agent-ops-spec

Manage specification documents in .agent/specs/. Use when user provides requirements, acceptance criteria, or feature descriptions that need to be tracked and validated against implementation.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-state

Maintain .agent state files. Use at session start, after meaningful steps, and before concluding: read/update constitution/memory/focus/issues/baseline consistently.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-spec

Manage specification documents in .agent/specs/. Use when user provides requirements, acceptance criteria, or feature descriptions that need to be tracked and validated against implementation.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-testing

Test strategy, execution, and coverage analysis. Use when designing tests, running test suites, or analyzing test results beyond baseline checks.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-testing

Test strategy, execution, and coverage analysis. Use when designing tests, running test suites, or analyzing test results beyond baseline checks.

163 31

Explore

majiayu000/claude-skill-registry

agent-ops-state

Maintain .agent state files. Use at session start, after meaningful steps, and before concluding: read/update constitution/memory/focus/issues/baseline consistently.

163 31

Explore

Didn't find tool you were looking for?

Search AI Tools

Install this agent skill to your Project

SKILL.md

STANDARD OPERATING PROCEDURE

Purpose

Trigger Conditions

Guardrails

Inputs

Workflow

Validation & Quality Gates

Response Template

Recommended Agent Skills

agent-ops-spec

agent-ops-state

agent-ops-spec

agent-ops-testing

agent-ops-testing

agent-ops-state