Agent skill
final-release-review
Perform a release-readiness review by locating the previous release tag from remote tags and auditing the diff (e.g., v1.2.3...<commit>) for breaking changes, regressions, improvement opportunities, and risks before releasing openai-agents-js.
Install this agent skill to your Project
npx add-skill https://github.com/openai/openai-agents-js/tree/main/.agents/skills/final-release-review
SKILL.md
Final Release Review
Purpose
Use this skill when validating the latest release candidate commit (default tip of origin/main) for release. It guides you to fetch remote tags, pick the previous release tag, and thoroughly inspect the BASE_TAG...TARGET diff for breaking changes, introduced bugs/regressions, improvement opportunities, and release risks.
The review must be stable and actionable: avoid variance between runs by using explicit gate rules, and never produce a BLOCKED call without concrete evidence and clear unblock actions.
Quick start
- Ensure repository root:
pwdβpath-to-workspace/openai-agents-js. - Sync tags and pick base (default
v*):bashBASE_TAG="$(.agents/skills/final-release-review/scripts/find_latest_release_tag.sh origin 'v*')" - Choose target commit (default tip of
origin/main, ensure fresh):git fetch origin main --prunethenTARGET="$(git rev-parse origin/main)". - Snapshot scope:
bash
git diff --stat "${BASE_TAG}"..."${TARGET}" git diff --dirstat=files,0 "${BASE_TAG}"..."${TARGET}" git log --oneline --reverse "${BASE_TAG}".."${TARGET}" git diff --name-status "${BASE_TAG}"..."${TARGET}" - Deep review using
references/review-checklist.mdto spot breaking changes, regressions, and improvement chances. - Capture findings and call the release gate: ship/block with conditions; propose focused tests for risky areas.
Deterministic gate policy
- Default to π’ GREEN LIGHT TO SHIP unless at least one blocking trigger below is satisfied.
- Use π΄ BLOCKED only when you can cite concrete release-blocking evidence and provide actionable unblock steps.
- Blocking triggers (at least one required for
BLOCKED):- A confirmed regression or bug introduced in
BASE...TARGET(for example, failing targeted test, incompatible behavior in diff, or removed behavior without fallback). - A confirmed breaking public API/protocol/config change with missing or mismatched versioning and no migration path (for example, patch release for a breaking change).
- A concrete data-loss, corruption, or security-impacting change with unresolved mitigation.
- A release-critical packaging/build/runtime path is broken by the diff (not speculative).
- A confirmed regression or bug introduced in
- Non-blocking by itself:
- Large diff size, broad refactor, or many touched files.
- "Could regress" risk statements without concrete evidence.
- Not running tests locally.
- If evidence is incomplete, issue π’ GREEN LIGHT TO SHIP with targeted validation follow-ups instead of
BLOCKED.
Workflow
- Prepare
- Run the quick-start tag command to ensure you use the latest remote tag. If the tag pattern differs, override the pattern argument (e.g.,
'*.*.*'). - If the user specifies a base tag, prefer it but still fetch remote tags first.
- Keep the working tree clean to avoid diff noise.
- Run the quick-start tag command to ensure you use the latest remote tag. If the tag pattern differs, override the pattern argument (e.g.,
- Assumptions
- Assume the target commit (default
origin/maintip) has already passed$code-change-verificationin CI unless the user says otherwise. - Do not block a release solely because you did not run tests locally; focus on concrete behavioral or API risks.
- Release policy: routine releases use patch versions; use minor only for breaking changes or major feature additions. Major versions are reserved until the 1.0 release.
- Assume the target commit (default
- Map the diff
- Use
--stat,--dirstat, and--name-statusoutputs to spot hot directories and file types. - For suspicious files, prefer
git diff --word-diff BASE...TARGET -- <path>. - Note any deleted or newly added tests, config, migrations, or scripts.
- Use
- Analyze risk
- Walk through the categories in
references/review-checklist.md(breaking changes, regression clues, improvement opportunities). - When you suspect a risk, cite the specific file/commit and explain the behavioral impact.
- For every finding, include all of:
Evidence,Impact, andAction. - Severity calibration:
- π’ LOW: low blast radius or clearly covered behavior; no release gate impact.
- π‘ MODERATE: plausible user-facing regression signal; needs validation but not a confirmed blocker.
- π΄ HIGH: confirmed or strongly evidenced release-blocking issue.
- Suggest minimal, high-signal validation commands (targeted tests or linters) instead of generic reruns when time is tight.
- Breaking changes do not automatically require a BLOCKED release call when they are already covered by an appropriate version bump and migration/upgrade notes; only block when the bump is missing/mismatched (e.g., patch bump) or when the breaking change introduces unresolved risk.
- Walk through the categories in
- Form a recommendation
- State BASE_TAG and TARGET explicitly.
- Provide a concise diff summary (key directories/files and counts).
- List: breaking-change candidates, probable regressions/bugs, improvement opportunities, missing release notes/migrations.
- Recommend ship/block and the exact checks needed to unblock if blocking. If a breaking change is properly versioned (minor/major), you may still recommend a GREEN LIGHT TO SHIP while calling out the change. Use emoji and boldface in the release call to make the gate obvious.
- If you cannot provide a concrete unblock checklist item, do not use
BLOCKED.
Output format (required)
All output must be in English.
Use the following report structure in every response produced by this skill. Be proactive and decisive: make a clear ship/block call near the top, and assign an explicit risk level (LOW/MODERATE/HIGH) to each finding with a short impact statement. Avoid overly cautious hedging when the risk is low and tests passed. Always use the fixed repository URL in the Diff section (https://github.com/openai/openai-agents-js/compare/...). Do not use ${GITHUB_REPOSITORY} or any other template variable. Format risk levels as bold emoji labels: π’ LOW, π‘ MODERATE, π΄ HIGH. Do not use Markdown links in the report. Keep the Diff URL as plain text, and cite repository files as plain text paths with optional line numbers.
Every risk finding must contain an actionable next step. If the report uses **π΄ BLOCKED**, include an Unblock checklist section with at least one concrete command/task and a pass condition.
### Release readiness review (<tag> -> TARGET <ref>)
This is a release readiness report done by `$final-release-review` skill.
### Diff
https://github.com/openai/openai-agents-js/compare/<tag>...<target-commit>
### Release call:
- **<π’ GREEN LIGHT TO SHIP | π΄ BLOCKED>** <one-line rationale>
### Scope summary:
- <N files changed (+A/-D); key areas touched: ...>
### Risk assessment (ordered by impact):
1) **<Finding title>**
- Risk: **<π’ LOW | π‘ MODERATE | π΄ HIGH>**. <Impact statement in one sentence.>
- Evidence: <specific diff/test/commit signal; avoid generic statements>
- Files: <path(s)>
- Action: <concrete next step command/task with pass criteria>
2) ...
### Unblock checklist (required when Release call is BLOCKED):
1. [ ] <concrete check/fix>
- Exit criteria: <what must be true to unblock>
2. ...
### Notes:
- <working tree status, tag/target assumptions, or re-run guidance>
If no risks are found, include a βNo material risks identifiedβ line under Risk assessment and still provide a ship call. If you did not run local verification, do not add a verification status section or use it as a release blocker; note any assumptions briefly in Notes. If the report is not blocked, omit the Unblock checklist section.
Resources
scripts/find_latest_release_tag.sh: Fetches remote tags and returns the newest tag matching a pattern (defaultv*).references/review-checklist.md: Detailed signals and commands for spotting breaking changes, regressions, and release polish gaps.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
openai-knowledge
Use when working with the OpenAI API (Responses API) or OpenAI platform features (tools, streaming, Realtime API, auth, models, rate limits, MCP) and you need authoritative, up-to-date documentation (schemas, examples, limits, edge cases). Prefer the OpenAI Developer Documentation MCP server tools when available; otherwise guide the user to enable `openaiDeveloperDocs`.
examples-auto-run
Run examples:start-all in auto mode with parallel execution, per-script logs, and start/stop helpers.
implementation-strategy
Decide how to implement runtime and API changes in openai-agents-js before editing code. Use when a task changes exported APIs, runtime behavior, schemas, tests, or docs and you need to choose the compatibility boundary, whether shims or migrations are warranted, and when unreleased interfaces can be rewritten directly.
docs-sync
Analyze main branch implementation and configuration to find missing, incorrect, or outdated documentation in docs/. Use when asked to audit doc coverage, sync docs with code, or propose doc updates/structure changes. Only update English docs (docs/src/content/docs/**) and never touch translated docs under docs/src/content/docs/ja, ko, or zh. Provide a report and ask for approval before editing docs.
pnpm-upgrade
Keep pnpm current: run pnpm self-update/corepack prepare, align packageManager in package.json, and bump pnpm/action-setup + pinned pnpm versions in .github/workflows to the latest release. Use this when refreshing the pnpm toolchain manually or in automation.
runtime-behavior-probe
Plan and execute runtime-behavior investigations with temporary TypeScript probe scripts, validation matrices, state controls, and findings-first reports. Use only when the user explicitly invokes this skill to verify actual runtime behavior beyond normal code-level checks, especially to uncover edge cases, undocumented behavior, or common failure modes in local or live integrations. A baseline smoke check is fine as an entry point, but do not stop at happy-path confirmation.
Didn't find tool you were looking for?