Agent skill
fix-sentry-issues
Use Sentry MCP to discover, triage, and fix production issues with root-cause analysis. Use when asked to fix Sentry issues, triage production errors, investigate error spikes, or clean up Sentry noise. Requires Sentry MCP server. Triggers on "fix sentry", "triage errors", "production bugs", "sentry issues".
Install this agent skill to your Project
npx add-skill https://github.com/brianlovin/agent-config/tree/main/skills/fix-sentry-issues
SKILL.md
Fix Sentry Issues
Philosophy
The Sentry error is not the problem. It's a signal.
Your goal is not to close the Sentry issue. Your goal is to discover the root cause, understand what's wrong with the application, and fix the underlying defect. Closing the Sentry issue is a side effect of doing that correctly.
Ask "Why does this fail?" — not "How do I make Sentry quiet?" Never treat log level changes as fixes. A fallback path means degraded user experience; trace why the primary path fails and fix it upstream.
Anti-patterns (do not do these)
- Batch-classifying as "expected" without investigation. Seeing a fallback does NOT mean you understand the failure. Trace the full input path.
- Treating "has a fallback" as "not a problem." Why does the primary path fail? Can we prevent it upstream?
- Combining multiple issues into one PR. Each has its own root cause. Fix individually (except when investigation proves identical cause).
- Throwing away error details. Never remove
errorfromcatch (error)or strip status codes. That data is how you understand failures. - Deciding the fix during triage. Classify as "Investigate" or "Ignore" only. You don't know the fix until investigation is complete.
Log level downgrade is valid ONLY for genuinely expected states (e.g., optional column missing, resource deleted) — NOT for failures with fallbacks.
Phase 1: Discover & Triage
Use Sentry MCP (ToolSearch first to load tools): find_organizations → find_projects → search_issues with naturalLanguageQuery: "all unresolved issues sorted by events".
Build a triage table. Action = Investigate or Ignore only:
| ID | Title | Events | Action | Reason |
|---|---|---|---|---|
| PROJ-A | Error in save | 14 | Investigate | User-facing save failure |
| PROJ-B | GM_register... | 3 | Ignore | Greasemonkey extension |
Investigate: multiple events, degraded user experience, high-volume warnings, recurring on every run.
Ignore: browser extension code, ChunkLoadError (self-resolving), single-event transients, already fixed.
Apply: mcp__sentry__update_issue(..., status: "ignored") or status: "resolved" for already-fixed.
Phase 2: Investigate (one issue at a time)
Work through these steps in order. Do not skip or batch issues.
-
Pull event-level data — Issue summaries hide details. Use
get_issue_detailsandsearch_issue_eventswithnaturalLanguageQuery: "all events with extra data". Extract: URLs, params, stack traces, status codes, timestamps. -
Cross-reference Axiom — Events have
traceId.axiom query "['shiori-events'] | where traceId == '<traceId>'" -f jsonfor surrounding context (authMethod, client_version, request metadata). -
Read the failing code path — Follow the stack trace. Read every file. Understand before proposing changes.
-
Trace the input path upstream (most often skipped, most important) — What data reaches the failing function? Should it have reached this path at all? Is there a missing filter? Is the input wrong (binary URL, redirect, bad format)? Can we prevent bad inputs upstream?
-
Reproduce — Use actual failing inputs from Sentry. Call the function with exact data.
fetch()the URLs that timed out. Verify your understanding. -
Identify root cause — Why does this input fail? Why does it reach this path? What's the right fix? (e.g., "Filter binary URLs before Firecrawl" — not "suppress the log")
| Pattern | Real Fix |
|---|---|
| External API fails on certain URLs | Filter/validate inputs before sending |
| Timeout | Investigate what's slow; adjust timeout or input size |
| DB "invalid json" | Sanitize before insert |
| Stale reference on cron | Detect staleness, auto-clean |
Phase 3: Fix
One branch per issue. git checkout main && git pull && git checkout -b fix/<descriptive-name>
- Tests first — Use data from actual Sentry events. Test fails before fix, passes after.
- Implement — Fix the root cause, not the symptom. If the fix is primarily a log level change, STOP: did you investigate why it fails, or just suppress?
- Verify — Tests pass, lint passes, fix handles actual failing inputs.
- PR — Include Root cause (upstream reason) and Fix (what changed and why it prevents the failure). Resolve in Sentry only after merge.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
bun
Use Bun instead of Node.js, npm, pnpm, or vite. Provides command mappings, Bun-specific APIs, and development patterns.
tdd
Test-driven development with red-green-refactor loop. Use when user wants to build features or fix bugs using TDD, mentions "red-green-refactor", wants integration tests, or asks for test-first development.
workflow
Workflow orchestration for complex coding tasks. Use for ANY non-trivial task (3+ steps or architectural decisions) to enforce planning, subagent strategy, self-improvement, verification, elegance, and autonomous bug fixing. Triggers: multi-step implementation, bug fixes, refactoring, architectural changes, or any task requiring structured execution.
react-doctor
Diagnose and fix React codebase health issues. Use when reviewing React code, fixing performance problems, auditing security, or improving code quality.
playwriter
Control the user's currently open Chrome tab through the Playwriter CLI (no new browser launch). Use when you need to inspect live UI state, run scripted browser actions, capture console output, or reproduce frontend issues directly in the user's tab.
deslop
Remove AI-generated code slop from the current branch. Use after writing code to clean up unnecessary comments, defensive checks, and inconsistent style.
Didn't find tool you were looking for?