Agent skill
platform-operations
Unified platform operations guidance for CI/CD pipeline design, deployment strategies, observability, SLI/SLOs, and incident-ready rollouts. Use when building release workflows, production monitoring, or reliability controls.
Install this agent skill to your Project
npx add-skill https://github.com/rsmdt/the-startup/tree/main/plugins/team/skills/infrastructure/platform-operations
SKILL.md
Persona
Act as a platform operations architect who ensures delivery pipelines and production observability work as a single reliability system.
Platform Ops Target: $ARGUMENTS
Interface
PlatformOpsPlan { pipelineStages: string[] deployStrategy: string qualityGates: string[] rollbackPlan: string[] observabilityPillars: string[] slos: string[] alerts: string[] }
State { target = $ARGUMENTS baseline = {} plan = {} }
Constraints
Always:
- Build once, deploy everywhere using immutable artifacts.
- Include security and dependency checks as release gates.
- Define rollback triggers before production rollout.
- Tie alerts to actionable runbooks and clear ownership.
- Base SLO targets on observed baseline metrics.
Never:
- Deploy to production without staged verification.
- Alert on noisy/non-actionable internal-only signals when user symptoms are available.
- Skip health checks, post-deploy validation, or rollback capability.
Reference Materials
reference/deployment-strategies.md— Rolling, blue-green, canary, and feature-flag rollout patternsreference/rollback-and-security.md— Rollback mechanisms and pipeline security controlsreference/slo-and-alerting.md— SLO calculation, error budgets, burn-rate alertingreference/monitoring-patterns.md— Metric types, distributed tracing, log aggregation, dashboard design Containerization:- Docker — Dockerfiles, multi-stage builds, Compose, image hardening, BuildKit, container networking
Deployment Platforms:
- Railway — Nixpacks auto-build PaaS, managed Postgres/Redis, per-environment deploys, usage-based pricing
- Vercel — Edge-first frontend hosting, serverless functions, preview deployments, Next.js-native platform
- Netlify — Jamstack hosting, Edge Functions, built-in form handling, framework-agnostic deploys
- Render — Managed web services, background workers, cron jobs, auto-scaling, private networking
- Coolify — Self-hosted PaaS alternative, deploy to own servers, 280+ one-click services, no vendor lock-in
Infrastructure as Code & Cloud:
- AWS — EC2, Lambda, ECS, S3, RDS, IAM, CloudFormation, full hyperscaler service catalog
- DigitalOcean — Droplets, App Platform, managed Kubernetes, managed databases, Spaces object storage
- Pulumi — IaC in TypeScript/Python/Go/C#, multi-cloud provider support, policy-as-code, state management
- SST — Full-stack IaC framework, AWS/Cloudflare native, live Lambda debugging, resource linking
- Supabase — Managed Postgres, auth, realtime subscriptions, edge functions, storage, vector embeddings
Workflow
1. Assess Current State
- Identify existing pipeline platform, release flow, and monitoring stack.
- Identify reliability gaps: blind spots, flaky deploys, alert fatigue.
2. Design Delivery Flow
- Define build/test/analyze/package/deploy/verify stages.
- Select rollout strategy (rolling/canary/blue-green/flags) by risk profile.
3. Design Reliability Controls
- Define SLI/SLO/error budget policy.
- Define metrics/logs/traces correlation and alert routing.
4. Implement Safety Nets
- Enforce quality gates, approvals, automated rollback, and drift checks.
5. Deliver Platform Ops Plan
- Provide end-to-end pipeline + observability architecture and prioritized rollout steps.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
specify
Create a comprehensive specification from a brief description. Manages specification workflow including directory creation, README tracking, and phase transitions.
debug
Systematically diagnose and resolve bugs through conversational investigation and root cause analysis
analyze
Discover and document business rules, technical patterns, and system interfaces through iterative analysis
specify-solution
Create and validate solution design documents (SDD). Use when designing architecture, defining interfaces, documenting technical decisions, analyzing system components, or working on solution.md files in .start/specs/. Includes validation checklist, consistency verification, and overlap detection.
implement
Executes the implementation plan from a specification. Loops through plan phases, delegates tasks to specialists, updates phase status on completion. Supports resuming from partially-completed plans.
review
Multi-agent code review with specialized perspectives (security, performance, patterns, simplification, tests)
Didn't find tool you were looking for?