Projects
Runbook Runner: Turning Static Docs into Actionable Automation
Designed and shipped a lightweight automation platform that transformed static operational documentation into guided, safe, and auditable workflows used across SRE, Security, and Product teams.
Edge Compute for Low-Latency Streaming
Built edge infrastructure across Fastly, Cloudflare, and CloudFront to keep music streaming responsive worldwide.
Rolling Out a Modern Incident Management Process
Designed and rolled out a clear, scalable incident management framework that strengthened cross-team coordination and measurably improved reliability across the platform.
Observability Governance & Internal Developer Portal
Blended log governance, cost-per-service reporting, and IDP integrations so teams own their telemetry budgets and insights.
Octobatch: Declarative Large-Scale Code Changes Across GitHub Repositories
A self-hosted tool for declaratively applying large-scale code changes across thousands of GitHub repos: preview changes, open PRs in bulk, and continuously reconcile them until merged.
Platform Automation & Observability at PlayStation
Led the Platform Automation and Observability teams powering PSNow, shrinking toil and surfacing actionable telemetry.
Service Discovery Modernization with Consul
Replaced AWS App Mesh with Consul to future-proof discovery, harden service-to-service auth, and decouple migrations.
SRE Operating Model Rollout
Codified SLOs, runbooks, and service tiers while coaching teams into an SRE mindset.