Seattle, Washington, United States
• Owned cost-optimization roadmap for data services, removing HAProxy and refining ALB–Istio routing with distributed caching to cut costs by 30%, improve p99 latency by 15% (42 ms), and enable $3 → $2 per M calls.
• Architected blue-green deployment for Agents using Java, Kubernetes, EFS, and S3, enabling zero-downtime rollouts, improving scalability by 40%, and driving $300K+ ARR through higher availability.
• Engineered a scalable Secret Rotation Service using Key Vault, Kubernetes, and REST APIs, automating credential rotation for 1K+ apps, reducing manual effort by ~70%, and ensuring reliability for AI workloads.
• Drove end-to-end observability using Prometheus, Grafana, Kibana, and Elasticsearch, defining custom SLOs and alerting to enhance incident detection, accelerate root-cause analysis, and cut regressions by 25%.
• Standardized monitoring runbooks, alerting, and code review practices across teams, cutting debugging time and MTTR 40%, while mentoring 3 engineers to full feature ownership, boosting team delivery velocity.