New York City Metropolitan Area
• Designed and built from the ground up the Ads Infrastructure Management Console, a capacity planning platform that reduced planning time by 10+ hours per cycle for pilot teams. Led frontend architecture using React/TypeScript with a step-by-step workflow that consolidated disjointed multi-page processes into single-page experiences. Platform launched to GA in Dec 2025, serving 3 pilot fleets, with rollout to all Amazon Ads teams in 2026.
• Built an automated incident enrichment system that correlates deployment events (Apollo, Lambda, MCM, weblabs) with anomalous log patterns and posts the resulting context to incident tickets. System processes 350+ tickets monthly across 150+ resolver groups, surfacing relevant change events and log anomalies to accelerate root cause identification.
• Developed an automated alarm creation platform with intelligent threshold recommendations, enabling teams to create CloudWatch alarms in under 5 minutes, versus 1+ days for manual CDK deployments. Led a monitoring gap campaign that identified 70 services lacking critical CPU/memory alarms across EC2, ECS, Elasticsearch, and ElastiCache, achieving a 97% remediation rate and closing operational blind spots before they could cause incidents.
• Re-architected a monolithic web application into a modular multi-service platform hosting 4 operational tools used by teams across Amazon Ads. Decomposition improved deployment pipeline speed, reduced test flakiness, accelerated developer velocity, and significantly decreased merge conflicts.