• Architected and delivered a scalable dynamic endpoint-discovery component for the ping monitoring
platform, enabling robust, high-availability health checks across all Workday customer tenants endpoints.
• Drove the platform’s multi-region rollout, coordinating deployment, configuration, and validation
to establish a second AWS region and improve global observability accuracy by removing cross-
region network noises.
• Architected and built Workday’s internal service-ping infrastructure, establishing a Tekton-based
self-service onboarding pipeline that streamlined adoption across teams.
• Led planning, PoC, and delivery of the Fed IL4 of the ping service. Collaborated cross-
functionally with Security, Compliance, and Federal teams to meet federal requirements and elevate
platform security posture.
• Designed, implemented, and deployed a Fluent Bit–based logging pipeline to ingest service logs
into Kibana, establishing the standard logging pattern and delivering cross-team knowledge transfer
adopted across the Observability organization.
• Resolved core conflicts between Watchdog and the customized Prometheus Operator, enabling
reliable platform-level health monitoring for the ping service and improving system stability.
• Designed and delivered a new onboarding workflow for newly acquired Workday services, enabling
them to integrate with the ping platform for tenant-level SLA calculation.
• Refined SRE alerting logic to eliminate false signals caused by manual operations during major
incidents, improving alert accuracy and ensuring more reliable MTTR pipeline calculations.
• On-call rotation responsibility for diagnosing false alerts and maintaining real-time service health
dashboard accuracy.