Redmond, Washington, United States
• Owner of distributed data lifecycle systems processing 12+ exabytes across all Microsoft
365 enterprise and consumer workloads
• Designed and scaled distributed deletion pipelines purging 1.3 EB/month of
user‐deleted data across worldwide farms, using Azure Job Queues in a
producer‐consumer pattern with redundant safety systems ensuring zero
accidental data loss
• Architected high‐throughput compression workflows delivering
$4.5M/month in cost savings across 125 PB/month, including backlog
processing of 211 PB of previously uncompressed data
• Built distributed orphan collection system using fault‐tolerant protocols
across geo‐distributed clusters, recovering $2.9M in storage costs (80 PB
total)
• Designed account lifecycle deletion pipeline achieving $1.8M/month
savings (50 PB/month) with zero data loss across billions of objects
• Redesigned data integrity scanning using distributed priority queue
architecture, improving SLA coverage from 86.81% → 99.86%
• Engineered fault‐tolerant failover systems for region‐down scenarios,
ensuring high availability for critical data pipelines across multiple
datacenters
• Architected safety‐critical data recovery mechanism achieving 100%
success rate for disaster recovery, safeguarding exabytes of production data
• Led infrastructure efficiency strategy, designing cost models for single‐LRS
deletion and root blob compression that captured $8M+ in annual savings
• Extended data integrity systems to support next‐gen storage platform
migration, ensuring correctness guarantees during live data movement at
exabyte scale
• Scaled org capacity by hiring and mentoring engineers across 4 systems as
service grew 5x
• As the designated AI Coding Champion of the org, leading adoption of
AI‐assisted development and conducting org‐wide seminars