Trust & Safety Measurement Platform
• Designed probabilistic uncertainty framework: bootstrapped labeling errors across experiments to construct empirical error distributions and derive confidence intervals. Analysis proved zero statistical power in a 5-year measurement initiative, leading to project cancellation and reallocation of 13 engineers (~50 eng-weeks/year saved)
• Owned upsampling system processing 2B+ samples/day, formulated as convex optimization over quality constraints and ML prediction scores to minimize estimation variance under cost targets
• Built real-time proxy metrics integrated into ranking systems: 20% lift in a P1 KPI, 650% increase in operational throughput, enabled $14M efficiency estimation across ranking experiments
• Built offline ML platform reducing model onboarding from 2 weeks to 2 hours and experiment iteration from 30 min to 2 min
• Optimized ML serving via caching/batching redesign: −20% compute, −30% p95 latency ($250k/yr savings)
Led a 10-engineer Better Engineering workstream: $4M/yr infra savings, test coverage 30% → 89%