Experience
2024 — Now
2024 — Now
San Francisco, California
• Develop and maintain a service that monitors AWS quotas and automatically submits increase requests across Salesforce Hyperforce, ensuring uninterrupted scalability for global customers
• Built and operate a deployment risk API used by Salesforce’s deployment service to determine whether production rollouts can proceed safely, reducing risk of service-impacting changes
• Provide technical support through internal channels, debugging complex infrastructure issues, guiding engineers, and expanding service coverage to GovCloud and GCP
2022 — 2024
2022 — 2024
San Francisco, California
• Served as Incident Commander during high-severity outages, coordinating response across teams and reducing mean time to recovery (MTTR) for mission-critical services
• Designed and implemented dashboards for key services to improve operational visibility, enabling faster diagnosis of availability and latency issues
• Led post-incident reviews and introduced preventative measures to reduce recurrence of high-impact incidents
2021 — 2022
2021 — 2022
San Francisco, California
• Led “Game Day” fault-injection exercises to uncover service availability vulnerabilities, improving resiliency across multiple Salesforce applications
• Developed AWS-based fault injections (e.g., EC2 termination, network ACL blocking) using Terraform and Spinnaker, reducing single points of failure
• Partnered with teams across Salesforce to implement findings from chaos experiments, strengthening disaster recovery capabilities
2020 — 2020
2020 — 2020
San Francisco, California
• Built a proof-of-concept chaos engineering tool for Kubernetes services, enabling automated fault injection for resilience testing
• Implemented gRPC-based communication between backend and injector using Go and Kubernetes client libraries
2019 — 2019
2019 — 2019
San Francisco, California
• Created a CLI-based support chatbot for internal build tools, leveraging NLP techniques to parse developer queries and recommend relevant documentation
• Applied deep learning models to analyze logs, automatically diagnose common failures, and suggest recovery actions
Education
University of California, Berkeley