Experience
2024 — Now
2024 — Now
Santa Monica, California, United States
Worked at Snap’s DeviceLab team. Managed a complex hybrid infrastructure of 3,000 Linux/macOS servers and 12,000 mobile devices/emulators across on-premise data centers and AWS, supporting 900,000 daily tests for teams including Autopilot, Lenses, and Snap Lab.
Developed a distributed cache system using Varnish, reducing data center bandwidth usage by 57%, enhancing network performance, and lowering operational costs.
Achieved 100% error categorization of internal errors within the Test Platform across Autopilot and DeviceCloud. Successfully identified and attributed various timeout issues to user tests or platform slowness, including iOS crash processing and Android APK installation complications.
Successfully launched a new DescribeDevices API, facilitating the DevProd team's migration to SnapCI. Improved the API's performance dramatically, reducing response time from 23 seconds to 3 milliseconds.
Evaluated and prototyped Vision Language Models (VLMs) and multimodal LLMs using the DroidRun framework to automate complex mobile device configuration and setup, significantly reducing manual intervention in the device lifecycle.
Architected an automated CI Deflaker service leveraging BigQuery and generative AI to detect, triage, and perform root-cause analysis on flaky tests, integrating the pipeline with Jira and Slack for automated ticket generation and incident reporting.
Redesigned the server deployment workflow to eliminate test interruptions during package upgrades by implementing a sophisticated drain/un-drain logic and fine-tuned Ansible parallelization strategies to maintain consistent device capacity during maintenance.
Optimized development velocity by integrating Cursor and Claude Code into the SDLC and deploying Model Context Protocol (MCP) agents to automate alert triaging and cross-tool coordination.
Created a new Golang gRPC API for DeviceCloud, providing customers with detailed health check failure information.
2022 — 2023
2022 — 2023
Santa Monica, California
2018 — 2022
2018 — 2022
Santa Monica, California
Worked in Hulu’s Cloud and Platform org, which manages the platform and infrastructure used by Hulu engineers to develop, build, test, and release software.
I worked in 3 different teams: Developer Tools team, Production Engineering team, and Observability Platform team.
2016 — 2018
2016 — 2018
Morrisville, North Carolina
Worked in a DevOps team (EngIT-CI) that provides continuous integration (CI) and delivery (CD) services within the company.
2015 — 2016
2015 — 2016
Milpitas, California
Built JavaScript pages and implemented backend Java APIs in WebEx Super Admin web application.
Education
William & Mary
Master of Science (MS), Marine Sciences
2011 — 2013