Experience
2024 — Now
2024 — Now
Sunnyvale, California, United States
• Senior engineer on Elastic Model Serving (EMS) platform, a large-scale internal ML inference system designed to optimize for and maximally utilize opportunistic (elastic) GPU capacity; platform improvements contributed to a 2.06% increase in Meta’s global Ads Score (primary ads revenue metric) in H2 2025.
• Led a P0, foundational initiative enabling the platform to serve inference requests across heterogeneous AI hardware types, future-proofing the system against hardware fragmentation. Owned extensive redesign and alignment for the next-generation EMS platform with mixed-hardware capability, and led a team of 6+ engineers on the implementation, internal A/B testing, and rollout with 0 downtime.
• As the reliability point-of-contact of the team, drove cross-stack reliability roadmapping across EMS data and control planes; migrated all production models to isolated prod environment & organized half-long workstreams across team, achieving ~43% crash rate reduction, ~9–10% increase for traffic served on elastic capacity, ~10–12× faster reaction times, and ~200% fewer oncall alerts.
• Drove the investigation and remediation for 30+ SEVs, delivering durable, postmortem-driven fixes for revenue-critical incidents.
• Built org-wide influence via mentoring junior engineers through oncall rotations and design reviews; led technical talks on platform architecture and next-generation serving capabilities.
2023 — 2024
2023 — 2024
Menlo Park, California, United States
• Project lead in implementing the Conversation Routing product, a Messenger/Instagram business-to-consumer thread-level handover protocol that allows businesses to coordinate multiple third-party service providers they are employing for different use cases. Led a team of engineers, product/content designers, product managers & partner manager to iterate on partner feedback & ship solutions to partner pain points.
• Co-project lead in implementing VoIP calling third-party APIs on Messenger, which allows Messenger businesses to call customers / receive customer calls through third-party software to ensure operation scalability. Owned design of the real-time calling interface that integrated with first-party messaging infrastructure while meeting strict latency, reliability, and third-party extensibility requirements. Led a team of engineers to deliver test-ready beta 2-months early; closely coordinated with external enterprise customers to kickstart beta testing by end of 2024 ahead-of-schedule.
• Designed the first end-to-end testing framework for Messenger and Instagram business messaging APIs, reducing the CI runtime by >90% and making critical tests push-blocking.
2022 — 2023
2022 — 2023
Menlo Park, California, United States
• Took ownership of a cross-functional messaging platform project within the first month of joining the company, delivering features across backend APIs, web, Android, and infra tooling.
• Improved API reliability through expanded test coverage, high-scale load testing, and test flakiness detection.
• Drove privacy, security, and compliance work, ranking among top contributors on the team.
2021 — 2021
2021 — 2021
Menlo Park, California, United States
• Implemented new cloud-to-access gateway (AGW) gRPC callpath in that utilizes deterministic serializations of streamed data, a.k.a. “digests”, to intelligently sync data downstream for our subscriber-management service.
• Created kubernetes deployment of cloud microservice that manages batch updates of digests and cached data objects in SQL store, with added concurrency protection for when interacting with multiple client microservices.
• Generalized tooling (Protobuf, SQL cachestore) used in gRPC endpoints to propagate the digests pattern across services.
• Changes included in Magma v1.6 release, estimated to reduce network load from 15.7TB to 0.054TB (/month/network).
2019 — 2021
• Developed Columbia freshman orientation website with React, HTML and CSS that accumulated over 5000 unique pageviews.
• Led team and developed Columbia Housing Review web page using React/NodeJS and MySQL stack, which allows for dynamic and concurrent user content generation.
• Facilitating the migration of the Columbia Spectator website, which received over 4 million page views last year, to React in collaboration with the Washington Post.
• Created training program for junior engineers in web development technologies, e.g. React, Node, SQL, Git.
Education
Columbia University
Bachelor of Science - BS
2018 — 2022
Shenzhen Middle School
High School Diploma
2015 — 2018