Sunnyvale, California, United States
I’m working on Google’s logging infrastructure, a distributed large-scale system that allows for the collection, storage, and management of business-critical event records for all Google products, and aiming to make them cost effective and accessible for business insights, in a manner that respects user privacy and complies with policy.
Project experiences on
Logs auditing: Designed and built a large scale Flume pipeline, which ingests 1% logging data of all Google’s products (~5PiB) and scans them against Google’s privacy policies daily.
Logs Access: Designed and implemented read access control of logs, to satisfy read permission management of data with different levels of privacy and security requirements.
Logs Launch Privacy: Design and built logs launch workflow with logs privacy review, which ensures logging plan compliance with Google’s logging policies.