• Designed and maintained high-throughput analytical services built on gRPC, ClickHouse, AWS, and Kubernetes, supporting complex SQL generation for customer-facing analytics and reporting queries.
• Led query optimization, benchmarking multiple execution strategies and resource configs; improved critical analytics query 5.7x faster and reduced memory usage by 2x under the same concurrent load.
• Developed a query performance testing framework and production-like load testing infrastructure (Locust + synthetic workloads) to validate performance at scale before release.
• Delivered async service clients and concurrency improvements; achieved ~2x throughput gains.
• Implemented and productionized customer-facing analytics features: multi-touch attribution, conversion reporting, portfolio reports, and cohort reporting.
• Designed and implemented automated integration tests in CI pipelines to validate end-to-end service behavior (gRPC APIs, SQL generation, and query correctness), significantly reducing production regression risk and improving release confidence.
• Built Airflow DAG pipelines to automate data warehouse hygiene and lifecycle management.
• Led incident mitigation and reliability engineering, including large-scale data cleanup (300M+ rows) in distributed ClickHouse clusters, EBS volume upgrades, dashboards, alerting, and operational runbooks.