New York, New York, United States
• Designing and building a self-serve data platform for near-real-time database replication via change data capture (CDC) into an Iceberg lakehouse -- foundational infra powering analytics, feature engineering, and ML workloads across Square, Cash App, and Afterpay.
• Led the re-architecture of the platform's orchestration layer onto Temporal, bringing durable execution and reliable long-running workflow guarantees to data movement pipelines.
• Built the Bronze layer of a Medallion architecture lakehouse archiving Square, Cash App, and Afterpay events into Iceberg, serving as the canonical source for downstream analytics and ML feature pipelines. Led the Iceberg Kafka Connector evaluation and POC, designed the schema, and implemented SMTs with custom Avro/Protobuf converters to unify heterogeneous payloads, and instrumented the connector with metrics and alerting for production readiness.
• Extended the platform to support DynamoDB as a CDC source by designing and building an in-house DynamoDB Streams connector.