Developed data infrastructure, pipelines, and analytics tools for processing product usage and billing data to support strategic decision-making and finance initiatives.
Core stack: Python, SQL, Kafka, Kinesis, Snowflake, Redshift, Postgres, MySQL, Redis, Airflow, Docker, and Databricks.
• Led a project involving cross-functional teams to migrate and backfill the CRM instance with zero downtime, ensuring uninterrupted business operations.
• Automated Kafka topic deployment with integrated data quality monitoring and recency alerts to enhance data reliability and operational efficiency.
• Enabled distributed compute for Airflow workers to make ETL infrastructure more scalable, reliable, and performant; added performance monitoring, logging, and health checks.