New York City Metropolitan Area
• Developing and maintaining our business-critical ETL pipeline and scheduler in Python/BigQuery, along with its React-based UI.
• Building a foundation for scaling data science, including Spark job submission and hosted JupyterLab instances on Dataproc clusters.
• Collaborating with air quality scientists to build a data quality/data visualization engine to detect anomalous behavior in our data.
• Leading an effort to reorganize our data warehouse to align more closely with industry and academic research on air quality sensor data.
• Mentoring engineers and supporting their efforts on large, impactful projects.
• Modernizing our use of Google Cloud Platform resources with Terraform and reinforcing our security posture across multiple projects.