Boulder, Colorado, United States
• Developed & maintain a data pipeline status application, built from microservices in AWS, that ensures the quality of 40 billion data records each day & coordinates downstream processing of that data in real time
• Manage big data applications & architectures that orchestrate the entire data lifecycle, using Kafka, Hadoop, AWS services, SQL & NoSQL databases, & Airflow
• Rearchitected a legacy data model, built for a single dataset, to manage more than 10 datasets, increasing data processing efficiency by over 20%
• Migrated multiple data applications to AWS, decreasing monthly storage & processing costs by $50,000