• Developed Spark Streaming pipeline to feature engineer internal and AWS data, improving efficiency for community detection algorithm
• Fixed data serialization issues of internal data product, saving development hours and increasing reliability
• Simplified pipeline by consolidating three Spark jobs into one, alleviating need for AWS resources
• Participated in Agile methodologies, including standup and two-week sprints, to receive constant feedback