Greater New York City Area
2014-2015:
* Assist in building data infrastructure
* Work on ETL pipelines
* Developed open source workflow management software Dagobah
* Tech including Vertica, hadoop, S3, Exact Target, Pig, with my main development language being Python and occasional Java.
* Worked on large data aggregations for ExactTarget/Salesforce marketing cloud
2016:
* Infrastructure stabilized
* Legacy ETL maintenance
* Shifting towards data science
* Churn prediction
2017:
* Additional churn prediction modeling and cost impact reports
* Conversion prediction modeling
* User segmentation exploratory analysis
* Technologies: Sklearn/pandas/numpy, spark/sparkSQL, scala/scalding, zeppelin
* Helped search team improve search result relevance
2018:
* As of late 2017 we started the Applied Machine Learning team at Vimeo, focused on deploying machine learning models and technologies to production to impact various problems at Vimeo
* Fraud Detection
* Visual classification
* Dockerized classification platform
* Deployment to Google Cloud Platform
* Using GCP technologies such as GKE, Pub/Sub, Datastore
* POC for a hardware embedded realtime object detection using Convolutional Neural Networks (CNN)
* Research for marketing attribution models
* Modeled, architected, and deployed a visual search API for Vimeos new stock video service, utilizing GKE, keras, flask, gunicorn, and ANNOY, handling production traffic >1000 TPS
* Automated and deployed pipeline for visual search and visual tagging for new uploads in near-real time