Experience
2019 — Now
New York, New York, United States
tech lead in the AWS Analytics Open Data Analytics org. Our org is responsible for developing and maintaining Trino and Spark query engines.
I support an org of 40 across 4 teams and provide technical guidance and leadership .
Responsible for roadmap planning, design reviews, critical code reviews, owning critical portions of new code, leading multi year projects that span 8+ engineers, providing guidance across projects on my team and on sister teams, assisting with yearly talent/perf reviews, creating and driving proposals for new work, and ensuring that the teams in my org maintain a high engineering culture.
Things achieved during my time at Amazon:
* created a protocol within Trino that allows users to create secure views, and obfuscates portions of the query plan that they don’t have access to. Since these are secure views, the underlying implementation is considered to be privileged, so I created a protocol that ensures user predicates created outside the view do not leak information about details within the view.
* designed and developed a new predicate pushdown protocol. Lead to a reduction of 50% in TPCDS benchmarking with some queries seeing a 95% reduction.
* refactored and designed a new UDF protocol by adding new query plan rules to detect UDFs in the query plan.
* refactored existing Fine Grained Access Control implementation and designed a new protocol. Using this new design, I created a 2 year engineering roadmap for future FGAC enhancements.
* define yearly engineering roadmap by partnering with PM’s and engineering managers.
* led the adoption of Glue Connections as a centralized store for connection metadata. Wrote a proposal and presented it to the Athena GM. Got buy in across two orgs, and lead the implementation.
* engineer number 1 on the query federation team. I launched the feature into GA with 10 connectors and helped grow the team from 1 to 10+. Today we support over 30+ connectors
2018 — 2018
New York, New York
• Interned at Amazon’s Manhattan office within the AWS Elastic MapReduce(EMR) and AWS Athena service teams.
• Worked with a principal engineer to design and develop two optimizations to PrestoDB, an open source distributed SQL query engine used in Athena.
• The first optimization is a customer facing feature called partition projection. https://docs.aws.amazon.com/athena/latest/ug/partition-projection.html
• The second optimization was related to increasing throughput for split generation within Presto. Presto had a hard time with S3 buckets that contained millions of small (few KB) of files and my optimization decreased query runtimes by over 95% in certain cases.
• Optimizations decreased processing times for specific queries by up to 95%.
• Both optimizations are forecasted to directly decrease costs by $X mm per year. Exact number is confidential.
• The nature of the optimization required me to do an extensive deep dive into PrestoDB in order to understand it's architecture.
• Received mentorship from many incredibly talented engineers including a world class principal engineer.
2017 — 2017
2017 — 2017
Seattle, Washington, United States
• Interned within Amazon's Simple Storage Solutions (S3) which is the leading cloud storage provider in the market.
• I created a regional ops dashboard that forecasted capacity usage for S3’s index servers. The dashboard replaced a manual process that originally used excel spreadsheets.
• Interacted and collaborated with multiple engineers and managers.
• Participated in daily stand ups and learned about the SCRUM process.
• Leveraged Amazon's internal technology stack, Spring, MySQL, and Java
Education
University of Minnesota
Mathematics and Computer Science
2015 — 2019
The University of British Columbia
Biochemistry
2012 — 2014