Experience
2020 — 2024
2020 — 2024
San Francisco, California, United States
• Maintained and helped scale compute and data platform used by a growing research organization of over 100 computational scientists and engineers for running ML workflows and experiments
• Led development of an artifact service that enabled researchers to register molecular artifacts with schematized custom metadata and discover and reference them for their experiments; >40k artifacts registered
• Developed pipelines for ingesting clinical and lab data into new data lakehouse
• Executed and helped strategize organization-wide migration of a lab management system, migrating tens of GBs of structured data across thousands of fields; coordinated cross-functionally with PMs, analysts, and scientists to minimize disruption
2017 — 2020
2017 — 2020
San Francisco Bay Area
• Developed and maintained de-identification pipeline used to deliver cancer patient datasets to a large pharma company; implemented using PySpark, parquet files on S3 for persistence, and AWS EMR for deployment
• Headed >1000x scaling of CDC (change data capture) service processing 30 million records/day during data migrations for large health system customers; implemented observability via New Relic and performance tuning on RabbitMQ
• Refactored an ETL pipeline, which ingests manually abstracted data and merges from disparate data sources, in order to align with major architectural update; moved persistence from S3 to AWS Aurora, automated test runs on CircleCI
• Designed and developed services to transition Django application monolith to microservices architecture deployed on a Kubernetes cluster; implemented form app backends with Flask/DynamoDB/PostgreSQL and built an interface to Auth0
2016 — 2017
2016 — 2017
Palo Alto, California
• Developed the clinical analytic reports feature, which aggregated clinical and molecular data of thousands of patients into insights for clinicians; wrote complex queries leveraging PostgreSQL jsonb and built framework to maintain them
• Implemented content migration strategy for managing versioning and updating of clinical trial and RxNorm therapy content across single-tenant customer databases
• Maintained the Syapse Application, which included writing/optimizing SQL queries, implementing REST API endpoints in a Django application, writing migrations, and writing SPARQL queries against Blazegraph (AWS Neptune)
2014 — 2015
2014 — 2015
Toronto, Canada Area
• Elicited requirements from stakeholders, developed medium fidelity prototypes, conducted user acceptance testing and wrote software requirements specifications for web application product
• Managed team projects throughout the software development lifecycle from planning to deployment
• Spearheaded implementation of new documentation and communications platform (Confluence) to resolve team collaboration issues
2013 — 2013
Western Province, Kenya
• Led needs assessments and infrastructure improvement at 4 government health facilities
• Planned and supervised campaign to treat and prevent jiggers within 3 communities spanning 51 households in Sabatia District
• Conducted research on pharmaceutical drug stock-outs in rural Kenya identifying inefficiencies within the supply chain
Education
University of Toronto
Bachelor of Applied Science (B.A.Sc.)
Hack Reactor