I have more than 10+ years experience in programming with an emphasis on performance and scalability and 8+ years experience helping organizations get the most out of their data, by using a tool-kit comprising; machine learning, semantic modeling, natural language processing and graph analytics.
Experience
2023 — Now
2023 — Now
San Jose, California, United States
2020 — 2022
2020 — 2022
Oakland, California, United States
As a Principal Software Engineer at Marqeta, I exercised significant technical and strategic leadership in enhancing system performance, reliability, and scalability for critical high-speed, high-volume payment processing infrastructure. I provided hands-on engineering guidance, drove impactful technical initiatives from conception to delivery, and fostered cross-functional collaboration to deliver high-impact solutions in a demanding real-time environment.
▪ Led efforts to establish comprehensive performance baselines and continuous monitoring, providing crucial visibility for data-driven optimization and proactive issue detection.
▪ Drove strategic technical roadmaps by identifying and prioritizing key initiatives through close collaboration with engineering teams, product managers, and business stakeholders.
▪ Provided hands-on engineering leadership in upgrading and tuning service middleware, doubling application layer processing thresholds and boosting system throughput.
▪ Led a high-impact initiative to diagnose and mitigate database-related production outages, which were identified as a primary culprit for system instability. This involved extensive data gathering, root cause analysis, and the successful initiation and oversight of multiple strategic projects aimed at long-term database performance and resilience improvements.
▪ Collaborated and led cross-functional efforts to implement an L7 caching layer, significantly enhancing performance, connection management, and back-pressure to prevent failures.
▪ Co-invented and contributed to the patent: "Parallel processing in idle states of a transaction processing protocol." This innovation reflects thought leadership and significantly improved the efficiency and speed of high-volume transaction validation by leveraging idle system periods for parallel processing of independent components, addressing critical bottlenecks in real-time payment systems.
2018 — 2020
2018 — 2020
San Francisco Bay Area
▪ Architected a unified ML & analytics data platform in Google BigQuery, consolidating multiple federated Cassandra and Elasticsearch clusters into a single “source of truth.”
▪ Built real-time streaming pipelines using Confluent Platform (Apache Kafka), stitching together disparate data sources and performing on-the-fly feature engineering for downstream analytics.
▪ Developed a data-quality reporting framework in BigQuery, generating automated health-check metrics and dashboards to monitor ingestion accuracy and completeness.
▪ Delivered self-service reporting via Looker API and, when needed, a custom microservices layer—optimizing for both functional requirements and cost efficiencies.
▪ Implemented advanced entity-resolution workflows, combining unsupervised clustering (e.g., DBSCAN/K-Means) for candidate identification with supervised classification models to accurately detect duplicate user records.
▪ Mentored and onboarded junior engineers, establishing best practices in data-driven development, CI/CD pipelines, code reviews, and scalable system design.
2017 — 2018
2017 — 2018
San Francisco Bay Area
▪ Engineered ML Feature Pipelines from candidate resumes, applying GloVe word embeddings for skills and graph-analytics metrics to model job-transition behaviors.
▪ Evolved Core Matching Engine, refining algorithms that align job descriptions with qualified seekers to improve placement accuracy.
▪ Led Elasticsearch Upgrade & Keyword Standardization, overhauling index schemas and query analyzers to boost search relevancy and result quality.
▪ Optimized Index Architecture by splitting real-time and analytics indexes—cutting 90th-percentile query latency from 11s to 3s.
▪ Championed Software Best Practices, authoring development standards, code review guidelines, and CI/CD enhancements to raise engineering quality.
2013 — 2017
San Francisco Bay Area
▪ Unified & Modernized Data: Led 10+ enterprise data modernization initiatives, consolidating opaque, federated sources into a self-describing property graph with built-in terminology engines and data dictionaries.
▪ Graph ↔ Semantic Triple Mapping: Designed and built a bi-directional mapping framework between property graphs and RDF triples, enabling seamless consumption of Schema.org and other linked-data assets.
▪ Smart Kitchen Knowledge Graph: Architected and implemented a scalable knowledge graph for a smart-kitchen startup using Spark GraphFrames, DataStax Enterprise Graph, Apache TinkerPop, and Linked-Data principles—driving contextual product and usage insights.
▪ ML Democratization: Spearheaded efforts to bring machine learning capabilities to teams lacking dedicated ML engineers by deploying AutoML pipelines, standardizing feature-engineering libraries, and running hands-on workshops.
▪ End-to-End ML Delivery: Delivered 15+ machine learning solutions—leveraging XGBoost, deep neural networks, distributed random forests, and regression models—to tackle diverse business use cases from predictive maintenance to demand forecasting.
Education
India
Bachelor of Engineering - BE
2000 — 2004