I’m an AI Infrastructure and Data Engineer with 5+ years of experience building scalable ETL pipelines, distributed systems, and machine learning platforms. My work focuses on event-driven architectures, cloud orchestration, CI/CD for ML, and deploying inference systems from edge to cloud.

Experience

ButlrSenior Software Engineer

2024 — Now

Optimized cloud infrastructure, reducing costs from $250k/month (8,000 sensors) to $100k/month (13,000+ sensors)

Developed AWS ECS applications using Terraform for infrastructure provisioning and Azure Kubernetes Service with Docker for efficient

containerization.

Built and deployed CI/CD pipelines for sensor occupancy applications, integrating Prometheus for monitoring and Grafana for analytics.

Engineered real-time data streaming pipelines, directing ~100GB/day to InfluxDB for analytics and AWS S3 for cost-effective storage.

Worked with big data formats (Avro, Parquet) for Machine Learning Engineering tasks.

ButlrApplied Scientist II

2022 — 2025

San Francisco Bay Area

Architected and operated 6 production Ray clusters (detections, headcount, occupancy, care, room motion,

visualizer) across multi-AZ AWS, supporting 100+ concurrent worker nodes and autoscaling from zero to 1,000

workers via Ray Autoscaler. Deployed Ray Serve for low-latency HTTP inference, running continuously in

production for 2+ years.

Designed a distributed stream processing framework using Ray’s actor model to ingest NATS message

streams, implement priority-based queuing, and dynamically schedule distributed processors—automatically

scaling concurrent workers to handle data from thousands of devices.

Built versioned Ray cluster deployment pipelines with templated configs, automated job submission, and

rollback support. Integrated full-stack observability using Prometheus, Ray’s metrics API, Grafana, and

CloudWatch with custom counters, gauges, and latency histograms.

Developed models for detection, tracking, motion detection, pose estimation and successfully deploying them end-to-end for ~5000 sensors

Achieved 92% accuracy in person detection and counting using OpenCV for low-resolution thermal data.

ButlrMachine learning Scientist

2020 — 2022

San Francisco Bay Area

Developed quantized people-tracking models for edge deployment using TensorFlow Lite on Coral Edge TPU.

Specialized in IoT communication protocols (MQTT) for seamless sensor-to-cloud data transmission, while maintaining and scaling servers.

Snap Inc.Software Engineer Intern (Computer Vision)

2020 — 2020

Los Angeles Metropolitan Area

Developed deep learning models using PyTorch with optimized backbone architectures for efficient execution on mobile devices.

Led end-to-end dataset preprocessing, model training, and evaluation.

University of Southern CaliforniaGraduate Research Assistant SAIL Lab

2019 — 2020

Greater Los Angeles Area

Working in SAIL lab in USC on a Natural language processing task: Movie summarization

Converting movie scripts into Wikipedia like plot summaries

Experience+2

Experience