Washington, District of Columbia, United States
• Led key workstreams for an end-to-end migration of Rancher Kubernetes clusters and databases from on-premises to Google Cloud under budget and ahead of renewal deadline securing savings of over $100,000.
• Engineered and maintained a suite of standardized Terraform modules, reducing manual configuration errors by over 90% and ensuring consistent, repeatable infrastructure provisioning across multiple GCP environments.
• Spearheaded critical infrastructure security and compliance efforts, directly contributing to a successful SOC2 audit by designing and testing disaster recovery failover strategies, conducting quarterly RBAC reviews, and managing the remediation of audit findings.
• Managed and resolved high-priority production incidents as a part of a 24/7 on-call rotation, improving overall system uptime by 15% through proactive quarterly maintenance and security upgrades.
• Collaborated with development teams to build and optimize CI/CD pipelines, leveraging GitLab, Helm, and ArgoCD to automate application delivery to GKE clusters and increase release velocity by 30%.
• Authored and maintained comprehensive technical documentation, including infrastructure diagrams and runbooks, to standardize operational procedures across a team of 15+ engineers.