• Responsible for building, automating, and maintaining hybrid cloud environments spanning AWS and on-premises data centers, for both monolithic and microservice applications. Scope included deployments, CI/CD pipelines, release management with Jenkins and Bamboo, and a 24/7 automated monitoring platform with alerting and real-time visualization of the current state of applications, VMs/instances/servers, databases, and clusters of Storm, Mesos, RabbitMQ, Elasticsearch, Spark, Redis, MongoDB, and others.
• Primary on-call support for production issues across a wide range of technologies and databases.
• Implemented end-to-end service and host monitoring using Prometheus, Grafana, Alertmanager, PagerDuty, and Slack integrations: system monitoring with Prometheus plugins and log event monitoring with ELK and AWS CloudWatch metrics, graphed in Grafana, with alerts managed by Alertmanager and sent to PagerDuty and/or Slack.
• Wrote automation and configuration management with Chef and Ansible, covering new automation and the migration of legacy Chef scripts to Ansible.
• Designed solutions for running a stack of applications on or alongside Mesos, Marathon, Storm, Kafka, Cassandra, and Spark.
• Responsible for seamless online migrations of entire stacks, including terabytes of data across different databases, to and from AWS and local data centers.
• Day-to-day maintenance, deployment, and troubleshooting for application delivery in all environments: DEV, QA, Staging, and Production.