Tech Stack
AnsibleAWSAzureCloudDockerElasticSearchGoogle Cloud PlatformKafkaKubernetesLinuxPostgresPrometheusRabbitMQRedisTerraform
About the role
- Lead and expand the Clients Cloud Operations team in Europe, focusing on growth and hiring.
- Drive cloud infrastructure implementation and optimization across AWS, GCP, and Azure for reliability, security, and scalability.
- Manage and optimize middleware services, including PostgreSQL, Ignite, Redis, Elasticsearch, Flink, ClickHouseDB, RabbitMQ, and Kafka.
- Serve as the primary escalation point for production issues during EU daytime hours; ensure timely resolution and root cause analysis.
- Resolve customer-reported issues and production incidents in collaboration with Support, Development, and QA teams.
- Provide technical leadership on cloud infrastructure design, performance tuning, and operational improvements.
- Ensure production operations meet security and compliance standards (ISO 27001, 27017, 27701, SOC 2, StateRAMP, FedRAMP).
- Review new service designs and deployments for security and compliance prior to production rollout.
- Coordinate with remote teams in the US, Canada, China, and India to deliver continuous 24x7 cloud service operations.
- Scale the Cloud Operations team by developing domain experts and future leaders.
- Analyze service performance, identify bottlenecks, and implement improvement plans.
- Improve service availability, scalability, and monitoring through automation, tooling, and process enhancements.
- Foster cross-regional collaboration to maintain operational excellence across time zones.
Requirements
- BS level technical degree required; Computer Science or Engineering background preferred.
- 8+ years of experience in a CloudOps / DevOps role and 5+ years of experience in a manager role.
- An ability to drive to big picture goals while valuing and maintaining a strong attention to detail.
- Experience architecting infrastructure for highly available, highly scalable cloud services.
- Hands-on experience with AWS or any public cloud (Azure, GCP etc).
- Knowledge of Linux, security, and networking fundamentals.
- Working knowledge of container-based architecture and deployment (Docker, Kubernetes).
- Working knowledge of deployment automation development (Ansible, Terraform, Helm).
- Experience in diagnosing and resolving complex application problems.
- Working knowledge of Elasticsearch, PostgreSQL, Redis, Ignite and RabbitMQ.
- Experience with monitoring tools (Nagios, Kibana, Prometheus).
- Experience with cloud security and compliance implementation.
- Strong sense of personal responsibility and accountability for delivering high quality work.
- Comfortable working within a distributed team located in multiple time zones.
- EU passport holder.