Tech Stack
AnsibleAWSAzureCloudDockerJenkinsKubernetesLinuxMySQLNoSQLPostgresPythonRDBMSRedisTerraform
About the role
- Design, operate, and maintain production clusters, developing automation strategies and deployment processes
- Take ownership of platform challenges, proactively addressing and resolving issues to ensure a high-performance, reliable infrastructure
- Manage and scale cloud infrastructure (AWS and Azure), continuously monitor systems, plan and execute upgrades, and respond to outages including after-hours support when needed
- Identify potential issues early; evaluate and improve technologies, procedures, and workflows to increase operational efficiency and performance
- Investigate, evaluate, and implement new tools and technologies, applying expertise in AWS, Azure, Docker Swarm, Kubernetes, GitLab CI, Jenkins, and Terraform
- Operate within agile/lean frameworks and collaborate closely with cross-functional teams (product management, design, engineering)
- Seek and implement innovative approaches to improve infrastructure performance, reliability, scalability, and security
- Meet regularly with the line manager to review progress; manage issue resolution and escalation; work effectively with other teams; support other organizational priorities when necessary; act as an Omilia ambassador
Requirements
- 4+ years of working experience in a similar role
- Experience with AWS
- Strong scripting skills (Bash, Python)
- Experience in operating a container orchestration cluster (Kubernetes, Docker Swarm)
- Experience with Terraform
- Experience with Continuous Integration Tools (Gitlab CI, Jenkins, ArgoCD)
- Experience with Configuration Management Tools (Ansible)
- Considered a plus: Experience with Azure
- Considered a plus: Experience in Linux Administration
- Considered a plus: Experience with Cloud Security Services
- Considered a plus: Basic RDBMS knowledge (MySQL, Postgres)
- Considered a plus: Basic NoSQL knowledge (Redis)