Tech Stack
AnsibleCloudGrafanaKubernetesLinuxNode.jsPrometheusPythonRubyRuby on RailsTerraform
About the role
- Architect, design, and maintain scalable, secure Kubernetes clusters and object-storage integrations
- Migrate legacy systems to cloud-native Kubernetes environments with minimal disruption
- Build and maintain Infrastructure-as-Code using tools like Terraform, Ansible, and Helm
- Automate CI/CD pipelines and deployment workflows
- Proactively monitor, debug, and optimize cluster health, resource usage, and performance
- Apply advanced Kubernetes networking, RBAC, service meshes, and security best practices
- Collaborate with developers and ops in Kanban-style sprints; plan and deliver improvements while triaging urgent tickets
- Document systems and processes clearly and build a knowledge-sharing environment
- Champion stability, resilience, and continuous improvement of infrastructure
- Partner closely with a senior technical counterpart to design and level up infrastructure
Requirements
- Proven experience managing Kubernetes and Linux in production environments
- Experience migrating legacy systems to Kubernetes clusters
- Deep autonomy: ability to plan and execute without hand-holding
- Infrastructure-as-Code proficiency: Terraform, Ansible, Helm
- Strong DevOps toolchain skills: CI/CD, Git, automation, alerting
- Proficiency scripting in Python, Bash, or similar
- Experience with observability tools (e.g., Prometheus, Grafana)
- Excellent communication skills and ability to collaborate across teams
- Problem-solver who thrives in ambiguity and fixes issues on the fly
- Experience working in Kanban-style workflows
- Nice-to-have: CKA/CKAD or similar certifications
- Nice-to-have: Familiarity with cloud-native object storage (Ceph, MinIO, S3-compatible)
- Nice-to-have: Experience in entertainment or media tech stacks
- Nice-to-have: Experience managing Ruby on Rails and NodeJS applications