Tech Stack
AWSAzureCassandraCloudDistributed SystemsDNSDockerElasticSearchFluxGoogle Cloud PlatformGradleGrafanaKubernetesNoSQLPrometheusTerraform
About the role
- Cloud Operations Engineer (Kubernetes) · LATAM Location: Anywhere in LATAM Job Type: Remote Project: Healthtech Time Zone: GMT-3 (Argentina time) English Level: B2 / C1
Design, deploy, and maintain Kubernetes clusters for development, staging, and production environments.
Collaborate with software and operations teams to support a diverse ecosystem of cloud applications.
Implement scalable, fault-tolerant deployments using Kubernetes.
Set up and manage observability tools like Grafana and Prometheus.
Continuously optimize infrastructure for performance and cost-efficiency.
Automate infrastructure with tools such as ArgoCD, Helm, Kustomize, GitHub Actions, and Gradle.
Support distributed NoSQL storage (Scylla or Cassandra) and OpenSearch workloads.
Enforce Kubernetes security best practices including RBAC and network policies.
Participate in incident management, including root cause analysis and disaster recovery planning.
Maintain detailed technical documentation and operational playbooks.
Requirements
- 3+ year of experience managing production-grade Kubernetes clusters.
Previous experience as a Cloud Ops Engineer, SRE, or similar Kubernetes-focused role.
Strong understanding of Kubernetes architecture, resources, and deployments.
Hands-on with Docker, Helm, Kustomize, and cloud platforms (AWS, Azure, or GCP).
Experience with GitOps and Infrastructure as Code (ArgoCD, Flux, Terraform, etc.).
Operational experience with Scylla or Cassandra and OpenSearch or Elasticsearch.
Solid understanding of cloud networking, DNS, and load balancing.
Strong skills in debugging, performance tuning, and distributed systems.
Awareness of Kubernetes security best practices.
Excellent collaboration and communication skills.