TensorWave

Kubernetes Platform Engineer

full-time

Location: 🇺🇸 United States

Job Level

Junior, Mid-Level

Tech Stack

Ansible, Cloud, Grafana, HAProxy, Kubernetes, Linux, Prometheus, Spring, Terraform

About the role

  • Own and troubleshoot operational issues within Kubernetes environments
  • Maintain and monitor core services (e.g., Cilium, HAProxy, Prometheus)
  • Ensure uptime, performance, and reliability of multi-tenant clusters
  • Assist with Ingress/Egress connectivity and network debugging
  • Support internal and customer teams in secure, isolated VPC environments
  • Collaborate with senior engineers on automation and cluster lifecycle improvements

Requirements

  • 2–4 years of experience in DevOps, SRE, or Linux infrastructure roles
  • 1+ year of hands-on experience with Kubernetes in production
  • Familiarity with networking, CNI plugins, and core Linux troubleshooting
  • Strong infrastructure-as-code mindset using tools like Helm, Terraform, or Ansible
  • Solid experience with monitoring and logging tools (e.g., Prometheus, Grafana, Loki)
  • Understanding of secure infrastructure design principles and least-privilege access
  • Comfortable working in a team-oriented, fast-paced operational environment
  • Experience with RKE2, Rancher, or similar platforms (nice to have)
  • Experience troubleshooting or supporting AI or GPU-based workloads (nice to have)
  • Familiarity with HAProxy, Cilium, or other Kubernetes ingress/networking tools (nice to have)