Salary
💰 $117,000 - $147,000 per year
Tech Stack
AnsibleAWSAzureCloudDistributed SystemsDNSDockerGoGoogle Cloud PlatformGrafanaJavaKubernetesPrometheusPythonTerraformTypeScript
About the role
- Work directly with developers to debug infrastructure and deployment issues in Temporal environments (Kubernetes, Docker, cloud services)
- Investigate and resolve performance bottlenecks, networking problems, and availability incidents
- Provide clear, actionable guidance to help developers run Temporal reliably at scale
- Independently drive technical solutions for complex production issues and proactively improve systems
- Contribute to observability and monitoring solutions using Prometheus, Grafana, and related tools
- Help improve reliability, load balancing, and ingress/egress networking for Temporal deployments
- Partner closely with engineering, product, and developer advocacy teams to relay field issues and feedback
- Document best practices, troubleshooting playbooks, and infrastructure guides for the developer community
- Participate in on-call rotations to support production deployments and provide timely responses to critical issues
Requirements
- 5+ years of experience in cloud-based environments as an infrastructure engineer
- Proficiency with AWS, GCP, or Azure cloud platforms
- Hands-on experience deploying and managing containerized services (Kubernetes, Docker, EKS, GKE)
- Familiarity with cloud infrastructure and networking (load balancing, DNS, TLS, ingress/egress)
- Demonstrated ability to manage services across on-premises and cloud infrastructures
- Experience with monitoring/observability tools such as Prometheus, Grafana, or OpenTelemetry
- Prior experience troubleshooting production performance and availability issues
- Strong written and verbal communication skills; ability to explain complex technical concepts clearly
- Comfort working on a remote team and collaborating across time zones
- (Nice to have) Experience with infrastructure-as-code (Terraform, Ansible, or AWS CDK)
- (Nice to have) Knowledge of security certificate management and implementation
- (Nice to have) Background in customer-facing roles such as professional services, solutions architecture, or developer advocacy
- (Nice to have) Fluency in one or more of: Python, Go, Java, or TypeScript
- (Nice to have) Familiarity with distributed systems concepts and Temporal workflows