Tech Stack
Ansible, AWS, Azure, Cloud, Distributed Systems, DNS, Docker, Go, Google Cloud Platform, Grafana, Jenkins, Kubernetes, Prometheus, Python, Terraform
About the role
- We are looking for a DevOps Engineer who thrives on scaling complex infrastructure, building resilient platforms, and driving automation across the software delivery lifecycle. This role is integral to keeping our platform secure, scalable, observable, and fast-moving in support of both internal teams and customer-facing products.
- Design, implement, and maintain core platform infrastructure to support scale and high availability.
- Champion observability across systems to ensure proactive monitoring and rapid incident response. You can't improve what you don't measure.
- Automate build, deployment, and environment management to enable continuous delivery and infrastructure as code.
- Collaborate across engineering to drive reliability, performance, and platform best practices.
Requirements
- Bachelor's degree in Computer Science or Engineering, or equivalent practical experience (a degree is not required).
- 5+ years of experience in DevOps, Site Reliability, or Platform Engineering roles.
- Deep expertise in cloud environments (AWS, GCP, or Azure) and infrastructure as code (Terraform, Pulumi, etc.).
- Proficiency in container orchestration (Kubernetes, Nomad), Docker, and service mesh technologies.
- Strong understanding of CI/CD systems (GitHub Actions, ArgoCD, Jenkins, CircleCI, etc.).
- Hands-on experience with monitoring and observability stacks (Prometheus, Grafana, ELK, Datadog).
- Familiarity with distributed systems, load balancing, and caching strategies.
- Strong scripting and automation skills in Bash, Python, or Go.
- Knowledge of networking, DNS, TLS, IAM, and secrets management.
- Excellent troubleshooting, time management, and communication skills.
- Passion for automation, performance, reliability, and developer experience.