
Senior DevOps Engineer
Evidation
full-time
Posted on:
Location Type: Remote
Location: Remote • California • 🇺🇸 United States
Visit company websiteSalary
💰 $160,000 - $200,000 per year
Job Level
Senior
Tech Stack
AWSCloudDistributed SystemsDockerGoKubernetesPythonRubyTerraform
About the role
- Design, build, and maintain highly available, scalable infrastructure on AWS using Infrastructure as code.
- Design and operate multi-tenant Kubernetes environments running on EKS, including cluster operations, workload management, autoscaling, and cost-optimized configurations.
- Drive Infrastructure-as-Code (IaC) best practices using Terraform and Pulumi, including modularization, testing, versioning, and safe deployment patterns.
- Contribute to CI/CD ecosystem using GitHub Actions, reusable workflows, and secure secrets management; ensure fast, resilient, and traceable deployment pipelines.
- Build and maintain containerization based software delivery pipeline leveraging Docker, Helm charts, and Github workflows.
- Define and continuously improve monitoring, alerting, dashboards, and logging using Datadog.
- Evaluate operational data to identify performance, stability, and cost-efficiency opportunities.
- Provide advanced support for major incidents, performing root cause analysis, writing clear postmortems, and ensuring long-term corrective actions.
- Apply a security-first mindset to infrastructure architecture, IAM, network boundaries, and workload configurations.
- Implement work in alignment to controls in support of ISO 27001, SOC 2, HIPAA, and other regulated requirements.
- Collaborate with Security to operationalize secure-by-default infrastructure patterns.
- Collaborate with Engineering, Data, and Delivery teams to define requirements, translate technical needs, and deliver scalable solutions.
- Facilitate knowledge sharing through documentation, playbooks, incident reviews, and architectural discussions.
- Identify opportunities to add value beyond immediate requests—improving reliability, simplifying processes, and reducing operational load.
Requirements
- 8+ years of DevOps, SRE, Platform Engineering, or relevant experience supporting production cloud systems.
- Expert-level experience with AWS services.
- Expert-level experience managing Kubernetes environments, including Helm, KEDA, cluster lifecycle, and multi-environment deployments.
- Advanced CI/CD experience using GitHub Actions (workflows, reusable workflows, OIDC auth, environments) or similar technology.
- Expert-level containerization skills (Docker, image optimization, registry management).
- Strong proficiency with Terraform and Pulumi for Infrastructure as Code.
- Hands-on experience with AI-assisted development tools (VSCode, GitHub Copilot, code generation workflows).
- Strong proficiency with scripting and coding automation tools.
- Experience in more than one of: Bash, Python, Ruby, or Go.
- Experience building reliable, observable systems using Datadog (metrics, logs, traces, monitors) or similar solution.
- Strong understanding of distributed systems, networking, autoscaling, and operational patterns in cloud-native architectures.
- Strong debugging, problem-solving, and incident response skills across complex, multi-service systems.
Benefits
- salary + bonus + equity + benefits
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
AWSKubernetesTerraformPulumiGitHub ActionsDockerHelmDatadogBashPython
Soft skills
problem-solvingincident responsecollaborationknowledge sharingcommunication
Certifications
ISO 27001SOC 2HIPAA