Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
CrowdStrike

SRE/Dev Ops Engineer

CrowdStrike

SRE/Dev Ops Engineer managing production infrastructure across multiple clouds for cybersecurity leader CrowdStrike. Focused on automation, system reliability, and collaboration with engineering teams.

Posted 5/30/2026full-timeSunnyvale • California • 🇺🇸 United StatesSeniorLead💰 $120,000 - $180,000 per yearWebsite

Tech Stack

Tools & technologies
CloudDistributed SystemsFluxGrafanaJenkinsKubernetesNoSQLPrometheusTerraform

About the role

Key responsibilities & impact
  • Run production infrastructure - Deploy, upgrade, and maintain platform services across multiple clouds and regions on Kubernetes.
  • Build and maintain CI/CD pipelines - Make it safe and fast to ship infrastructure changes using GitOps workflows and release automation.
  • Build control planes - Create the APIs and tooling that make provisioning and scaling repeatable and self-service.
  • Own capacity planning - Track usage, forecast growth, right-size clusters, and keep infrastructure costs in check.
  • Build observability - Set up metrics, dashboards, and alerts using Prometheus and Grafana.
  • Write runbooks that make on-call clear and actionable.
  • Own on-call and incidents - Join the on-call rotation, resolve issues, write postmortems, and turn repeat problems into automation.
  • Automate everything - Deployments, upgrades, certificate rotations, failover. If you do it by hand more than once, automate it.
  • Driving system reliability by blending software engineering principles with AI-driven automation, moving from reactive firefighting to proactive, automated operations.
  • Harden security - set up auth, encryption, secret rotation, and network policies.
  • Keep dependencies patched and CVEs resolved.
  • Own disaster recovery - Build backup strategies, test failover, and make sure platforms can survive infrastructure failures.
  • Enable other teams - Provide templates, patterns, and direct support to help engineering teams use platforms reliably.
  • Collaborate across teams - Collaborate with Infrastructure, SRE, and Data Services on shared operational problems.

Requirements

What you’ll need
  • 8+ years in DevOps, SRE, or platform engineering.
  • Hands-on experience running stateful distributed systems on Kubernetes in production.
  • CI/CD experience - Building and owning pipelines using GitHub Actions, Jenkins, Tekton, or similar tools.
  • Infrastructure-as-code skills - Terraform, Pulumi, or Crossplane, no manual configuration.
  • GitOps experience - ArgoCD or Flux for managing infrastructure deployments.
  • Observability skills - Prometheus, Grafana, and distributed tracing tools like Jaeger or OpenTelemetry.
  • Database operations - Backup, restore, schema management, and performance tuning for relational and NoSQL databases.
  • Security mindset - You implement auth, encryption, secret management, and network policies as part of normal work.
  • Multi-cloud or multi-region experience - you have managed infrastructure across providers or regions.

Benefits

Comp & perks
  • Market leader in compensation and equity awards
  • Comprehensive physical and mental wellness programs
  • Competitive vacation and holidays for recharge
  • Paid parental and adoption leaves
  • Professional development opportunities for all employees regardless of level or role
  • Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
  • Vibrant office culture with world class amenities
  • Great Place to Work Certified™ across the globe

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
KubernetesCI/CDGitOpsTerraformPulumiCrossplanePrometheusGrafanadatabase operationssecurity
Soft Skills
collaborationcapacity planningautomationincident managementproblem-solvingcommunicationsupportproactive operationsteam enablementleadership