Engineer II – Site Reliability

CrowdStrike

An Engineer II role focused on site reliability for the Temporal infrastructure within CrowdStrike's engineering team. Responsibilities include operations, automation, and performance tuning in a hybrid work environment.

Posted 6/23/2026 • full-time • Bangalore, 🇮🇳 India • Mid-Level, Senior

Tech Stack

Tools & technologies
Ansible, AWS, Cloud, Distributed Systems, Go, Google Cloud Platform, Kubernetes, Postgres, Python, Terraform

About the role

Key responsibilities & impact
  • Operate Temporal infrastructure in production - deploy updates, monitor cluster health, respond to alerts, and maintain availability across multiple environments using Helm, Kubernetes and FluxCD
  • Automate operational work - write scripts and workflows that make deployments, upgrades, scaling operations, and troubleshooting repeatable and safe; reduce manual toil over time
  • Support capacity planning and performance tuning - track resource utilization, identify bottlenecks, tune configuration for better performance and contribute to capacity forecasts under guidance
  • Build observability - instrument services with metrics and logs, improve dashboards, and refine alerting so the team can catch problems before they impact users
  • Contribute to the on-call rotation - participate in incident response, learn how to triage and escalate issues effectively, and write runbooks that help the next person on call
  • Learn GitOps workflows - work with FluxCD to manage infrastructure-as-code, submit pull requests for configuration changes, and understand how declarative deployment pipelines work
  • Troubleshoot operational issues - investigate deployment failures, connectivity problems, performance degradations, and work with teammates to determine root cause and preventive fixes
  • Partner with consuming teams - help internal engineers onboard to Temporal, answer questions, debug integration issues, and contribute to documentation that makes adoption easier
  • Grow your infrastructure skills - work with PostgreSQL, AWS/GCP, Kubernetes networking, Helm chart management, certificate rotation, secret management and distributed systems operations under mentorship

Requirements

What you’ll need
  • 3+ years in DevOps, SRE, platform engineering or infrastructure roles - you've worked on production systems and understand the basics of running services reliably
  • Kubernetes fundamentals - you've deployed services to Kubernetes, understand pods/deployments/services, and can debug basic cluster issues; you don't need deep expertise but should be comfortable navigating kubectl and reviewing YAML
  • Helm experience - you've used Helm to deploy applications, understand charts and values files, and can troubleshoot failed releases
  • Some infrastructure-as-code experience - you've used tools like Terraform, Ansible, or GitOps workflows (FluxCD, ArgoCD) to manage infrastructure declaratively rather than clicking in consoles
  • Cloud platform exposure - you've worked with AWS or GCP in some capacity; you understand basic compute, networking, and storage primitives but don't need to be an expert
  • Scripting ability - you can write scripts (Bash, Python, Go) to automate repetitive tasks and build simple tooling
  • Basic understanding of stateful systems - you've worked with databases (PostgreSQL preferred) or other persistent services and understand backups, schema management, and connection handling at a foundational level
  • Willingness to learn and ask for help - you're comfortable saying "I don't know" and diving into unfamiliar territory with support from teammates

Benefits

Comp & perks
  • Market leader in compensation and equity awards
  • Comprehensive physical and mental wellness programs
  • Competitive vacation and holidays to recharge
  • Paid parental and adoption leave
  • Professional development opportunities for all employees regardless of level or role
  • Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
  • Vibrant office culture with world-class amenities
  • Great Place to Work Certified™ across the globe

ATS Keywords

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Kubernetes, Helm, FluxCD, PostgreSQL, AWS, GCP, Bash, Python, Go, infrastructure-as-code
Soft Skills
willingness to learn, problem-solving, collaboration, communication, incident response, triage, documentation, capacity planning, performance tuning, debugging