Engineer II – Site Reliability

CrowdStrike

Engineer II focusing on Site Reliability for Temporal infrastructure within CrowdStrike's engineering team. Responsibilities include operations, automation, and performance tuning in a hybrid work environment.

Posted 6/23/2026 • full-time • Bangalore • 🇮🇳 India • Mid-Level • Senior

Tech Stack

Tools & technologies

Ansible, AWS, Cloud, Distributed Systems, Go, Google Cloud Platform, Kubernetes, Postgres, Python, Terraform

About the role

Key responsibilities & impact

Operate Temporal infrastructure in production - deploy updates, monitor cluster health, respond to alerts, and maintain availability across multiple environments using Helm, Kubernetes and FluxCD
Automate operational work - write scripts and workflows that make deployments, upgrades, scaling operations, and troubleshooting repeatable and safe; reduce manual toil over time
Support capacity planning and performance tuning - track resource utilization, identify bottlenecks, tune configuration for better performance and contribute to capacity forecasts under guidance
Build observability - instrument services with metrics and logs, improve dashboards, and refine alerting so the team can catch problems before they impact users
Contribute to the on-call rotation - participate in incident response, learn how to triage and escalate issues effectively, and write runbooks that help the next person on call
Learn GitOps workflows - work with FluxCD to manage infrastructure-as-code, submit pull requests for configuration changes, and understand how declarative deployment pipelines work
Troubleshoot operational issues - investigate deployment failures, connectivity problems, performance degradations, and work with teammates to determine root cause and preventive fixes
Partner with consuming teams - help internal engineers onboard to Temporal, answer questions, debug integration issues, and contribute to documentation that makes adoption easier
Grow your infrastructure skills - work with PostgreSQL, AWS/GCP, Kubernetes networking, Helm chart management, certificate rotation, secret management and distributed systems operations under mentorship

Requirements

What you’ll need

3+ years in DevOps, SRE, platform engineering or infrastructure roles - you've worked on production systems and understand the basics of running services reliably
Kubernetes fundamentals - you've deployed services to Kubernetes, understand pods/deployments/services, and can debug basic cluster issues; you don't need deep expertise but should be comfortable navigating kubectl and reviewing YAML
Helm experience - you've used Helm to deploy applications, understand charts and values files, and can troubleshoot failed releases
Some infrastructure-as-code experience - you've used tools like Terraform, Ansible, or GitOps workflows (FluxCD, ArgoCD) to manage infrastructure declaratively rather than clicking in consoles
Cloud platform exposure - you've worked with AWS or GCP in some capacity; you understand basic compute, networking, and storage primitives but don't need to be an expert
Scripting ability - you can write scripts (Bash, Python, Go) to automate repetitive tasks and build simple tooling
Basic understanding of stateful systems - you've worked with databases (PostgreSQL preferred) or other persistent services and understand backups, schema management, and connection handling at a foundational level
Willingness to learn and ask for help - you're comfortable saying "I don't know" and diving into unfamiliar territory with support from teammates.

Benefits

Comp & perks

Market leader in compensation and equity awards
Comprehensive physical and mental wellness programs
Competitive vacation and holidays to recharge
Paid parental and adoption leave
Professional development opportunities for all employees regardless of level or role
Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
Vibrant office culture with world-class amenities
Great Place to Work Certified™ across the globe

ATS Keywords

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools

Kubernetes, Helm, FluxCD, PostgreSQL, AWS, GCP, Bash, Python, Go, infrastructure-as-code

Soft Skills

willingness to learn, problem-solving, collaboration, communication, incident response, triage, documentation, capacity planning, performance tuning, debugging