FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Engineer II – Site Reliability
CrowdStrikeEngineer II focusing on Site Reliability for Temporal infrastructure within CrowdStrike's engineering team. Responsibilities include operations, automation, and performance tuning in a hybrid work environment.
Tech Stack
Tools & technologiesAnsibleAWSCloudDistributed SystemsGoGoogle Cloud PlatformKubernetesPostgresPythonTerraform
About the role
Key responsibilities & impact- Operate Temporal infrastructure in production - deploy updates, monitor cluster health, respond to alerts, and maintain availability across multiple environments using Helm, Kubernetes and FluxCD
- Automate operational work - write scripts and workflows that make deployments, upgrades, scaling operations, and troubleshooting repeatable and safe; reduce manual toil over time
- Support capacity planning and performance tuning - track resource utilization, identify bottlenecks, tune configuration for better performance and contribute to capacity forecasts under guidance
- Build observability - instrument services with metrics and logs, improve dashboards, and refine alerting so the team can catch problems before they impact users
- Contribute to on call rotation - participate in incident response, learn how to triage and escalate issues effectively, write runbooks that help the next person on-call
- Learn GitOps workflows - work with FluxCD to manage infrastructure-as-code, submit pull requests for configuration changes, and understand how declarative deployment pipelines work
- Troubleshoot operational issues - investigate deployment failures, connectivity problems, performance degradations, and work with teammates to determine root cause and preventive fixes
- Partner with consuming teams - help internal engineers onboard to Temporal, answer questions, debug integration issues, and contribute to documentation that makes adoption easier
- Grow your infrastructure skills - work with PostgreSQL, AWS/GCP, Kubernetes networking, Helm chart management, certificate rotation, secret management and distributed systems operations under mentorship
Requirements
What you’ll need- 3+ years in DevOps, SRE, platform engineering or infrastructure roles - you've worked on production systems and understand the basics of running services reliably
- Kubernetes fundamentals - you've deployed services to Kubernetes, understand pods/deployments/services, and can debug basic cluster issues; you don't need deep expertise but should be comfortable navigating kubectl and reviewing YAML
- Helm experience - you've used Helm to deploy applications, understand charts and values files, and can troubleshoot failed releases
- Some infrastructure-as-code experience - you've used tools like Terraform, Ansible, or GitOps workflows (FluxCD, ArgoCD) to manage infrastructure declaratively rather than clicking in consoles
- Cloud platform exposure - you've worked with AWS or GCP in some capacity; you understand basic compute, networking, and storage primitives but don't need to be an expert
- Scripting ability - you can write scripts (Bash, Python, Go) to automate repetitive tasks and build simple tooling
- Basic understanding of stateful systems - you've worked with databases (PostgreSQL preferred) or other persistent services and understand backups, schema management, and connection handling at a foundational level
- Willingness to learn and ask for help - you're comfortable saying "I don't know" and diving into unfamiliar territory with support from teammates.
Benefits
Comp & perks- Market leader in compensation and equity awards
- Comprehensive physical and mental wellness programs
- Competitive vacation and holidays for recharge
- Paid parental and adoption leaves
- Professional development opportunities for all employees regardless of level or role
- Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
- Vibrant office culture with world class amenities
- Great Place to Work Certified™ across the globe
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
KubernetesHelmFluxCDPostgreSQLAWSGCPBashPythonGoinfrastructure-as-code
Soft Skills
willingness to learnproblem-solvingcollaborationcommunicationincident responsetriagedocumentationcapacity planningperformance tuningdebugging