Tech Stack
Tools & technologies: Azure, Cloud, Distributed Systems, Kubernetes, Terraform
About the role
Key responsibilities & impact
- Build and operate reliable, scalable services in Kubernetes-based environments.
- Improve service availability, performance, and operational readiness.
- Design and implement automation for deployments, scaling, and recovery.
- Reduce operational toil through scripting, tooling, and process improvements.
- Implement and maintain observability for logs, metrics, and traces.
- Participate in on-call rotations and support incident response and resolution.
- Contribute to post-incident reviews and follow-up reliability improvements.
- Share knowledge and provide informal guidance to teammates.
Requirements
What you’ll need
- 5+ years of SRE experience, including hands-on operation of production systems in Azure cloud environments.
- Practical experience with Kubernetes and containerized workloads.
- Experience implementing observability (monitoring, logging, alerting) for distributed systems.
- Experience with automation and Infrastructure as Code (e.g., Terraform or similar).
- Familiarity with CI/CD pipelines and release practices.
- Ability to troubleshoot incidents and communicate clearly during outages.
- Experience collaborating with cross-functional engineering teams.
- Relevant professional experience gained through work, projects, or equivalent hands-on learning.
Benefits
Comp & perks
- Country-specific benefits
ATS Keywords
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Kubernetes, Azure, observability, monitoring, logging, alerting, Infrastructure as Code, Terraform, CI/CD, scripting
Soft Skills
communication, collaboration, troubleshooting, guidance, incident response, problem-solving, process improvement, teamwork, reliability improvement, availability
