Tech Stack
Tools & technologies
- AWS
- Cloud
- Go
- Google Cloud Platform
- Python
- SDLC
About the role
Key responsibilities & impact
- Design and build automated reliability and self-healing systems that protect production at scale.
- Own and improve incident management tooling and on-call health.
- Develop and evolve our observability infrastructure, including monitoring and alerting.
- Contribute to AI-driven operational tooling that goes beyond triage.
- Drive incident prevention by identifying systemic patterns and eliminating operational toil.
- Partner directly with product engineering teams to diagnose reliability gaps.
- Define and champion operational excellence best practices across engineering.
- Champion and embed Samsara’s cultural principles as we scale globally.
Requirements
What you’ll need
- 8+ years of experience designing and building products in a software engineering team.
- Bachelor's Degree in Computer Science/Engineering or equivalent practical experience.
- 3+ years of experience in infrastructure and/or platform engineering-focused teams.
- Expertise in observability, reliability, operational metrics, and data analysis.
- Proven track record in architecting monitoring frameworks, SLO platforms, and automated response workflows.
- Proven experience working on large-scale enterprise software applications.
- Experience with Developer Experience (DevEx) and internal portals.
- Familiarity with cloud platforms (AWS, GCP, or the like).
- Experience in implementing AI-driven automation across the software development lifecycle (SDLC).
- Familiarity with writing high-quality code (Go, Python, or equivalent) focused on infrastructure, deployment, and operations challenges.
- Experience mentoring and supporting engineers in a technical lead capacity.
- Proactive growth mindset, always looking at ways to improve the status quo.
Benefits
Comp & perks
- Comprehensive health and parental leave plans
- Professional development stipend
- Flexible working model that caters to diverse needs
- Initial RSU grant with no vesting cliff
- Ongoing refresh opportunities tied to performance
ATS Keywords
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
observability, reliability, operational metrics, data analysis, monitoring frameworks, SLO platforms, automated response workflows, high-quality code, AI-driven automation, infrastructure
Soft Skills
proactive growth mindset, mentoring, supporting engineers, championing operational excellence, diagnosing reliability gaps
Certifications
Bachelor's Degree in Computer Science, Bachelor's Degree in Engineering
