Drive incident response best practices, lead postmortems, and define SLAs/SLOs across platform services
Collaborate with product & app teams to implement reliability reviews
Oversee evolution of the observability stack
Build frameworks and tools that reduce toil and empower engineering teams with self-service capabilities
Advocate for reliability and operational excellence across product and engineering teams
Requirements
7+ years in SRE, platform, systems, or infrastructure engineering
Strong background in AWS and Kubernetes operations
Experience establishing SLOs/SLA frameworks and driving org-wide adoption
Strong track record leading incident response and postmortem culture
Experience with observability ecosystems
Programming/scripting skills in Python, Go, or similar (automation-first mindset)
Strong cross-functional leadership partnering with product, platform, and app teams
Benefits
Canary Days: As a company we want to ensure that the team has time to recharge. Each month we provide company wide days off to ensure there is at least one extended weekend or day off.
Self Improvement Club: We meet each month and share our personal goals for the month. Each individual is provided a budget towards any purchases that help us achieve these goals.
Professional Development Chats: We provide budget to help drive cross functional professional development conversations across the organization.
Travel Reimbursement: Team members are able to visit our offices across New York, San Francisco or Dallas when they choose, and are provided a travel stipend for doing so. Spend time working with the team in their office, and use the rest of your time exploring a new city!
Personal Travel Reimbursement: If you stay at a hotel that Canary works with, we provide a credit towards your stay.
Canary Technologies is an equal opportunity employer. We recruit, employ, train, compensate and promote talent regardless of race, religion, ethnicity, national origin, citizenship, gender, gender identity, sexual orientation, age, veteran status, disability, genetic information or any other protected characteristic.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.