
Senior Site Reliability Engineer, SRE
dotCMS
full-time
Location Type: Remote
Location: Canada
About the role
- Build the "Golden Path": You will own and evolve the build pipelines, dev setups, and development containers that allow our Stream Aligned teams to ship code independently and safely.
- Institute Observability (O11y): You will own the strategy and tooling for Alerting, Monitoring, and Tracing, empowering developers to see inside their own applications.
- Drive Reliability via SLOs: You will help Stream Aligned teams define and implement Service Level Objectives (SLOs) to back their code pipelines and observability tools.
- Enable, Don't Gatekeep: You will act as a consultant and mentor ("on loan" to teams when necessary) to help them tackle complex infrastructure challenges while ensuring that final decision-making and ownership remain with the Stream Aligned team.
- Future-Proofing: You will help explore and implement new capabilities, including our AI toolchain adoption and modernization efforts.
- Incident Management: Participate in an on-call rotation with a focus on blameless post-mortems and systematically removing the root causes of fatigue.
Requirements
- 5+ years of total experience, with at least 3 years in one of the following roles: SRE, DevOps, or Platform Engineering.
- Previous experience in engineering roles.
- Proven track record: at least 3 years of experience with Kubernetes, AWS, Linux, Terraform, and PostgreSQL.
- Experience with Java applications is highly preferred.
- A deep understanding of CI/CD, Infrastructure as Code, and Observability stacks.
- Experience using and contributing to open source projects; ideally, you are passionate about the open source ecosystem and proud to be a part of it.
- Excellent written and verbal English communication skills.
- You can explain complex infrastructure concepts to application developers clearly.
- You believe in the DevOps philosophy of "you build it, you run it."
- You are a teacher at heart who prefers enabling others over doing it for them.
- You identify patterns and requirements to build one-to-many (1:many) solutions.
Benefits
- Open PTO policy (after the first 90 days in the company)
- Generous number of local company-paid holidays
- Annual company-paid training and development
- Flexible work hours
- Remote work options
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Kubernetes, AWS, Linux, Terraform, PostgreSQL, Java, CI/CD, Infrastructure as Code, Observability, Service Level Objectives
Soft Skills
communication, mentoring, consulting, problem-solving, teaching, pattern recognition, collaboration, enabling others, explaining complex concepts, blameless post-mortems