
Senior Site Reliability Engineer
RBC
full-time
Location Type: Office
Location: Montreal • Canada
About the role
- Participate in code and non-functional (performance, security, maintainability, compliance, change management) reviews of all production-bound SRE solutions
- Ensure problems are quickly identified and solved through review of Zeke / Splunk / Dynatrace / Salesforce monitoring, inbound calls, email or ServiceNow tickets while providing the highest possible level of production support
- Drive transformation by continuously looking for ways to automate existing processes
- Track, audit, monitor, and implement technical work streams
- Act as portfolio SME (Subject Matter Expert) – understand & document common components, core functionalities, and infrastructure of supported applications
- Be an escalation point in the on-call rotation, and support our maintenance, scheduled work, and release deployment requirements
- Drive incident management and problem management for in-scope applications, and own the fulfillment of RCA action items
- Focus on continuous improvement and technical standards: drive improvements in productivity, monitoring, tooling, and best practices
- Manage technology currency (server patching, certificate renewal, compliance, etc.) with a keen eye on automation opportunities
Requirements
- 5+ years of working experience in Site Reliability Engineering (SRE) and best practices for running and maintaining critical systems, including monitoring, alerting, and incident management
- Intermediate experience in a variety of environments (Cloud, Linux/Unix/Windows, services/APIs, and databases)
- Working experience with scripting, ideally in Java/.NET and SQL
- Strong expertise in major incident handling and communication
- Issue investigation skills
- Effective negotiation skills, stakeholder management
- Ability to influence the squad at an SRE level
- Hands-on experience in a variety of SRE languages and tools (Ansible, Dynatrace Managed, Moog, PagerDuty, ServiceNow, GitHub, Slack, Elastic, Logstash, Kibana, Blue Prism, Catchpoint)
- Ability to work in a 7x24x365 work environment
Benefits
- A comprehensive Total Rewards Program including bonuses and flexible benefits
- Competitive compensation
- Commissions and stock where applicable
- Leaders who support your development through coaching and managing opportunities
- Flexible work/life balance options
- Opportunities to do challenging work
- Opportunities to take on progressively greater accountabilities
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Site Reliability Engineering, monitoring, alerting, incident management, scripting, Java, .NET, SQL, issue investigation, automation
Soft Skills
communication, negotiation, stakeholder management, influence, problem management