RBC

Senior Site Reliability Engineer

full-time

Location Type: Office

Location: Montreal, Canada

About the role

  • Participate in code and non-functional (performance, security, maintainability, compliance, change management) reviews of all production-bound SRE solutions
  • Ensure problems are quickly identified and solved through review of Zeke / Splunk / Dynatrace / Salesforce monitoring, inbound calls, email or ServiceNow tickets while providing the highest possible level of production support
  • Drive transformation by continuously looking for ways to automate existing processes
  • Track, audit, monitor, and implement technical work streams
  • Act as portfolio SME (Subject Matter Expert) – understand & document common components, core functionalities, and infrastructure of supported applications
  • Be an escalation point in the on-call rotation, and support our maintenance, scheduled work, and release deployment requirements
  • Drive incident management and problem management for applications in scope, and own the fulfillment of RCA action items
  • Focus on continuous improvement and technical standards – drive improvements in productivity, monitoring, tooling, and best practices
  • Manage technology currency (server patching, certificate renewal, compliance, etc.) with a keen eye on automation opportunities

Requirements

  • 5+ years of working experience in Site Reliability Engineering (SRE) and best practices for running and maintaining critical systems, including monitoring, alerting, and incident management
  • Intermediate experience in a variety of environments (Cloud, Linux/Unix/Windows, services/APIs, and databases)
  • Working experience with scripting, ideally in Java/.NET and SQL
  • Strong expertise in major incident handling and communication
  • Issue investigation skills
  • Effective negotiation skills, stakeholder management
  • Ability to influence the squad at an SRE level
  • Hands-on experience in a variety of SRE languages and tools (Ansible, Dynatrace Managed, Moog, PagerDuty, ServiceNow, GitHub, Slack, Elastic, Logstash, Kibana, Blue Prism, Catchpoint)
  • Ability to work in a 7x24x365 work environment

Benefits

  • A comprehensive Total Rewards Program including bonuses and flexible benefits
  • Competitive compensation
  • Commissions and stock where applicable
  • Leaders who support your development through coaching and managing opportunities
  • Flexible work/life balance options
  • Opportunities to do challenging work
  • Opportunities to take on progressively greater accountabilities

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Site Reliability Engineering, monitoring, alerting, incident management, scripting, Java, .NET, SQL, issue investigation, automation
Soft Skills
communication, negotiation, stakeholder management, influence, problem management