Canonical

Senior Site Reliability Engineer, GitOps

full-time

Location Type: Remote

Location: Anywhere in the World

About the role

  • Drive the development of automation and GitOps in your team as an embedded tech lead
  • Closely collaborate with the IS architect to align your solutions with the IS architecture vision
  • Design and architect services that IS can offer to the organization as products
  • Apply your IaC experience to develop the infrastructure-as-code practice within IS by continually increasing automation and improving IaC processes
  • Automate software operations for re-usability and consistency across private and public clouds, taking into consideration the complexities of distributed systems
  • Maintain operational responsibility for all of Canonical’s core services, networks, and infrastructure
  • Develop skills in troubleshooting, capacity planning, and performance investigation
  • Set up, maintain, and use observability tools such as Prometheus, Grafana, and Elasticsearch; design, implement, and maintain monitoring and alerting for various systems and services
  • Provide assistance and work with globally distributed engineering, operations, and support peers
  • Be given uninterrupted development time to focus on larger projects and automation of manual tasks
  • Share your experience, know-how and best practices with other team members in design sessions, mentorship and ‘doing work together’
  • Carry final responsibility for time-critical escalations

Requirements

  • A modern view on hosting architecture, driven by infrastructure as code across both private and public clouds.
  • A product mindset, striving to develop products rather than solutions.
  • Python software development experience on large projects
  • Experience working with Kubernetes or other container orchestration systems.
  • Proven experience managing and deploying cloud infrastructure with code.
  • Practical knowledge of Linux networking, routing, and firewalls
  • Affinity with various forms of Linux storage, from Ceph to databases
  • Hands-on experience administering enterprise Linux servers
  • Extensive knowledge of cloud computing concepts and technologies
  • Bachelor's degree or greater, preferably in computer science or related engineering field
  • Able to communicate clearly and effectively in English over email, chat, video or voice calls, and in person
  • Motivated and able to troubleshoot from kernel to web, and willing to ask others when appropriate
  • A willingness to be flexible and able to learn new things quickly
  • Be inspired by the needs of fast-changing environments
  • Happy to work within distributed teams
  • Be passionate about and familiar with open source, especially Ubuntu or Debian

Benefits

  • Distributed work environment with twice-yearly team sprints in person
  • Personal learning and development budget of USD 2,000 per year
  • Annual compensation review
  • Recognition rewards
  • Annual holiday leave
  • Maternity and paternity leave
  • Team Member Assistance Program & Wellness Platform
  • Opportunity to travel to new locations to meet colleagues
  • Priority Pass and travel upgrades for long-haul company events

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Python, Kubernetes, Infrastructure as Code (IaC), Linux networking, Linux routing, Linux firewalls, Cloud infrastructure management, Observability tools, Prometheus, Grafana
Soft Skills
Collaboration, Mentorship, Communication, Troubleshooting, Flexibility, Motivation, Adaptability, Teamwork, Problem-solving, Product mindset
Certifications
Bachelor's degree in computer science, Bachelor's degree in engineering