
Senior Site Reliability Engineer, GitOps
Canonical
full-time
Posted on:
Location Type: Remote
Location: Anywhere in the World
Visit company websiteExplore more
Job Level
About the role
- Drive the development of automation, Gitops in your team as an embedded tech lead
- Closely collaborate with the IS architect to align your solutions with the IS architecture vision
- Design and architect services that IS can offer to the organization as products
- Apply your experience of IaC to develop infrastructure as code practice within IS by constantly increasing automation and improving IaC processes
- Automate software operations for re-usability and consistency across private and public clouds, taking into consideration the complexities of distributed systems
- Maintain operational responsibility for all of Canonical’s core services, networks, and infrastructure
- Develop skills in troubleshooting, capacity planning, and performance investigation, Setting up, maintaining and using observability tools such as Prometheus, Grafana, and Elasticsearch; design, implement and maintain monitoring and alerting for various systems and services
- Provide assistance and work with globally distributed engineering, operations, and support peers
- Be given uninterrupted development time to focus on larger projects and automation of manual tasks
- Share your experience, know-how and best practices with other team members in design sessions, mentorship and ‘doing work together’
- Carry final responsibility for time-critical escalations
Requirements
- A modern view on hosting architecture, driven by infrastructure as code across both private and public clouds.
- A product mindset thriving to develop products rather than solutions.
- Python software development experience, with large projects
- Experience working with Kubernetes or other container orchestration systems.
- Proven exposure to manage and deploy cloud infrastructure with code.
- Practical knowledge of Linux networking, routing, and firewalls
- Affinity with various forms of Linux storage, from Ceph to Databases
- Hands-on experience administering enterprise Linux servers
- Extensive knowledge of cloud computing concepts and technologies
- Bachelor's degree or greater, preferably in computer science or related engineering field
- Able to communicate clearly and effectively in English over email, chat, video or voice calls and in-person
- Motivated and able to troubleshoot from kernel to web, and willing to ask others when appropriate
- A willingness to be flexible and able to learn new things quickly
- Be inspired by the needs of fast-changing environments
- Happy to work within distributed teams
- Be passionate and familiarized about open-source, especially Ubuntu or Debian
Benefits
- Distributed work environment with twice-yearly team sprints in person
- Personal learning and development budget of USD 2,000 per year
- Annual compensation review
- Recognition rewards
- Annual holiday leave
- Maternity and paternity leave
- Team Member Assistance Program & Wellness Platform
- Opportunity to travel to new locations to meet colleagues
- Priority Pass and travel upgrades for long-haul company events
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PythonKubernetesInfrastructure as Code (IaC)Linux networkingLinux routingLinux firewallsCloud infrastructure managementObservability toolsPrometheusGrafana
Soft Skills
CollaborationMentorshipCommunicationTroubleshootingFlexibilityMotivationAdaptabilityTeamworkProblem-solvingProduct mindset
Certifications
Bachelor's degree in computer scienceBachelor's degree in engineering