
Explore more
About the role
- Design, develop, and optimize key components of a Kubernetes-powered observability platform used by development teams in a constantly evolving cloud environment.
- Apply high-quality software development principles to enable continuous deployment of improvements to critical, high-traffic services, including telemetry ingestion, alerting, and observability tooling.
- Collaborate on the design of solutions that meet internal customer needs within an agile team that values consensus, collective code delivery, continuous improvement, and contributions to Open Source software.
- Participate in support and on-call activities, maintaining high standards of quality and reliability to ensure platform performance and minimize operational load.
- Contribute to integrating artificial intelligence into the platform and engineering practices, including developing AI agents to accelerate investigations, troubleshooting, and operational efficiency.
Requirements
- Bachelor’s degree in mathematics, statistics, computer science, or another relevant field
- 3+ years of experience in a similar role within a software development team
- Experience with public clouds (AWS/EKS, OCI/OKE)
- Experience with Kubernetes and containers (Docker)
- Hands-on experience with Infrastructure as Code concepts (Terraform, Argo, Helm)
- Experience in building a conversational AI agent (Pydantic)
Benefits
- Comprehensive health benefits, life and disability insurance, and fertility and family-building support programs
- Generous paid time off and vacation, paid volunteer time, quarterly personal wellness days, and meeting-free days
- Tuition and book reimbursement programs to support your continuous learning and professional development
- Thrive Global wellness program, confidential Employee Assistance Program (EAP), and individual wellness coaching
- Employee programs — including Employee Resource Groups (ERGs), the "GoTo Gives Back" volunteering program, and our charitable donation matching program — to expand your social network and amplify the impact of your efforts
- A Registered Retirement Savings Plan (RRSP) to help you plan for the future
- Gym reimbursement programs to support your physical well-being
- Access to telemedicine services for convenient healthcare
- GoTo performance bonus program to celebrate your engagement and contributions
- A monthly remote work allowance to cover home office expenses
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
KubernetesDockerTerraformArgoHelmAI agentstelemetry ingestionalertingobservability toolingInfrastructure as Code
Soft Skills
collaborationconsensuscontinuous improvementcollective code deliverysupporton-call activitiesquality assurancereliabilityproblem-solvingcommunication