
Senior Site Reliability Engineer
Euna Solutions
full-time
Location Type: Hybrid
Location: Oakville • Canada
Salary
💰 CA$120,000 - CA$150,000 per year
About the role
- Design & implement highly available, scalable, and fault-tolerant systems with a programming-driven approach to problem-solving.
- Partner closely with software developers, applying your multi-language programming skills (e.g., Python, Go, Java, or others) to build tools, services, and automation that improve reliability.
- Drive adoption of Infrastructure as Code (IaC) using Terraform and other technologies, ensuring repeatable, version-controlled deployments.
- Design, build, and maintain CI/CD pipelines — integrating automated testing, linting, and deployment strategies informed by software development best practices.
- Implement and manage observability solutions (monitoring, logging, tracing) that provide actionable insights into application performance and infrastructure health.
- Participate in code reviews for infrastructure-related services, promoting high-quality, maintainable, and secure code.
- Mentor junior engineers on both SRE principles and coding standards across languages.
- Participate in incident response activities, perform root cause analysis, and implement long-term preventative measures — often via code-driven solutions.
- Evaluate and integrate new tools, frameworks, and programming techniques to improve operational efficiency and team productivity.
- Contribute to the technical direction of the SRE team, shaping priorities with a developer’s mindset.
Requirements
- Bachelor’s degree in Computer Science, Software Engineering, or equivalent practical experience.
- 6+ years of combined experience in SRE, DevOps, or software engineering roles.
- Proven expertise in designing and supporting distributed systems at scale.
- Solid professional experience in multiple programming languages (e.g., Python, Go, Java, C#, or JavaScript/TypeScript) with strong debugging and code optimization skills.
- Hands-on experience with IaC tools — especially Terraform.
- Extensive CI/CD pipeline design & management experience.
- Familiarity with observability platforms (Prometheus, Coralogix, Datadog, etc.).
- Strong understanding of cloud platforms (AWS, Azure) and containerization (Docker, Kubernetes).
- Ability to troubleshoot complex issues across the full stack — from code to infrastructure.
- Excellent communication and collaboration skills with both technical and non-technical stakeholders.
- Passion for automation, operational excellence, and applying software engineering discipline to SRE work.
Benefits
- Competitive wages
- Wellness days
- Community Engagement Committee
- Flexible workday
- Benefits
Applicant Tracking System Keywords
Hard Skills & Tools
Python, Go, Java, C#, JavaScript, TypeScript, Infrastructure as Code, Terraform, CI/CD, observability
Soft Skills
communication, collaboration, mentoring, problem-solving, troubleshooting, root cause analysis, operational excellence, automation, team productivity, developer mindset
Certifications
Bachelor’s degree in Computer Science, Bachelor’s degree in Software Engineering