Kami

Senior DevOps Engineer

Full-time

Location Type: Hybrid

Location: Auckland, New Zealand

About the role

  • Analyze and optimize system reliability, performance, and resource utilization of cloud infrastructure.
  • Develop and maintain automation scripts for deployment, monitoring, and maintenance tasks.
  • Implement infrastructure as code (IaC) to automate the provisioning and configuration of infrastructure components.
  • Design and implement monitoring solutions to proactively identify and address issues.
  • Participate in on-call rotations and respond to incidents to ensure system stability and performance.
  • Conduct capacity planning to anticipate future resource needs and optimize infrastructure scalability.
  • Define and track reliability metrics to measure and improve system performance.
  • Prepare and present reports on system reliability and performance.
  • Work closely with software development teams to influence and improve the reliability and scalability of applications.
  • Conduct post-incident reviews to identify root causes and implement preventive measures.
  • Troubleshoot complex issues in a production environment.

Requirements

  • 7+ years of experience in a DevOps, SRE, or similar role.
  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • Relevant experience in software engineering, systems administration, or a related field.
  • Proficiency in one or more programming languages (e.g. Python, Go, Ruby).
  • Strong scripting skills for automation tasks (e.g. Bash, Python).
  • Hands-on experience with, and in-depth knowledge of, cloud platforms (e.g. Google Cloud, AWS) and container orchestration tools (e.g. Kubernetes).
  • Solid understanding of core networking concepts (e.g. TCP/IP, DNS, load balancing).
  • Familiarity with Infrastructure as Code (IaC) tools (e.g. Terraform) and/or configuration management tools (e.g. Ansible, Puppet, Chef).
  • Experience with infrastructure monitoring, logging, and alerting tools (e.g. Datadog, Prometheus, Grafana, PagerDuty), as well as log analysis.
  • Strong collaboration and communication skills to work effectively with cross-functional teams.
  • Ability to analyze complex systems and troubleshoot issues effectively.

Benefits

  • A people-first employer that is on an inspiring mission to build the future of education while changing the lives of millions
  • Continuous learning and development opportunities, including subsidised course fees, certifications, conferences, and free access to Udemy and more
