
Site Reliability Engineer
Selector
full-time
Posted on:
Location Type: Remote
Location: Remote • 🇺🇸 United States
Visit company websiteJob Level
Mid-LevelSenior
Tech Stack
AnsibleChefKubernetesLinuxNode.jsPythonTerraform
About the role
- Design, implement, and maintain infrastructure for platform using infrastructure as code (IaC) tools like Terraform or Ansible.
- Automate software deployments and configuration management using tools like GitOps or Kubernetes.
- Configure and manage monitoring tools to proactively identify and troubleshoot performance issues.
- Implement incident response procedures to ensure rapid resolution of service disruptions.
- Collaborate with the development team to integrate CI/CD pipelines for faster and more reliable deployments.
- Stay up-to-date on the latest DevOps and AIOps trends and technologies.
- Participate in code reviews and contribute to the overall code quality of the platform.
Requirements
- Bachelor’s degree or higher in a relevant field
- Kubernetes operations understanding
- Experience with multi-node kubernetes deployment in EKS, GKE, AKS and RKE2
- Expertise in infrastructure automation tools like Terraform, Ansible, or Chef.
- Proficiency in scripting languages like Python, Bash, or PowerShell.
- In-depth knowledge of Linux operating systems.
- Excellent troubleshooting and problem-solving skills.
- Strong communication and collaboration skills.
- Ability to work independently and as part of a team.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
infrastructure as codeTerraformAnsibleGitOpsKubernetesCI/CD pipelinesscripting languagesPythonBashPowerShell
Soft skills
troubleshootingproblem-solvingcommunicationcollaborationindependenceteamwork