
Site Reliability Engineer, SRE
CAKE.com
full-time
Posted on:
Location Type: Remote
Location: United States
Visit company websiteExplore more
About the role
- Scale and secure our rapidly growing infrastructure
- Automate critical processes
- Ensure a seamless experience for new users
- Make sure the infrastructure keeps up with the growth
- Ensure system scalability and high traffic handling
- Define and deploy monitoring, alerting, and logging systems
- Respond to and resolve production incidents
- Conduct thorough post-mortems
- Monitor server logs for abnormalities
- Design, manage and maintain automation tools for operational processes
Requirements
- 5+ years of relevant work experience
- Working experience with AWS
- Docker
- Git
- CI/CD tools like Gitlab CI, Jenkins, etc.
- Experience with IaC tools like Terraform, CloudFormation, Ansible, Puppet, Packer
- Proficiency with Linux and other Unix-based systems
- Experience setting up build automation
- Excellent understanding of security and safety best practices
- Bachelor’s degree in Computer Science or equivalent work experience
- Excellent written and verbal English communication skills
- Ability to work with mixed US and EU based teams
Benefits
- No overtime
- No work on weekends
- No late working hours
- In-house learning programs
- Tech lectures
- Knowledge sharing
- Remote work with provided MacBook
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
AWSDockerGitCI/CDGitlab CIJenkinsTerraformCloudFormationAnsibleLinux
Soft Skills
communicationcollaborationproblem-solvingincident responsepost-mortem analysis
Certifications
Bachelor’s degree in Computer Science