
Senior Site Reliability Engineer – Container
Alkymi
full-time
Posted on:
Location Type: Remote
Location: United States
Visit company websiteExplore more
Salary
💰 $115,000 - $130,500 per year
Job Level
About the role
- Participate in the architecture and implementation of scalable platform solutions which present long term solutions that meet business requirements
- Collaborate with software engineers, DevOps, Information Security, and other teams to integrate applications into the platform
- Create and share guidance for other teams on the proper implementation of infrastructure including technical guides with best practices
- Monitor system health, performance, and reliability, and implement proactive measures to prevent downtime
- Implement processes to ensure security vulnerabilities are remediated within SLA
- Investigate and troubleshoot complex system and performance issues, providing root cause analysis and solutions
- Identify opportunities for process and system improvements and contribute to ongoing performance and cost optimization efforts
- Stay informed about industry trends and best practices to continually improve the platform
- Participate in a on-call schedule
- Create system infrastructure and processes documentation
Requirements
- 4+ years experience in a DevOps / SRE / Platform Engineering role
- Direct experience of cloud platforms (e.g., AWS, Azure, GCP)
- Proficiency in scripting and automation using tools used by DevOps professionals such as Python, Bash, Powershell, or Java/.NET development
- Familiar with creating/modifying infrastructure-as-code (IAC)
- Strong experience with modern CI/CD tooling focused around rapid container deployment.
- Strong understanding of networking, load balancing, and security principles.
- Experience using automation tools. Build, provision, deploy, test and monitor
- Familiarity with creating physical and logical infrastructure diagrams
- Ability to communicate effectively both verbally and in written form. Adapts communication style to different audiences
- Effective presentation skills
- Ability to work cross functionally
- Provide mentorship to team members
- Work is done independently and reviewed at critical points
- Key stakeholder in projects of diverse scope from design to completion
- Enhances relationships with internal/external partners
- Ability to participate in on-call rotation as assigned
- Master’s degree in computer science or related field (preferred)
- Experience with containerization and orchestration technologies, such as Docker and Kubernetes (preferred)
- Understanding of regulatory standards or experience working in a PCI environment (preferred)
- Previous experience with Git or a similar source code management system (preferred)
- Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK Stack) (preferred).
Benefits
- remote-first environment
- unlimited paid time off
- 401(k) with employer match
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
DevOpsSREPlatform Engineeringcloud platformsscriptingautomationinfrastructure-as-codeCI/CDnetworkingcontainerization
Soft Skills
effective communicationpresentation skillscross-functional collaborationmentorshipindependent workrelationship buildingproblem-solvingroot cause analysisprocess improvementadaptability
Certifications
Master's degree in computer science