
Staff Site Reliability Engineer, Cloud
Kentik
full-time
Location Type: Remote
Location: United States
Salary
💰 $165,000 - $200,000 per year
About the role
- Make sure our real-time, scalable infrastructure is set up for growth and working efficiently. Our infrastructure runs on our own hardware across multiple locations, as well as on all major cloud vendors
- Work on tools and processes to better monitor our platform and ensure its stability through our rapid growth
- Deep-dive into diverse topics, from firewalls and IP routing to database replication strategies and build process automation
- Collaborate with engineering and infrastructure teams on finding solutions from an operational perspective
- Assist with expanding our cloud deployments across the major cloud providers
- Contribute code, code reviews, tools, and patches across all kinds of existing code
- Write design documents or collaborate on colleagues’ docs to introduce new features or changes into our infrastructure
- Provide valuable feedback on team goals, projects, and processes. We believe in continuously improving our team
Requirements
- 8+ years of experience in cloud-based systems administration, IT, and/or SRE-related projects
- Expertise in public cloud environments such as AWS, GCP, Azure, or OCI.
- Strong command of containerization and orchestration using Docker and Kubernetes.
- Solid programming and automation skills using Bash, Python, or Go.
- Proficiency with Infrastructure as Code (IaC) and configuration management platforms such as Terraform, Ansible, and Puppet.
- Proficiency in Linux administration and command-line tools (e.g., SSH, grep, awk).
- Detailed understanding of major internet protocols (TCP/IP, DNS, HTTP, TLS)
- Network administration experience: concepts such as routing, firewalls (iptables), and peering sound familiar
- A passion for documenting code, processes, and infrastructure in runbooks and wikis
- Experience with metrics and monitoring solutions such as Grafana, Prometheus, Telegraf, and OpenTelemetry
- Experience creating and managing tickets with third-party vendors and owning cloud vendor partner relationships.
Benefits
- The company pays 100% of premiums for health, vision, and dental coverage for you and your dependents
- An annual Health Reimbursement Account (HRA) of $3,000 for an individual or $4,500 for a family
- Paid family & medical leave
- Open PTO, a quarterly Wellness Day, and a minimum of 10 paid holidays
- 401(k) retirement account
- Home office reimbursement
- Stock options
Applicant Tracking System Keywords
Hard skills
cloud-based Systems Administration, SRE, AWS, GCP, Azure, OCI, Docker, Kubernetes, Bash, Python
Soft skills
collaboration, feedback, continuous improvement