
Staff Site Reliability Engineer – Data Infrastructure
Fastly
full-time
Posted on:
Location Type: Hybrid
Location: San Francisco • California, Colorado, New York • 🇺🇸 United States
Visit company websiteSalary
💰 $195,720 - $234,864 per year
Job Level
Lead
Tech Stack
DockerGoKubernetesLinuxPrometheusTerraform
About the role
- Lead full lifecycle projects from design and development through roll out and maintenance
- Deploy and maintain several different types of critical data storage systems on scales from gigabytes to petabytes
- Develop statistics and dashboards to measure service-level objectives for these systems
- Maintain and create tools for management of configuration, backup, and authenticated access to data systems using peer review, CI/CD, and both daemon- and container-based deployment
- Write code that is performant, maintainable, clear, and concise and contribute to code reviews
- Provide technical leadership of full lifecycle projects, driving project progress and collaborating with stakeholders
- Coordinate and communicate with team members and cross-functional teams, fostering relationships to understand end-user needs
- Help project, plan, and scale for growth
- Document processes and requirements for intra- and cross-team knowledge transfer
- Mentor and support other engineers, fostering knowledge sharing and collaboration
- Participate in on-call rotation as needed
Requirements
- Significant professional experience building and maintaining at least one category of backend system, including both APIs and data stores (relational-, column-, vector-, or document-oriented)
- Most Staff Engineers at Fastly have more than 7 years of related experience
- Experience measuring customer-focused performance using tools like Prometheus or DataDog
- Linux system skills, including file systems, networking, I/O analysis, and general kernel tuning
- A strong grasp of networking, routing, network protocols, and related concepts
- Advanced Terraform skills, including working with modules and remote state
- Experience developing automation tools in Go and other languages
- Experience with container technology, Kubernetes/Docker/Helm, or similar
- Collaborative mindset with experience working across cross-functional teams
- Communicative, collaborative, empathetic with a thoughtful, customer-driven approach
- Availability during core North American business hours and evenings and weekends as needed for on-call support (timezones span UTC-8 to UTC-3)
- Willingness to participate in on-call rotation
Benefits
- comprehensive benefits package including medical, dental, and vision insurance
- Family planning
- Mental health support
- Employee Assistance Program
- Insurance (Life, Disability, and Accident)
- Flexible Vacation policy
- up to 18 days of accrued paid sick leave
- 401(k) (including company match)
- Employee Stock Purchase Program
- Equity and discretionary bonus programs
- 11 paid local holidays (2025)
- 11 paid company wellness days
- Benefits start on the first day of employment
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
backend systemsAPIsdata storesLinuxnetworkingTerraformGoautomation toolsKubernetesDocker
Soft skills
technical leadershipcollaborationcommunicationmentoringcustomer-focusedempathyknowledge sharingproject planningrelationship buildingproblem-solving