Salary
💰 $169,000 - $271,000 per year
Tech Stack
CloudGoogle Cloud PlatformKubernetesTerraform
About the role
- Lead and support a team of SREs and DBREs through performance management, mentoring, and career development
- Define and evolve the SRE roadmap—including reliability metrics, tooling, incident response, and service ownership
- Oversee the operation and scaling of our GCP infrastructure, VPN, ingress/egress, and internal networking
- Drive automation across CI/CD, observability, and service operations using tools like Terraform, ArgoCD, and Kubernetes
- Own incident response practices, root cause analyses, and postmortems
- Partner with product engineering teams to ensure platform reliability unlocks member experience
- Influence long-term strategy across reliability, performance, disaster recovery, and cost management
Requirements
- 6+ years in infrastructure, site reliability, or cloud engineering roles, with 2–3+ years of experience leading SRE or infrastructure teams
- Deep experience operating systems in GCP (compute, IAM, storage, networking, etc.)
- Hands-on skills with infrastructure-as-code (Terraform), CI/CD pipelines, observability stacks, and Kubernetes
- A mindset of mentorship, experimentation, and empathy for developers and members
- Opportunity to tackle tough challenges, learn and grow from fellow top talent, and help millions of people reach their personal financial goals
- Flexible hours and virtual first work culture with a home office stipend
- Premium Medical, Dental, and Vision Insurance plans
- Generous paid parental and caregiver leave
- 401(k) savings plan with matching contributions
- Financial advisor and financial wellness support
- Flexible PTO and generous company holidays, including Juneteenth and Winter Break
- All-company in-person events once or twice a year and virtual events throughout to connect with your team members and leadership team
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
GCPinfrastructure-as-codeTerraformCI/CDKubernetesobservabilityincident responseroot cause analysisperformance managementdisaster recovery
Soft skills
mentorshipexperimentationempathyteam leadershipcareer development