Salary
💰 $124,300 - $266,400 per year
Tech Stack
GoRubyTerraform
About the role
- Help keep all user-facing services and production systems reliable, scalable, and efficient
- Specialize in operating a large fleet of GitLab environments built to meet FedRAMP compliance requirements
- Automate workflows across the full environment lifecycle—from provisioning new environments to daily operational tasks
- Deploy features and updates safely and consistently
- Maintain operational excellence across many production environments simultaneously
- Integrate the demands of operating at scale with the rigor of public sector compliance
Requirements
- Proven ability to operate and troubleshoot production workloads across multiple tenants or environments
- Strong hands-on experience with Terraform, including workspace strategies, state management, and automation patterns that scale
- Skilled at diagnosing deployment failures, interpreting pod logs, and debugging scheduling issues and rollback scenarios in live environments
- Ability to read and debug code in Go and/or Ruby
- Experience supporting infrastructure for many customers or environments simultaneously
- Able to reason through complex systems and operational challenges
- Proven ability to work across teams and with internal or external customers to solve technical problems
- Comfortable using GitLab as a daily tool for infrastructure automation, collaboration, and operational workflows
- Benefits to support your health, finances, and well-being
- All remote, asynchronous work environment
- Flexible Paid Time Off
- Team Member Resource Groups
- Equity Compensation & Employee Stock Purchase Plan
- Growth and development budget
- Parental leave
- Home office support
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
TerraformGoRubyproduction workloadsdeployment failurespod logsscheduling issuesrollback scenariosinfrastructure automationenvironment lifecycle
Soft skills
troubleshootingproblem solvingcollaborationreasoning through complex systemsoperational excellence