You will combine software and systems engineering to build and run large-scale, distributed, fault-tolerant systems.
Your primary goal will be to solve operational problems with a software engineering mindset, treating operations as a software problem and automating away toil.
Define, measure, and monitor Service Level Objectives (SLOs) and Service Level Indicators (SLIs) in partnership with product and engineering teams. Champion and help manage error budgets to guide the balance between reliability work and new feature velocity.
Scale systems sustainably through automation; evolve systems by pushing for changes that improve reliability and velocity.
Increase visibility into the health and durability of the Unqork platform.
Practice sustainable incident response and blameless postmortems.
Maintain existing services and tools, augmenting and replacing
Requirements
3-5 years of experience as a Site Reliability Engineer or Software Developer.
Advanced experience with programming/scripting languages such as JavaScript/Node.js, Go, or Python.
Knowledge of Linux monitoring, troubleshooting, and administration.
Ability to write code and evaluate it for scalability and runtime performance.
Experience with container orchestration platforms such as Kubernetes or Nomad.
Experience with monitoring, APM, and logging tooling (e.g., ELK, Grafana, Datadog, New Relic, or Splunk).
Experience working with at least one DBMS (e.g., Postgres, MySQL, Oracle, or MongoDB).
Experience with configuration management tools (e.g., Ansible, Puppet, Chef, or Salt). Ansible Tower or AWX is a plus.
Experience with Infrastructure-as-Code tools such as Terraform, CloudFormation, Google Deployment Manager, or Azure Resource Manager.
Experience working with at least one major Cloud Provider (AWS/Azure/GCP).
Understanding of cloud-native security requirements (e.g., WAF, security groups).
Benefits
💻 Work from home with a remote-first community
🏝 Unlimited PTO (and the encouragement to use it)
📝 Student loan payback program
🏥 100% employer-covered medical, dental, and vision options available to you and your dependents
💸 Flexible Spending Account (FSA)
🏠 Monthly stipend toward your WFH setup, vacation, development and more
💰 Employer-sponsored 401(k) with contribution match
🏋🏻‍♀️ Subsidized ClassPass Membership
🍼 Generous Paid Parental Leave