Kraken

Lead Site Reliability Engineer

Kraken

full-time

Posted on:

Location: 🇺🇸 United States

Visit company website
AI Apply
Apply

Salary

💰 $170,000 - $200,000 per year

Job Level

Senior

Tech Stack

AWSDistributed SystemsDockerKubernetesLinuxPostgresPythonRabbitMQRDBMSTerraform

About the role

  • Ensure the availability, performance, and scalability of products on Kraken's platform.
  • Own and lead the Product Reliability team: define strategic objectives, manage priorities, and deliver major initiatives on clear timelines.
  • Collaborate with the Staff Platform Engineer and wider Platform Engineering to deliver technical implementations and outcomes.
  • Line-manage engineers in the Product Reliability team: set performance expectations, review performance, and provide coaching and feedback.
  • Deliver technical improvements including small features and bug fixes; support service offerings owned by the team.
  • Support team delivery through code reviews, technology research, and architectural guidance.
  • Build a strong culture of open communication and an inclusive team environment.
  • Tackle interesting and difficult problems in the global energy market and drive continuous reliability improvements.

Requirements

  • Excellent communication skills, working effectively with developers, product managers and other business stakeholders.
  • Record of successfully and consistently delivering critical path projects, on time and at scale.
  • Meticulous organisation and planning skills.
  • Experience of mentoring and coaching a team to perform at a high-level of quality.
  • Experience managing and supporting large-scale internet-facing distributed systems for millions of customers.
  • Good experience with AWS and a programming language.
  • Knowledge of security best-practices, security and CI/CD tooling, and methodologies.
  • Previous experience in leading technical delivery for small, highly-autonomous teams (helpful).
  • Previous experience as a technical individual contributor, preferably as a Site Reliability Engineer (helpful).
  • Track-record of effective collaboration with other teams and departments to drive holistic outcomes (helpful).
  • A proactive, innovative mindset with the ability to drive continuous improvement (helpful).
  • Previous experience working in a remote-first asynchronous global team (helpful).
  • Familiarity with PostgreSQL or similar RDBMS, Docker and Kubernetes (Amazon EKS), Python, Datadog, messaging queues/event-driven processing (RabbitMQ), Terraform, and experience with a Linux distribution.
Aerospike

Senior Support Engineer

Aerospike
Seniorfull-time$150k–$180k / year🇺🇸 United States
Posted: 30 days agoSource: boards.greenhouse.io
AWSCloudDistributed SystemsDockerKubernetesLinuxNoSQL
WillHire

Senior Software Engineer

WillHire
Seniorfull-time$133k–$199k / yearCalifornia · 🇺🇸 United States
Posted: 6 days agoSource: workday.wd5.myworkdayjobs.com
AWSCloudDistributed SystemsDjangoDockerFlaskKubernetesNoSQLPythonRDBMSTerraform
Truelogic Software

Staff DevOps Engineer, AWS – Health Care

Truelogic Software
Leadfull-timeCalifornia · 🇺🇸 United States
Posted: 7 days agoSource: jobs.ashbyhq.com
AWSCloudDistributed SystemsDockerEC2GoGrafanaJenkinsKubernetesLinuxPrometheusPython+1 more
Twilio

Senior Software Engineer (L3)

Twilio
Seniorfull-time$139k–$204k / year🇺🇸 United States
Posted: 39 days agoSource: boards.greenhouse.io
AWSCloudDistributed SystemsDockerDynamoDBGoGrafanaJavaKafkaKubernetesLinuxPostgres+4 more
TigerData (creators of TimescaleDB)

Senior/Staff Product Manager, AI

TigerData (creators of TimescaleDB)
Seniorfull-timeCalifornia · 🇺🇸 United States
Posted: 18 days agoSource: jobs.ashbyhq.com
AWSCloudDistributed SystemsDockerGoKubernetesPostgresPython