AlphaSense

Staff Site Reliability Engineer

full-time

Location Type: Remote

Location: Remote • 🇺🇸 United States

Salary

💰 $150,000 - $225,000 per year

Job Level

Lead

Tech Stack

AWS, Azure, Cloud, DNS, Go, Google Cloud Platform, Grafana, Kubernetes, Prometheus, Python, TCP/IP

About the role

  • Architect Reliability Paved Paths: Build frameworks and self-service tooling that let teams own the reliability of their services in a “You Build It, You Run It” culture.
  • Lead AI-Driven Reliability: Drive our AIOps strategy — automating diagnostics, remediation, and proactive failure prevention.
  • Champion Reliability Culture: Embed SRE practices across engineering via design reviews, production readiness, and operational standards.
  • Incident Leadership: Act as Incident Commander during critical events, modeling operational excellence and ensuring that blameless postmortems lead to lasting improvements.
  • Advance Observability: Deliver end-to-end monitoring, tracing, and profiling (Prometheus, Grafana, OTEL, Continuous Profiling) to optimize performance proactively.
  • Mentor & Multiply: Elevate engineers across SRE and product teams through mentorship, technical guidance, and knowledge sharing.

Requirements

  • 8+ years of experience in Site Reliability Engineering, DevOps, or a similar role, including at least 3 years in a Senior+ SRE position.
  • Strong background in running production SaaS systems at scale.
  • Proficiency in at least one programming/scripting language (Python, Go, or similar).
  • Hands-on expertise with cloud platforms (AWS, GCP, or Azure) and Kubernetes.
  • Deep understanding of networking fundamentals (TCP/IP, DNS, HTTP/S, load balancing).
  • Experience with monitoring & alerting (Prometheus, Grafana, Datadog, ELK).
  • Familiarity with advanced observability (OTEL, continuous profiling).
  • Proven incident management experience, including leading high-severity incidents and postmortems.
  • Strong troubleshooting skills across the full stack.
  • Excellent communication and collaboration skills.

Benefits

  • You may also be offered equity and a generous benefits program.
