Handoff

Site Reliability Engineer, SRE

Handoff

full-time

Posted on:

Location Type: Hybrid

Location: São Paulo • 🇧🇷 Brazil

Visit company website
AI Apply
Apply

Job Level

Mid-LevelSenior

Tech Stack

AWSAzureCloudGoGoogle Cloud PlatformGrafanaPrometheusPython

About the role

  • You will own and elevate the reliability, scalability, and observability of Handoff’s platform.
  • This is a hands-on role focused on preventing incidents, improving system resilience, and enabling fast, safe product development.
  • You’ll work closely with Backend, Fullstack, Data, and AI engineers to ensure our systems are production-ready, observable, and built to scale, while keeping a strong focus on user impact and developer velocity.
  • This is not a pure ops role. We’re looking for someone who thinks like an engineer, codes regularly, and partners deeply with product and engineering teams.

Requirements

  • Strong experience as an SRE, Platform Engineer, DevOps Engineer, or similar reliability-focused role.
  • Solid understanding of reliability fundamentals, availability, latency, error rates, throughput, durability.
  • Hands-on experience with cloud platforms like AWS, GCP, or Azure.
  • Deep familiarity with observability tools such as Prometheus, DataDog, Grafana, OpenTelemetry, or similar.
  • Strong debugging skills and comfort working in high-pressure production incidents.
  • Experience improving CI/CD pipelines and release safety.
  • Ability to write production-quality code or scripts in languages like Python, Go, or Bash.
  • Experience with infrastructure-as-code and automation.
  • A pragmatic mindset that balances reliability with product velocity and real-world constraints.
  • Strong communication skills and comfort collaborating across engineering, product, and leadership.
  • Comfortable in a fast-paced environment, you’re quick to adapt to changing priorities and balance rapid iteration with high-quality outputs.
Benefits
  • 💸 Competitive **salary in USD**
  • 💰 Attractive **stock options**
  • 🌴 **Unlimited PTO**
  • 🚛 Relocation** allowance**
  • 👨‍💻 **Top-notch** equipment
  • 🧳 **Team offsites around the world** - we've already been to more than 5 countries!

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
SREPlatform EngineerDevOps Engineerreliability fundamentalscloud platformsAWSGCPAzureobservability toolsPython
Soft skills
strong communication skillscollaborationpragmatic mindsetadaptabilityhigh-pressure incident management