
Site Reliability Engineer, SRE
Handoff
full-time
Posted on:
Location Type: Hybrid
Location: São Paulo • 🇧🇷 Brazil
Visit company websiteJob Level
Mid-LevelSenior
Tech Stack
AWSAzureCloudGoGoogle Cloud PlatformGrafanaPrometheusPython
About the role
- You will own and elevate the reliability, scalability, and observability of Handoff’s platform.
- This is a hands-on role focused on preventing incidents, improving system resilience, and enabling fast, safe product development.
- You’ll work closely with Backend, Fullstack, Data, and AI engineers to ensure our systems are production-ready, observable, and built to scale, while keeping a strong focus on user impact and developer velocity.
- This is not a pure ops role. We’re looking for someone who thinks like an engineer, codes regularly, and partners deeply with product and engineering teams.
Requirements
- Strong experience as an SRE, Platform Engineer, DevOps Engineer, or similar reliability-focused role.
- Solid understanding of reliability fundamentals, availability, latency, error rates, throughput, durability.
- Hands-on experience with cloud platforms like AWS, GCP, or Azure.
- Deep familiarity with observability tools such as Prometheus, DataDog, Grafana, OpenTelemetry, or similar.
- Strong debugging skills and comfort working in high-pressure production incidents.
- Experience improving CI/CD pipelines and release safety.
- Ability to write production-quality code or scripts in languages like Python, Go, or Bash.
- Experience with infrastructure-as-code and automation.
- A pragmatic mindset that balances reliability with product velocity and real-world constraints.
- Strong communication skills and comfort collaborating across engineering, product, and leadership.
- Comfortable in a fast-paced environment, you’re quick to adapt to changing priorities and balance rapid iteration with high-quality outputs.
Benefits
- 💸 Competitive **salary in USD**
- 💰 Attractive **stock options**
- 🌴 **Unlimited PTO**
- 🚛 Relocation** allowance**
- 👨💻 **Top-notch** equipment
- 🧳 **Team offsites around the world** - we've already been to more than 5 countries!
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
SREPlatform EngineerDevOps Engineerreliability fundamentalscloud platformsAWSGCPAzureobservability toolsPython
Soft skills
strong communication skillscollaborationpragmatic mindsetadaptabilityhigh-pressure incident management