
Site Reliability Engineer
Seekerh
full-time
Posted on:
Location Type: Remote
Location: Brazil
Visit company websiteExplore more
About the role
- Design, implement, and evolve observability solutions (logs, metrics, traces, alerts, and dashboards)
- Work in AWS environments, ensuring monitoring standards, resilience, and stability
- Define, track, and optimize SLIs and SLOs
- Create and maintain alerts, runbooks, and incident response processes
- Support advanced troubleshooting activities, focusing on identifying and mitigating issues related to performance, availability, and service degradation
Requirements
- Strong expertise in AWS
- Advanced experience with monitoring and observability
- Knowledge of Infrastructure as Code (IaC) using Terraform, CloudFormation, ARM, Bicep
- Experience with CI/CD pipelines
- Strong analytical skills for diagnosing and resolving complex incidents
Benefits
- Work model: 100% remote
- Employment: CLT (Brazilian employment contract)
- Projects with high technical complexity and visibility
- Technically mature, collaborative environment focused on best practices
- Strong support for continuous learning and professional growth
- Exposure to modern technologies and large-scale challenges
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
AWSmonitoringobservabilityInfrastructure as CodeTerraformCloudFormationARMBicepCI/CDanalytical skills