Gauge

Site Reliability Engineer – Network Focus, AWS

Site Reliability Engineer ensuring networking reliability in distributed and cloud environments at Stefanini. Focused on AWS management, incident resolution, and network analysis.

Posted 4/30/2026 • Full-time • Remote • Brazil • Mid-Level, Senior

Tech Stack

Tools & technologies
AWS, Cloud, DNS, EC2, Firewalls, Jenkins, TCP/IP

About the role

Key responsibilities & impact
  • Ensure the reliability, performance and availability of connectivity between systems by operating, diagnosing and evolving networks in distributed, hybrid and cloud environments, with a focus on traffic analysis, advanced troubleshooting and communication architecture between services.
  • **Strategic AWS environment management:**
  • Operate and evolve complex, reproducible environments with high availability, performance and horizontal scalability, using services such as EC2, ECS, Lambda, RDS and S3.
  • **Reliability and observability:**
  • Define, implement and evolve end-to-end observability practices (SLIs, SLOs, SLAs) with tools like New Relic, CloudWatch and custom dashboards;
  • Proactively identify bottlenecks and incidents.
  • **Connectivity and network performance:**
  • Diagnose and resolve latency, packet loss and throughput issues in distributed environments;
  • Troubleshoot DNS, VPNs, firewalls and load balancers (L4/L7);
  • Analyze communication flows between services (cloud, on-premises and external integrations);
  • Support the definition and evolution of connectivity architecture between systems.
  • **Automation and CI/CD:**
  • Design, maintain and optimize robust and secure pipelines (Jenkins, Bitbucket, GitOps) for continuous delivery of microservices and serverless workloads.
  • **SRE culture and continuous improvement:**
  • Promote blameless post-mortems, chaos engineering and automation of operational tasks to reduce toil and increase team efficiency.
  • **Security and governance (DevSecOps):**
  • Integrate best practices for IAM, secure networking, encryption, traffic control, vulnerability monitoring and compliance into the application lifecycle, including connectivity troubleshooting and communication analysis between services.
  • **Documentation and knowledge sharing:**
  • Create and maintain clear documentation on architecture, automations, incidents and runbooks, driving autonomy and improving onboarding.

Requirements

What you’ll need
  • Experience with networking in corporate or distributed environments
  • Hands-on experience troubleshooting network issues in production environments
  • Experience in traffic analysis, identifying bottlenecks and resolving critical incidents
  • Advanced knowledge of network protocols (TCP/IP, DNS, BGP, OSPF, NAT)
  • Experience with VPNs, firewalls, load balancers and network segmentation
  • Experience with cloud networking (AWS VPC, subnets, routing tables, NAT Gateway, etc.)
  • Experience with hybrid environments (cloud + on-premises integration)
  • **Preferred:**
  • Experience with CDN and edge technologies (Cloudflare, Akamai, etc.)
  • Experience with Direct Connect or equivalent solutions
  • Knowledge of network automation (Infrastructure as Code, scripting)
  • Experience with network observability tools
  • Familiarity with SRE concepts (SLI/SLO, availability, resilience)

Benefits

Comp & perks
  • Meal allowance / Food voucher
  • Health insurance
  • Dental insurance
  • Day off
  • Gympass / Totalpass
  • Childcare assistance
  • Pet assistance
  • Fuel allowance
  • Home office allowance
  • Educational reimbursement
  • Free online health platform
  • E-learning – Stefanini Academy with a range of courses
  • Mentoring – Mentoring platform (an opportunity to meet people, develop skills and share experiences)
  • Discounts at educational institutions for undergraduate, postgraduate, language and other training courses
  • Perks and discounts at top establishments

ATS Keywords

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
traffic analysis, advanced troubleshooting, network protocols, network automation, cloud networking, CI/CD, observability practices, security best practices, microservices, serverless workloads
Soft Skills
communication, problem-solving, team efficiency, documentation, knowledge sharing