
Senior Site Reliability Engineer – SRE
Associated Nerd Global
full-time
Posted on:
Location Type: Hybrid
Location: Belo Horizonte • 🇧🇷 Brazil
Visit company websiteSalary
💰 R$28,000 per month
Job Level
Senior
Tech Stack
CloudDockerGoogle Cloud PlatformJavaScriptKafkaKubernetesMicroservicesNode.jsPostgresPythonRabbitMQReactTerraform
About the role
- Design cloud and on‑prem infrastructure, and lead Docker/Kubernetes operations (optimizing autoscaling, rollouts, security).
- Develop reliable pipelines (Git/gates/automation) and implement end-to-end observability (SLOs/SLIs/SLAs, logs/metrics/tracing).
- Operate microservices (service mesh, resilience patterns) and manage critical data (PostgreSQL HA/tuning).
- Manage secrets, access policies, supply chain security and system hardening.
- Implement Infrastructure as Code and GitOps (Terraform/Helm/ArgoCD).
- Lead incident response and postmortems with data- and AI-driven continuous improvement.
- Align with Engineering, Product, Data and ML teams.
Requirements
- 6+ years in SRE/DevOps/Platform engineering at high scale.
- Strong expertise in Kubernetes, Docker, CI/CD, observability (SLOs), PostgreSQL, microservices architecture, security, and experience with IaC and GitOps.
- Passion for applying LLMs/AI to operations.
- Experience with Node.js/Python, NestJS/React, Git/Cursor, GCP (other clouds a plus), PostgreSQL, Docker/Kubernetes, Terraform/Helm/ArgoCD.
- Experience with AI SDKs/LLMs, operational automations (n8n/Crew.ai), vector databases (RAG/pgvector), Kafka/RabbitMQ, FinOps, chaos engineering, SAST/DAST.
Benefits
- True autonomy and a highly collaborative environment;
- Direct influence on product and team development;
- Opportunity to grow with the business from the ground up;
- Fixed salary of R$28,000/month (PJ contract) plus real possibility of Stock Options;
- Hybrid work model in Belo Horizonte.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
KubernetesDockerCI/CDobservabilityPostgreSQLmicroservices architectureInfrastructure as CodeGitOpsNode.jsPython
Soft skills
leadershipincident responsecontinuous improvementcollaboration