Salary
💰 $72,100 - $105,100 per year
Tech Stack
AnsibleAWSAzureCloudDNSDockerFluxGoogle Cloud PlatformGrafanaJenkinsKubernetesLinuxMongoDBMySQLPostgresPrometheusRedisTerraformVault
About the role
- Operate and harden Kubernetes clusters (EKS/GKE/AKS), VPC/VNet networking, container registries, and service meshes where applicable; tune autoscaling, reliability, and cost
- Design and support multi-region architectures spanning U.S., LATAM and APAC with focus on latency, resilience, and secure connectivity
- Author reusable Terraform modules, Helm charts, and CI/CD pipelines (GitHub Actions/Jenkins/GitLab CI)
- Define SLIs/SLOs; wire up metrics/logs/traces (Prometheus, Grafana, OpenTelemetry); build actionable alerts; participate in 24×7 on-call rotation
- Implement IAM/RBAC, secrets management (Vault/KMS/Key Vault), image scanning, least-privilege access, and baseline hardening
- Operate managed databases/caches (Postgres/MySQL, MongoDB, Redis) with backup/restore, patching, and performance workflows
- Improve release workflows (blue/green, canary, progressive delivery via Argo CD/Flux) and coach teams on operational readiness
- Partner across U.S.-based engineering, product, sales teams, and LATAM engineering to turn requirements into reliable, secure infrastructure and CI/CD outcomes
- Engage with enterprise clients in security-sensitive, compliance-driven environments to ensure trust and operational excellence
- Participate in an equitable on-call rotation to support 24×7 operations
Requirements
- 2-3 years’ experience as a DevOps or Infrastructure/Network Engineer
- Foundational cloud hands-on in AWS and/or GCP (compute, containers, IAM, VPC/networking, managed DBs); Azure exposure welcome
- Infrastructure as Code with Terraform (plus Helm/Ansible or CloudFormation)
- CI/CD experience (GitHub Actions, Jenkins, or GitLab CI) and container build workflows (Docker, artifact registries)
- Linux + networking fundamentals (DNS, TLS, routing, VPN/IPSec, load balancing, troubleshooting)
- Observability basics—metrics/logs/traces; familiarity with Prometheus & Grafana; experience with OpenTelemetry is a plus
- Strong communication skills; calm under pressure; ownership mindset and eagerness to learn
- Collaborative mindset with experience working across distributed, multicultural teams in multiple time zones (U.S., LATAM, APAC)
- Adaptability to shifting priorities and emerging technologies in a fast-paced environment
- Problem-solving ability with structured analytical approach to troubleshooting and incident resolution
- Attention to detail in technical implementation and documentation
- Customer-oriented mindset for enterprise, security-sensitive, compliance-driven clients
- Resilience and composure for on-call responsibilities
- Must be authorized to work in the US and not require visa sponsorship; eligibility to obtain/maintain a U.S. security clearance is a plus
- Preferred: experience in mission-critical/NOC environments; familiarity with public-sector security frameworks (NIST 800-53, DISA STIGs, Cloud Computing SRG), zero-trust (NIST 800-207)
- Preferred: SRE concepts (SLIs/SLOs, error budgets, incident/postmortem discipline); Kubernetes hardening (CIS Benchmarks, kube-bench)
- Preferred: exposure to supply-chain data/standards (GS1 EPCIS 2.0); Spanish or cross-cultural LATAM collaboration; APAC collaboration experience
- Preferred certifications: Linux fundamentals (Linux Essentials/LPI), CCNA, CKAD, AWS Cloud Practitioner/Google Cloud Digital Leader/AZ-900