Build and maintain Azure platform foundations, including networking, identity, secrets, runtime, and data services
Own Infrastructure as Code (Terraform/Bicep), GitHub Actions pipelines, and golden paths/templates
Implement observability (DataDog) across services; establish SLOs and alerting
Provide developer experience tooling and environments; implement cost controls and security guardrails
Use GitHub Copilot and Copilot Chat to draft and iterate on Terraform/Bicep/Pulumi, Helm charts, Kubernetes manifests (AKS), GitHub Actions workflows, and policy-as-code
Configure and operationalize Copilot for day-to-day platform tasks (e.g., generating runbooks, PR descriptions, test scaffolding, scripts), and create reusable “recipes”/playbooks for the team
Design for resilience and disaster recovery across critical infrastructure components; implement backup, failover, and recovery automation
Implement secure secrets management with automated rotation, audit trails, and compliance alignment (e.g., Azure Key Vault, HashiCorp Vault)
Enforce platform governance using policy-as-code (e.g., Azure Policy, OPA, Rego); ensure compliance with internal and external standards
Design observability pipelines using OpenTelemetry, Azure Monitor, and DataDog; enable distributed tracing and root cause analysis
Integrate platform security practices including threat modeling, vulnerability scanning, runtime hardening, and secure CI/CD
Expose platform capabilities via APIs and self-service tooling; empower teams with reusable modules and automation recipes.
Requirements
8+ years of experience in platform/SRE roles on Azure
Expertise in Infrastructure as Code (Terraform or Bicep) and GitHub Actions CI/CD
Proficiency with DataDog (or equivalent) for metrics, logs, and traces; experience with incident response practices
Experience enabling eventing platforms (Kafka or Azure native)
Benefits
Flexible work arrangements
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
Infrastructure as CodeTerraformBicepGitHub ActionsDataDogOpenTelemetryKubernetesAPIsAzureeventing platforms