
Senior Site Reliability Engineer
DataSnipper
full-time
Posted on:
Location Type: Remote
Location: Remote • 🇺🇸 United States
Visit company websiteJob Level
Senior
Tech Stack
AzureCloudGoGrafanaPostgresPrometheusPythonRedisSQLTerraformVault
About the role
- Define and own cloud infrastructure strategy, reference architectures, and platform roadmaps for Azure across compute, networking, identity, data, security, and observability.
- Design and implement an enterprise-scale Azure Landing Zone (management groups, subscriptions, RBAC, Azure Policy) and governance for multi-tenant SaaS and regulated customers.
- Architect highly available, multi-region solutions leveraging AKS/Container Apps, App Service, Azure DB for PostgreSQL, Redis, Service Bus/Event Grid, Front Door/Traffic Manager, and CDN.
- Enable secure private connectivity patterns (Private Link, VNet integration, Azure Firewall/WAF, ExpressRoute/VPN) and champion zero-trust principles with Entra ID and Managed Identity.
- Establish platform engineering "golden paths" and reusable accelerators: Terraform modules, environment bootstrapping, and CI/CD templates in GitHub Actions.
- Drive well-architected reviews for mission-critical workloads; translate findings into improvements for reliability, security, performance, and cost with measurable SLOs/SLIs.
- Implement end-to-end observability using Azure Monitor, Log Analytics, Application Insights, and Prometheus/Grafana; automate proactive detection and post-incident improvement plans.
- Partner with Security to implement least-privilege access, PIM, Defender for Cloud, Key Vault, secret rotation, and compliance controls.
- Define and validate DR/BCP strategies (RTO/RPO), including zone-redundancy, geo-replication, backups, and failover testing.
- Mentor and coach engineering teams; lead architecture reviews, threat modeling, technical workshops, and author documentation and reference architectures.
- Evaluate and guide adoption of new Azure capabilities; collaborate with partners and vendors to enhance the platform.
Requirements
- 7+ years in cloud architecture or platform engineering with deep hands-on Microsoft Azure expertise.
- Proven track record designing multi-tenant, multi-region SaaS architectures and enterprise-scale Azure Landing Zones with governance and policy.
- Expertise across Azure services: AKS/Container Apps, App Service, VMSS; VNet/vWAN, Private Link, Azure Firewall, App Gateway/WAF, Front Door; Entra ID (Azure AD), RBAC, Managed Identity, PIM; Storage, Azure SQL DB; Service Bus/Event Grid; Key Vault; Defender for Cloud; Azure Monitor/Log Analytics/Application Insights.
- Strong DevOps/SRE practices: CI/CD (GitHub Actions), GitOps, blue/green and canary deployments, infrastructure testing, progressive delivery.
- Hands-on with Infrastructure as Code (Terraform and/or Bicep; ARM), policy-as-code, and environment bootstrapping at scale.
- Solid grasp of networking and hybrid connectivity (ExpressRoute, VPN), security-by-design, and zero trust.
- FinOps mindset with demonstrable cost optimization, tagging/chargeback, budgets/alerts, and rightsizing.
- Proficiency in scripting/coding (PowerShell and one of Python/C#/Go).
- Nice to have: AZ-305, AZ-400, CKA/CKAD; experience in regulated environments (SOC 2, ISO 27001, HIPAA, GDPR); contributions to public docs/reference architectures.
- Must be based on the East Coast of the United States.
- Must be legally authorized to work in the United States (application asks about sponsorship).
Benefits
- Excellent salary.
- Flexible paid time off.
- Remote work.
- Comprehensive medical and dental coverage.
- 401K match.
- Paid parental leave.
- Stock participation plan.
- Access to OpenUp and Talkspace mental health and wellness platform.
- International working environment and professional development opportunities.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
cloud architectureplatform engineeringMicrosoft Azuremulti-tenant architecturesAzure Landing ZonesDevOps practicesInfrastructure as Codescriptingcost optimizationnetworking
Soft skills
mentoringcoachingleadershipcollaborationcommunicationproblem-solvingtechnical documentationarchitecture reviewsthreat modelingworkshop facilitation
Certifications
AZ-305AZ-400CKACKAD