Senior DevOps, Site Reliability Engineer
Appspace
Senior DevOps & Site Reliability Engineer for Appspace, ensuring the reliability and scalability of SaaS applications through automation, integrating AI tools, and managing diverse cloud environments.
Tech Stack
Tools & technologies
- Azure
- Cloud
- Google Cloud Platform
- Kubernetes
- Linux
- MongoDB
- MySQL
- Python
- RabbitMQ
- SQL
- Terraform
About the role
Key responsibilities & impact
- Be the technical anchor for a global platform footprint that includes a mix of Azure IaaS/PaaS, Google Cloud Platform (GCP), Kubernetes, and various data platforms.
- Identify manual "toil" and replace it with automated workflows for monitoring, change management, and routine administration of large-scale VM environments to ensure a positive ROI.
- Lead the integration of AI tools for automated code reviews, development frameworks, and predictive log analysis to drive departmental velocity and efficiency.
- Design and maintain "self-service" deployment frameworks and CI/CD pipelines (GitHub Actions, Bamboo) using Infrastructure as Code (Bicep, Terraform).
- Evaluate platform components to determine the most cost-effective path: automating the current state or migrating features to modern, shared architectures.
- Design and maintain a comprehensive observability stack across Azure and GCP (metrics, logs, traces) to identify performance bottlenecks and proactively address system defects.
- Partner with engineering, security, and operations teams to ensure new features are "born" with reliability, security, and automated delivery in mind; ensure adherence to security best practices and compliance standards (SOC2, HIPAA, ISO 27001) and operational excellence with cost efficiency.
- Investigate complex performance defects by following log trails across web, application, and database tiers (SQL Server, MongoDB, MySQL).
- Ensure all platforms meet security standards (SOC2, HIPAA, ISO 27001) through automated policy enforcement across Azure and GCP.
Requirements
What you’ll need
- 6+ years in DevOps or SRE roles, with a proven track record of bridging development and operations in complex cloud environments.
- Extensive experience with Microsoft Azure (IaaS, PaaS, App Services, Networking) and/or Google Cloud Platform (GCP).
- Expert-level PowerShell and Python skills. Hands-on experience with Bicep or Terraform is required.
- Strong background in Windows/Linux Server OS, Kubernetes (AKS/GKE), Helm, and container orchestration.
- Familiarity with various middleware and PaaS technologies (e.g., Event Hub, Service Bus, CosmosDB, RabbitMQ, MongoDB).
- Expert-level troubleshooting and the ability to reason through complex process workflows to identify faults in large-scale platform environments.
Benefits
Comp & perks
- Health insurance
- 401(k) plan
- Paid parental leave
- Generous PTO
- Flexible work schedules
- Remote work opportunities
- Paid company holidays
- Appspace Quiet Fridays (No non-essential internal meetings scheduled)
- A casual dress work environment
ATS Keywords
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
- DevOps
- SRE
- Microsoft Azure
- Google Cloud Platform
- PowerShell
- Python
- Bicep
- Terraform
- Kubernetes
- SQL Server
Soft Skills
- troubleshooting
- problem-solving
- collaboration
- communication
- leadership
Certifications
- SOC2
- HIPAA
- ISO 27001