Smarsh

Platform Engineer – Hybrid Infrastructure

Smarsh

full-time

Posted on:

Location Type: Hybrid

Location: AtlantaUnited States

Visit company website

Explore more

AI Apply
Apply

Salary

💰 $120,000 - $160,000 per year

About the role

  • Own Kubernetes platform operations, including cluster health, workload deployments, scaling, and incident response.
  • Design, implement, and operate infrastructure automation using Ansible, Terraform, and GitOps workflows (ArgoCD / Flux)
  • Lead migration projects moving on-premises workloads toward One Smarsh (cloud-native) platform services.
  • Build and maintain CI/CD pipelines (CircleCI, GitHub Actions) for infrastructure and application delivery.
  • Drive observability improvements across Datadog, Splunk, and ELK, including dashboards, alert tuning, and SLO/SLA definition.
  • Participate in the on-call rotation, responding to P1/P2 incidents; the team rotates on-call roughly every 4-6 weeks and performs scheduled overnight maintenance windows a few times per year.
  • Support security and compliance requirements, including patch management, access controls, and audit readiness for regulated workloads.
  • Contribute to runbooks and operational documentation as systems are built and changed
  • Collaborate with other Smarsh platform teams on the build and adoption of a One Smarsh platform.

Requirements

  • 4–7 years of experience in platform engineering, SRE, or infrastructure engineering roles.
  • Strong hands-on experience with Kubernetes (cluster operations, Helm, workload troubleshooting).
  • Proficiency with infrastructure-as-code tooling, specifically Ansible and/or Terraform in production environments.
  • Strong Linux systems administration skills (Ubuntu)
  • Experience with GitOps workflows and CI/CD pipelines at scale.
  • Experience with VMware vSphere in a production environment.
  • Demonstrated ability to self-direct and drive projects to completion with minimal oversight.
  • Comfortable operating in evolving environments where processes and tooling are actively maturing.
  • Strong communication skills with cross-functional stakeholders.
  • Experience with one or more of Datadog, Splunk, or ELK for dashboards, monitors, and log management is preferred.
  • Familiarity with compliance-sensitive or regulated industry infrastructure (financial services, healthcare, or similar) is preferred.
  • Experience with ArgoCD, Flux, or similar GitOps continuous delivery tooling is preferred.
  • Familiarity with Jenkins or Concourse for CI/CD pipeline management is preferred.
  • Familiarity with VMware Kubernetes Service (VKS) or other VMware-native Kubernetes platforms is preferred.
  • Python scripting for automation and tooling is preferred.
  • Prior experience in an on-call rotation with a defined SLA structure is preferred.
  • Experience with cloud infrastructure (AWS), beneficial as the team takes on cloud responsibilities later this year is preferred.
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
KubernetesAnsibleTerraformGitOpsCI/CDLinuxVMware vSpherePythonArgoCDFlux
Soft Skills
self-directionproject managementcommunication