unstructured.io

Site Reliability Engineer – Public Sector

unstructured.io

full-time

Posted on:

Origin:  • 🇺🇸 United States

Visit company website
AI Apply
Manual Apply

Job Level

Mid-LevelSenior

Tech Stack

AnsibleAWSAzureCloudDockerGoGrafanaKubernetesLinuxPrometheusPythonTerraformTypeScript

About the role

  • Build secure, highly available, and scalable cloud infrastructure for federal environments
  • Ensure compliance with FedRAMP, FISMA, and other security frameworks
  • Develop IaC with Terraform, Pulumi, or similar for repeatable, compliant deployments
  • Build and maintain automated CI/CD pipelines that balance speed and security
  • Implement and maintain monitoring, logging, and alerting (Prometheus, Grafana, Datadog, Elastic)
  • Lead incident response, postmortems, and reliability improvements
  • Partner with engineering and program teams for high-assurance rollouts

Requirements

  • 5–9 years managing software deployed to US government or Department of Defense (DOD) networks
  • Active SECRET clearance required; TS/SCI strongly preferred
  • Expertise with AWS GovCloud and/or Azure Government.
  • Deep experience with Kubernetes, Docker, and container orchestration at scale.
  • Strong Linux systems and networking fundamentals.
  • Scripting/automation: Python, Bash, or Go. IaC: Terraform, Pulumi, Ansible (or similar).
  • Strong grasp of monitoring, logging, and observability best practices.
  • Travel required up to 20%