Procurement Sciences AI

Sr. DevOps Engineer

full-time

Origin: 🇺🇸 United States • District of Columbia, Utah, Washington

Job Level

Senior

Tech Stack

AWS, Azure, Cloud, Docker, Google Cloud Platform, Grafana, Kubernetes, Microservices, Prometheus, Python, SaltStack, Terraform, TypeScript

About the role

We are seeking a results-driven and innovative DevOps Engineer to play a critical role in building, maintaining, and optimizing our secure, multi-cloud, AI-driven SaaS platform. You'll work hand-in-hand with software engineering and SRE teams to deliver secure, reliable, automated cloud infrastructure and DevSecOps pipelines that support production, test, and development environments at scale. The ideal candidate will bring deep experience in Azure, AWS, and Google Cloud Platform (GCP), strong Kubernetes skills, and a proactive approach to automation, observability, and reliability, ideally within a high-growth SaaS or GovCon context.

  • Deploy, configure, and manage our AI-powered SaaS platform across Azure, AWS, and Google Cloud Platform (GCP), ensuring high availability, security, scalability, and cost-effectiveness.
  • Design, implement, and maintain robust CI/CD pipelines (Azure DevOps, GitHub Actions, GitLab, or similar), automating build, test, deployment, and infrastructure provisioning workflows.
  • Collaborate closely with development, SRE, and security teams to enable rapid, reliable software releases, with baked-in system observability, compliance, and security controls.
  • Manage and optimize multi-cloud virtual private networks, container orchestration (Kubernetes, Helm), and microservices deployments.
  • Operate, monitor, and tune infrastructure by applying best practices in cloud security, cost management, and resource utilization; develop infrastructure as code using Terraform, ARM/Bicep templates, and/or similar tools.
  • Implement, manage, and integrate monitoring, alerting, and logging platforms (Datadog, Prometheus, Grafana, ELK, cloud-native tools, etc.) to ensure visibility into service health and performance.
  • Ensure infrastructure and application environments meet and maintain security, compliance, and auditability requirements (supporting FedRAMP, CMMC, and public sector regulatory needs).
  • Conduct regular assessments, vulnerability scans, and incident response drills; manage secrets and key rotation in line with industry best practices.
  • Provide technical guidance and production support, including troubleshooting, root cause analysis, and incident management to minimize downtime.
  • Create and maintain clear, actionable documentation, runbooks, and technical process guidance.
  • Keep current with industry trends, new cloud technologies, and DevSecOps best practices, introducing improvements and driving adoption within the organization.

Requirements

  • Demonstrated success as a DevOps Engineer, Site Reliability Engineer, or Cloud Engineer in high-growth SaaS or cloud-native businesses.
  • Strong, hands-on experience deploying and supporting applications in Azure, AWS, and Google Cloud Platform environments.
  • Deep knowledge of containerization (Docker, Kubernetes) and orchestration practices using tools like Helm.
  • Proven ability to build, maintain, and optimize CI/CD pipelines (such as Azure DevOps, GitHub Actions, GitLab CI, or similar).
  • Proficient in scripting languages such as Bash and PowerShell; experience with Python or TypeScript is highly desirable.
  • Experience managing infrastructure as code (Terraform, ARM, Bicep) and automating cloud provisioning at scale.
  • Familiarity with monitoring, logging, and observability platforms, and a strong understanding of SLIs, SLAs, and SLOs.
  • Sound understanding of cloud security, networking fundamentals, and identity/access management in public and hybrid cloud architectures.
  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent professional experience.