Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
Akkadian Labs

DevOps Engineer

Akkadian Labs

DevOps Engineer supporting the design, implementation, and maintenance of scalable infrastructure and DevOps processes at Akkadian Labs. Collaborating with development, QA, and product teams for reliable deployments and automation.

Posted 5/12/2026full-timeRemote • New Jersey • 🇺🇸 United StatesMid-LevelSeniorWebsite

Tech Stack

Tools & technologies
AWSCloudDockerEC2GrafanaJenkinsKubernetesLinuxPrometheusPythonTerraform

About the role

Key responsibilities & impact
  • Support deployment and maintenance of scalable infrastructure in AWS and hybrid cloud environments.
  • Assist in managing infrastructure-as-code (IaC) using Terraform, CloudFormation, or similar tools.
  • Help maintain Linux-based environments.
  • Contribute to containerization efforts using Docker and orchestration via Kubernetes.
  • Work on the design, deployment and management of AI agent workloads, including provisioning compute instances and managing resource scaling for inference-heavy tasks.
  • Play a key role in building and maintaining model deployment pipelines, including versioning, testing, and rollback of AI models in production environments.
  • Monitor AI API consumption and infrastructure costs, implementing alerting and controls to prevent runaway usage and support budget visibility.
  • Coordinate the implementation of infrastructure-level security guardrails for AI systems, including access controls and data isolation for model inputs and outputs.
  • Manage monitoring and observability efforts using tools such as Prometheus, Grafana, and the ELK stack.
  • Troubleshoot system issues and contribute to incident response and root cause analysis.
  • Develop and execute strategies for improving system reliability, performance, and uptime.
  • Build, maintain, and optimize CI/CD pipelines using tools such as Jenkins, BitBucket CI/CD, or similar.
  • Automate routine operational tasks including builds, testing, deployments, and system updates.
  • Work with engineering teams to integrate pipelines with Akkadian tools.
  • Follow secure DevOps practices and assist in implementing security controls.
  • Support compliance initiatives and vulnerability remediation efforts.
  • Work closely with DevOps, engineering, QA, and product teams to support deployments and releases.
  • Maintain documentation for infrastructure, processes, and operational procedures.
  • Participate in team ceremonies and continuous improvement initiatives.

Requirements

What you’ll need
  • Experience: 5+ years of experience in DevOps, Site Reliability Engineering (SRE), or a related role.
  • Cloud Expertise: Hands-on experience with AWS (e.g., EC2, ECS, S3, IAM, Lambda, CloudWatch).
  • Linux Knowledge: Working knowledge of Linux environments.
  • Containerization: Familiarity with Docker and Kubernetes.
  • Scripting: Basic to intermediate scripting ability in Python, Bash, or similar languages.
  • CI/CD: Experience building or maintaining CI/CD pipelines and related tools.
  • Observability: Exposure to monitoring and observability tools such as Prometheus, Grafana, and ELK.
  • Security: Understanding of secure DevOps practices and basic compliance concepts.
  • Preferred Qualifications
  • Experience supporting AI or machine learning workloads, compute environments.
  • Exposure to AI model deployment pipelines and model versioning practices.
  • Experience with infrastructure-as-code tools such as Terraform or CloudFormation.
  • Familiarity with hybrid cloud or on-premises environments.
  • Exposure to security best practices in DevOps contexts, including AI-specific concerns such as data isolation and access controls.
  • Experience supporting production systems and participating in on-call rotations.

Benefits

Comp & perks
  • We offer a fully remote environment
  • a competitive benefits package including medical
  • dental
  • vision
  • company-paid life insurance and disability policies
  • 401(k) with a generous matching program
  • paid time off.

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
AWSTerraformCloudFormationLinuxDockerKubernetesPythonBashCI/CDmonitoring
Soft Skills
incident responseroot cause analysissystem reliabilityperformance improvementteam collaborationdocumentationcontinuous improvement