SuccessKPI

Senior DevOps Engineer

SuccessKPI

full-time

Posted on:

Location Type: Remote

Location: MarylandVirginiaUnited States

Visit company website

Explore more

AI Apply
Apply

Job Level

About the role

  • Be a key member of the Engineering team, responsible for designing, building, and maintaining the infrastructure that supports our SaaS analytics platform.
  • Champion automation, reliability, security, and scalability as you optimize cloud-based environments and drive best practices across CI/CD pipelines, monitoring, and infrastructure-as-code.
  • Design, implement, and maintain scalable, reliable, and secure infrastructure on cloud platforms (primarily AWS).
  • Create new infrastructure or environments to meet evolving customer and product demands.
  • Monitor infrastructure performance and availability, ensuring high uptime and efficiency.
  • Apply infrastructure-as-code principles using tools such as Terraform, AWS CloudFormation, or the Serverless framework.
  • Build, maintain, and optimize CI/CD pipelines for application deployments using tools like Jenkins, Bitbucket Pipelines, or equivalent.
  • Automate and standardize release processes to support frequent, reliable, and fast software delivery.
  • Support production release and bug-fix deployments, including environment configurations.
  • Develop scripts and tooling (using Python, Bash, Node.js, etc.) to automate infrastructure management, deployments, and operational tasks.
  • Champion continuous improvement in automation to reduce manual effort and improve reliability.
  • Implement and manage robust monitoring and logging systems using AWS CloudWatch, Datadog, Dynatrace, or custom solutions.
  • Proactively identify, troubleshoot, and resolve infrastructure and application issues before they impact end users.
  • Participate in on-call rotations for critical production systems support.
  • Apply security best practices across all infrastructure layers to ensure secure operations.
  • Conduct routine security audits and vulnerability assessments to maintain compliance with applicable standards and frameworks.
  • Partner closely with development teams to support application architecture, deployments, and infrastructure decisions.
  • Collaborate with QA, Product, and Customer Support teams to resolve customer-impacting issues and improve system reliability.

Requirements

  • Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field.
  • 8+ years of experience in a DevOps, Site Reliability Engineer (SRE), or Operations Engineer role in a cloud-first environment.
  • Strong hands-on experience with AWS cloud services and infrastructure management.
  • Proficiency in scripting and development (Python, Bash, Node.js).
  • Experience with containerization and orchestration tools such as Docker, Kubernetes, or AWS ECS.
  • Deep familiarity with CI/CD tools: Jenkins, GitLab CI, CircleCI, Bitbucket Pipelines, or similar.
  • Proficient in infrastructure-as-code: Terraform, AWS CloudFormation, Serverless framework, etc.
  • Solid understanding of networking, security, Linux system administration, and cloud architecture patterns.
  • Excellent analytical, problem-solving, and troubleshooting abilities.
  • Strong communication and collaboration skills across technical and non-technical stakeholders.
  • Self-starter with a proactive mindset and a passion for continuous improvement.
  • Comfortable working independently and as part of a distributed team.
Benefits
  • Opportunity to work for an organization that prides itself on offering a diverse and dynamic culture where employees are proud to work
  • Opportunity to work for a fast-growth global company in the rapidly growing analytics space
  • Opportunity for career development and growth opportunities as we grow and scale
  • Opportunity to build industry relationships and work alongside seasoned industry experts
  • Opportunity to work with our leadership team to strategize, collaborate, and solve customer challenges every day - YOU HAVE A VOICE AT SUCCESSKPI!
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
AWSinfrastructure-as-codeTerraformAWS CloudFormationServerless frameworkCI/CDJenkinsDockerKubernetesscripting
Soft Skills
analytical skillsproblem-solvingtroubleshootingcommunicationcollaborationproactive mindsetcontinuous improvementindependent workteamworkcustomer support