
Senior DevOps Engineer
SuccessKPI
full-time
Posted on:
Location Type: Remote
Location: Maryland • Virginia • United States
Visit company websiteExplore more
Job Level
About the role
- Be a key member of the Engineering team, responsible for designing, building, and maintaining the infrastructure that supports our SaaS analytics platform.
- Champion automation, reliability, security, and scalability as you optimize cloud-based environments and drive best practices across CI/CD pipelines, monitoring, and infrastructure-as-code.
- Design, implement, and maintain scalable, reliable, and secure infrastructure on cloud platforms (primarily AWS).
- Create new infrastructure or environments to meet evolving customer and product demands.
- Monitor infrastructure performance and availability, ensuring high uptime and efficiency.
- Apply infrastructure-as-code principles using tools such as Terraform, AWS CloudFormation, or the Serverless framework.
- Build, maintain, and optimize CI/CD pipelines for application deployments using tools like Jenkins, Bitbucket Pipelines, or equivalent.
- Automate and standardize release processes to support frequent, reliable, and fast software delivery.
- Support production release and bug-fix deployments, including environment configurations.
- Develop scripts and tooling (using Python, Bash, Node.js, etc.) to automate infrastructure management, deployments, and operational tasks.
- Champion continuous improvement in automation to reduce manual effort and improve reliability.
- Implement and manage robust monitoring and logging systems using AWS CloudWatch, Datadog, Dynatrace, or custom solutions.
- Proactively identify, troubleshoot, and resolve infrastructure and application issues before they impact end users.
- Participate in on-call rotations for critical production systems support.
- Apply security best practices across all infrastructure layers to ensure secure operations.
- Conduct routine security audits and vulnerability assessments to maintain compliance with applicable standards and frameworks.
- Partner closely with development teams to support application architecture, deployments, and infrastructure decisions.
- Collaborate with QA, Product, and Customer Support teams to resolve customer-impacting issues and improve system reliability.
Requirements
- Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field.
- 8+ years of experience in a DevOps, Site Reliability Engineer (SRE), or Operations Engineer role in a cloud-first environment.
- Strong hands-on experience with AWS cloud services and infrastructure management.
- Proficiency in scripting and development (Python, Bash, Node.js).
- Experience with containerization and orchestration tools such as Docker, Kubernetes, or AWS ECS.
- Deep familiarity with CI/CD tools: Jenkins, GitLab CI, CircleCI, Bitbucket Pipelines, or similar.
- Proficient in infrastructure-as-code: Terraform, AWS CloudFormation, Serverless framework, etc.
- Solid understanding of networking, security, Linux system administration, and cloud architecture patterns.
- Excellent analytical, problem-solving, and troubleshooting abilities.
- Strong communication and collaboration skills across technical and non-technical stakeholders.
- Self-starter with a proactive mindset and a passion for continuous improvement.
- Comfortable working independently and as part of a distributed team.
Benefits
- Opportunity to work for an organization that prides itself on offering a diverse and dynamic culture where employees are proud to work
- Opportunity to work for a fast-growth global company in the rapidly growing analytics space
- Opportunity for career development and growth opportunities as we grow and scale
- Opportunity to build industry relationships and work alongside seasoned industry experts
- Opportunity to work with our leadership team to strategize, collaborate, and solve customer challenges every day - YOU HAVE A VOICE AT SUCCESSKPI!
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
AWSinfrastructure-as-codeTerraformAWS CloudFormationServerless frameworkCI/CDJenkinsDockerKubernetesscripting
Soft Skills
analytical skillsproblem-solvingtroubleshootingcommunicationcollaborationproactive mindsetcontinuous improvementindependent workteamworkcustomer support