Define and maintain scalable system architectures leveraging AWS (preferred) or other major cloud providers, with emphasis on CloudFormation/IaC
Architect and maintain container orchestration solutions (Docker, Kubernetes/EKS, ECS), optimizing deployment and resource utilization
Develop and maintain automation scripts and tooling in Python with a focus on production-grade code, performance, and maintainability
Integrate security best practices and government compliance standards (e.g., FedRAMP, FIPS) throughout the DevOps lifecycle, working closely with InfoSec teams
Optimize CI/CD pipelines and automation workflows using tools such as Jenkins, GitHub Actions, or AWS CodePipeline to drive developer productivity
Implement and manage observability and monitoring solutions, including APM and logging tools (Datadog, CloudWatch, Prometheus, Grafana, Kibana) for system reliability and incident response
Mentor junior engineers on DevOps methodologies, conduct code reviews, and provide technical guidance
Collaborate closely with cross-functional teams including product managers, software engineers, and security professionals to communicate infrastructure and operational requirements
Design, implement, and maintain the infrastructure and deployment practices that power CoCounsel for Government solutions
Requirements
4+ years in DevOps, Site Reliability Engineering, or a related field, with a strong focus on AWS and container-based architectures
Proven track record designing, building, and supporting production environments with a high degree of automation, reliability, and scalability
Deep understanding of AWS (preferred) or any major cloud provider (GCP/Azure), with an emphasis on CloudFormation or equivalent Infrastructure as Code technology
Experience with container orchestration (Docker, Kubernetes on EKS/GKE/AKS, ECS) and serverless platforms (Lambda, ECS, App Service, Cloud Run)
Experience with compute services (EC2, virtual machines)
Advanced scripting and automation skills in Python, including building and maintaining infrastructure tooling or backend services
Experience optimizing CI/CD pipelines and automation workflows using Jenkins, GitHub Actions, or AWS CodePipeline
Experience with configuration management tools like Ansible is a plus
Strong knowledge of cloud and application security, with practical experience implementing encryption, identity and access management, and network security controls
Practical experience integrating government compliance standards (e.g., FedRAMP, FIPS)
Experience with modern cloud design patterns (microservices, immutable infrastructure, blue/green and canary deployments, data management for scale and resiliency)
Hands-on experience with APM and observability tools such as Datadog, CloudWatch, Prometheus, Grafana, and Kibana
Comfortable with a high level of autonomy and proactive decision-making in a distributed, remote-first team environment
Excellent collaboration and communication skills
Benefits
Hybrid Work Model: Flexible hybrid working environment (2-3 days a week in the office)
Flex My Way: flexible work arrangements, including work from anywhere for up to 8 weeks per year
Market competitive health, dental, vision, disability, and life insurance programs
Retirement savings and competitive 401(k) plan with company match
Annual Bonus eligibility based on enterprise and individual performance
Flexible vacation, sick and safe paid time off, paid holidays (including two company Mental Health Days off)
Access to the Headspace app and mental health resources
Tuition Reimbursement and career development (Grow My Way)
Employee incentive programs and Employee Stock Purchase Plan
Optional hospital, accident and sickness insurance (paid 100% by the employee)
Optional life and AD&D insurance (paid 100% by the employee)
Flexible Spending and Health Savings Accounts
Fitness reimbursement and Employee Assistance Program
Group Legal and Identity Theft Protection benefit (paid 100% by the employee)
Commuter benefits
Adoption & Surrogacy Assistance
Access to 529 Plan
Two paid volunteer days off annually and social impact opportunities
Sabbatical leave
Parental leave
Applicant Tracking System Keywords
Hard skills
AWS, CloudFormation, Infrastructure as Code, Docker, Kubernetes, ECS, Python, CI/CD, Jenkins, Ansible