Salary
💰 $200,000 - $260,000 per year
Tech Stack
AnsibleAWSCloudCyber SecurityDockerEC2FirewallsGrafanaIoTJenkinsKubernetesLinuxPrometheusPythonTerraform
About the role
- Manage and optimize Claroty’s cloud-based infrastructure in AWS GovCloud ensuring FedRAMP compliance and high availability
- Monitor and enhance system performance, scalability, and reliability through observability tools, automation, and best practices
- Implement and maintain security controls aligned with FedRAMP, NIST 800-53, and federal cybersecurity standards
- Develop and manage infrastructure automation using Terraform and Ansible
- Enhance DevSecOps pipelines, automate deployments, and improve system resilience (GitLab CI/CD, Jenkins, Kubernetes)
- Implement and manage monitoring solutions (Prometheus, Grafana, ELK Stack), respond to incidents, and conduct post-mortems
- Configure and maintain VPCs, VPNs, security groups, and firewalls in AWS GovCloud
- Manage rollout strategy for new technologies and oversee execution to ensure minimal disruption
- Act as first line of response for critical incidents (GOV Production On-Call), triage and coordinate to restore services
- Monitor production performance and degradation and conduct regular infrastructure upgrades
- Oversee release flow ensuring seamless transitions and handle production impacts
- Collaborate with DevOps, security teams, developers, and federal stakeholders to maintain compliant and secure cloud environment
Requirements
- U.S. Citizenship (required for working in GovCloud environments)
- 6-8+ years of experience in SRE, DevOps, or Cloud Engineering roles
- Hands-on experience with AWS GovCloud (EC2, EKS, MSK, S3, RDS, IAM, CloudTrail, CloudWatch)
- Strong expertise in Infrastructure as Code (Terraform, Ansible)
- Experience with FedRAMP, NIST 800-53, and cloud security best practices
- Proficiency in Kubernetes, Docker, and container orchestration
- Knowledge of Linux system administration and scripting (Python, Bash)
- Experience with logging, monitoring, and observability tools (Prometheus, Grafana, ELK Stack)
- Strong troubleshooting, problem-solving, and automation mindset
- Cross-Functional Leadership and Execution skills (autonomy, ownership, communication, collaboration)