Build, deploy, and manage infrastructure automation using Terraform and Ansible.
Design, implement, and support highly available, fault-tolerant systems across hybrid environments (on-premises and cloud).
Manage and optimize AWS infrastructure and services with a focus on scalability, performance, cost efficiency, and security.
Integrate automation into CI/CD pipelines to streamline deployments and reduce manual intervention.
Develop and maintain automation frameworks, reusable modules, and deployment templates.
Establish and enforce standards for monitoring, alerting, logging, and observability.
Collaborate with development and operations teams to align infrastructure automation with broader technology roadmaps.
Review and enhance system architecture to meet best practices in security, resiliency, and reliability.
Document and maintain standard operating procedures, including deployment workflows, incident response, and disaster recovery.
Provide tier 4 support, troubleshooting, and resolution of infrastructure and automation-related issues.
Act as escalation point for systems after business hours only when tier 3 on-call rotation personnel are unable to resolve customer service impairments.
Requirements
15+ years of applicable experience
Automation & IaC Expertise: Advanced proficiency in Ansible and Terraform for infrastructure provisioning, configuration management, and automation at scale.
Infrastructure as Code / GitOps: Strong knowledge of IaC, Configuration as Code, and GitOps principles and best practices.
Cloud (AWS): Hands-on experience with a broad range of AWS services, including EC2, VPC, IAM, S3, CloudWatch, CloudTrail, Route53, ELB, Transit Gateway, Direct Connect, EKS, ECS, and KMS.
Deep understanding of AWS networking, monitoring, and security best practices.
DevOps & CI/CD: Proficiency with CI/CD pipelines and tooling (e.g., GitLab CI/CD, Packer, Python, or similar scripting languages).
Containers & Orchestration: Solid understanding of Docker and Kubernetes for application deployment and management.
Security & Compliance: Working knowledge of OS-level hardening, vulnerability remediation, and secure infrastructure design.
Networking: Strong foundation in networking fundamentals (Layer 2/3, IPv4; IPv6 a plus).
Monitoring & Observability: Familiarity with tools such as Prometheus, Grafana, ELK/EFK stacks, CloudWatch, Datadog, or equivalent platforms.
Problem-Solving & Troubleshooting: Proven ability to diagnose and resolve complex issues across automation, infrastructure, cloud, and networking layers.
Collaboration & Communication: Excellent communication skills with experience working in cross-functional DevOps/Engineering teams.
Leadership & Initiative: Self-starter with adaptability, able to thrive in fast-paced, mission-critical environments.
Qualifications: Bachelor’s degree in engineering or business. MBA preferred.
Certifications: AWS certifications (e.g., Solutions Architect, SysOps, or DevOps Engineer) are highly desirable.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
TerraformAnsibleAWSCI/CDDockerKubernetesPythonGitOpsInfrastructure as CodeMonitoring