Tech Stack
AnsibleAWSKubernetesPostgresPythonTerraform
About the role
- Own and evolve our AWS infra (ECS/Fargate, RDS/Postgres, S3, IAM, VPC)
- Maintain and improve CI/CD pipelines (GitHub Actions) and Infra-as-Code (Terraform)
- Set up alerting, log aggregation, and performance dashboards (CloudWatch, Datadog, or OpenTelemetry)
- Implement secure-by-default practices; support SOC 2 / HIPAA readiness
- Write Python/Bash scripts for backups, monitoring, deployment hooks, etc.
- Assist with Postgres tuning, access control, and migrations
- Lay groundwork for container orchestration (EKS/Kubernetes) if/when we scale beyond ECS
Requirements
- 3+ years in DevOps, DevSecOps, or SRE roles
- Strong AWS experience, especially ECS and Terraform
- Experience implementing or improving monitoring systems and incident workflows
- Comfort with scripting (Python, Bash) for automation and operational tooling
- Familiarity with Postgres in production
- Exposure to Ansible or other config mgmt tools (bonus)
- Experience supporting compliance frameworks (HIPAA, SOC 2) (bonus)
- Familiarity with Kubernetes or EKS or desire to grow into it (bonus)
- Exposure to AI/ML infra (e.g., GPU workloads, model deployment) (bonus)