Tech Stack
AWSAzureCloudDockerGoogle Cloud PlatformJavaJavaScriptKafkaKubernetesLinuxMySQLNode.jsPostgresPythonRubyTerraform
About the role
- Automate deployment, monitoring, and management for 100% AWS cloud based infrastructure
- Develop, implement and own developer focused internal tooling and automation as your product
- Share in new feature design as the cloud SME supporting container/k8s first designs
- Create and maintain automated Ci/CD pipelines that streamline the release process and ensure high-quality code is delivered rapidly and continuously in a safe and compliant manner
- Leverage agentic and generative AI and AI tooling in the development, management and enhancement of the Internal Developer and Production Platforms
- Collaborate with developers to identify pain points and bottlenecks in the development process and design and implement solutions that improve the developer experience and increase overall productivity
- Put together comprehensive onboarding programs and training resources to seamlessly integrate new developers into the organization
- Participate in an on-call rotation and provide emergency support occasionally outside of normally scheduled hours as needed
- Evolve and extend observability tools and systems to determine and resolve performance and usability issues
Requirements
- 5+ years of professional technical experience in software engineering, cloud operations, or a related function.
- 3+ years of professional experience implementing and maintaining infrastructure in a public cloud environment such as AWS (heavily preferred), Google Cloud Platform, or Azure
- Experience and demonstrated proficiency with Infrastructure as Code (Terraform preferred)
- Experience implementing and maintaining Kubernetes clusters (EKS preferred)
- Demonstrated proficiency with networking as it pertains to cloud infrastructure (VPCs, subnets, route tables, NACLs, security groups, peering connections, etc)
- Experience implementing and maintaining relational database systems/clusters (RDS or Aurora preferred)
- Experience with security systems such as Web Application Firewall, HIDS/FIM, intrusion detection systems, and vulnerability detection systems
- Experience developing and maintaining internal tooling consumed by others
- Experience with Linux container technologies, such as Docker, OCI, LXC and ability to administer, build and deploy images in an automated manner.
- Proficiency in at least two of the following programming or scripting languages: Ruby, Python, Java, Javascript (NodeJS), Bash
- Working understanding of relational database systems such as MySQL (preferred) and PostgreSQL
- Experience with Kafka is a plus (MSK preferred)
- Experience with generative and agentic AI in an infrastructure management and implementation capacity is a plus
- Experience working in a PCI DSS environment is a plus
- Bachelor's degree in Computer Engineering, Computer Science, or a related field (or equivalent experience).
- Demonstrated experience leveraging AI tools and technologies to improve workflows, enhance decision-making, or drive innovation.