Tech Stack
AWSCloudDistributed SystemsDockerEC2GraphQLJavaScriptKubernetesNode.jsPostgresPrometheusRustTCP/IPTerraformTypeScript
About the role
- Drive the continuous improvement and high-availability operation of our FedRAMP-High compliant infrastructure.
- Manage and mentor a team of Infrastructure Software Engineers, leading Infrastructure and Platform Engineering efforts.
- Leverage Infrastructure as Code (IaC) to manage large-scale, distributed systems with a strong emphasis on security, reliability, and automation.
- Champion modern practices in architecture, observability, and operability.
- Advocate for secure, scalable design decisions and guide your team in making optimal trade-offs.
- Significantly shape the evolution of our platform, ensuring reliability and performance for critical life and safety applications.
- Enhance and maintain tooling for seamless testing and deployments, fostering rapid feedback and high collaboration between product and infrastructure teams.
- Act as a hands-on tech lead manager, deepening the team’s use of tools like Terraform, Node/TypeScript, and Rust.
- Apply proven experience with managing infrastructure/reliability teams, driving practices such as toil reduction, incident response/management, and Learning From Incidents (LFI).
Requirements
- Bachelor’s Degree in Computer Science OR equivalent experience
- Minimum 5 years industry experience in cloud-native environments
- Minimum 3 years of team lead/management experience
- US or Dual US/Canadian citizenship
- Experience with IaC: AWS CDK, Terraform Cloud
- Experience with Containers: EC2, Docker, Kubernetes
- Networking: TCP/IP
- CI/CD: GitLab, CodeCommit, CodeDeploy
- Observability & Security: CloudWatch, OpenTelemetry, security scanning/automation, Prometheus
- Databases & Search: Postgres, RDS, Aurora, Amazon OpenSearch
- Compliance: FedRAMP/NIST-800, GovCloud deployment, FIPS-140-2
- Operational Tooling & Programming: Node.js, TypeScript, Bash scripting, Rust
- Other Skills: GraphQL, building/managing secure, scalable distributed systems
- Leadership: Experience leading a high-performing engineering team