Tech Stack
CloudKubernetesNode.jsPostgresSpark
About the role
- Escalation point on the Engineering side for high-severity incidents
- Monitor and optimize cloud spend and infrastructure costs, and manage vendor relationships
- Continuously scale and improve common infrastructure elements and platform services
- Lead and grow high-performing engineering teams responsible for cloud infrastructure, DevOps, site reliability and technical operations
- Oversee availability, capacity planning, and disaster recovery to ensure systems are resilient and fault-tolerant
- Collaborate with security and compliance teams to ensure adherence to regulatory requirements across infrastructure and operations
- Establish and monitor KPIs/SLAs for platform reliability, security, and performance
- Provide thought leadership on infrastructure strategy while being willing to dive deep into technical challenges when needed
- Improve developer experience through better tooling, workflows, and internal platforms that accelerate productivity and reduce friction
- Collaborate closely with product engineering teams, security, broker-dealer operations, and report directly to the CTO
Requirements
- 5 years of experience in a similar leadership role in engineering
- Strong ability to work independently, lead and deliver on large tasks, and collaborate with other members of the organization or external partners
- Production experience with Kubernetes, Postgres, event brokers and any of the major cloud providers’ platform services
- Proven track record of building and scaling infrastructure in cloud-native, high-growth environments (preferably in fintech, brokerage, or other regulated industries)
- Strong background in system design, observability, and incident response at scale
- Experience with event-driven architecture, security best practices, compliance frameworks, and regulatory environments relevant to financial services
- Excellent leadership, communication, and cross-functional collaboration skills
- Ability to balance long-term strategy with hands-on technical problem solving