Architect, design, and develop platform services with a strong focus on scalability, security, and developer experience.
Lead operational design for reliability: build comprehensive observability, monitoring, and incident response automation into security-critical services.
Build automation and tooling to drive self-healing systems, proactive risk detection, failure recovery, and continuous resilience testing.
Collaborate with compliance, governance, and risk teams to translate regulatory and policy requirements into scalable technical controls.
Lead technical design reviews, security architecture reviews, and incident postmortems for platform-level incidents.
Mentor engineers across multiple disciplines on both security and operational best practices.
Own end-to-end delivery of services: from initial design and development through deployment, production hardening, and lifecycle maintenance.
Requirements
6+ years of experience in software engineering, SRE, or security engineering roles, with significant experience operating security platform services.
Strong backend software development experience (Go, Java, Rust, Python).
Expertise with distributed systems, cloud infrastructure (AWS, GCP, Azure), Kubernetes, service mesh, and container orchestration.
Strong understanding of security domains: IAM, OAuth2, OIDC, PKI, secrets management, policy engines, audit pipelines, zero trust architecture.
Experience building highly reliable, observable, and resilient production systems.