Salary
💰 $210,000 - $230,000 per year
Tech Stack
CloudDistributed SystemsDNSGoGrafanaGRPCKafkaOpen SourcePostgresPython
About the role
- Director, Observability Engineering to lead teams responsible for edge agents, telemetry pipelines, alerting, dashboarding, and fleet control plane
- Report to the Chief Technology Officer and deliver scalable, reliable products built on the Orb observability platform
- Lead engineering teams focused on network and infrastructure observability, telemetry, monitoring products, and fleet-managed agents
- Provide leadership and guidance for architecture and evolution of the observability platform, ensuring scalability, resilience, and data quality
- Guide and grow international teams across telemetry ingestion, real-time analytics, visualization, and agent management
- Drive excellence in ingestion, storage, and analysis of massive network and telemetry data streams
- Support team operations including backlog management, technical planning, and delivery execution
- Foster a data-driven culture focused on rapid innovation, prototype-driven engineering, performance, and reliability
- Collaborate across product, design, and platform engineering teams to ensure alignment
Requirements
- Strong people leadership skills, with a track record of mentoring senior engineers
- Experience leading cross-functional engineering teams across multiple international locations
- 10+ years in engineering, including 5+ years in managing technical teams and 5+ years in a startup environment
- Proven ability to move quickly through problem-solving cycles with strong judgment around where to be meticulous and where to optimize for speed
- History of delivering and finishing projects rapidly, with clear examples of high-output execution and momentum
- Excellent communication and leadership skills, with the ability to collaborate across engineering and go-to-market functions
- Deep expertise in networking and infrastructure, distributed systems, observability, data pipelines, or network telemetry
- Proficiency in cloud-native architectures, data storage, and streaming technologies
- Nice to have: Experience with syslog, DPI, IPMI, DNS, BGP, SNMP, NetFlow/sFlow, gNMI, gRPC, eBPF, and streaming telemetry
- Nice to have: Familiarity with agent-based architectures, remote updates, and fleet management
- Nice to have: Familiarity with Grafana, Postgres, ClickHouse, Kafka, Elastic, or similar systems for telemetry and analytics
- Nice to have: Familiarity with Golang, Python, and distributed data systems
- Nice to have: Experience with AI/ML-driven observability or anomaly detection
- Nice to have: Familiarity with OpenTelemetry and related standards