NetBox Labs

Director, Observability Product

NetBox Labs

full-time

Posted on:

Origin:  • 🇺🇸 United States

Visit company website
AI Apply
Manual Apply

Salary

💰 $210,000 - $230,000 per year

Job Level

Lead

Tech Stack

CloudDistributed SystemsDNSGoGrafanaGRPCKafkaOpen SourcePostgresPython

About the role

  • Director, Observability Engineering to lead teams responsible for edge agents, telemetry pipelines, alerting, dashboarding, and fleet control plane
  • Report to the Chief Technology Officer and deliver scalable, reliable products built on the Orb observability platform
  • Lead engineering teams focused on network and infrastructure observability, telemetry, monitoring products, and fleet-managed agents
  • Provide leadership and guidance for architecture and evolution of the observability platform, ensuring scalability, resilience, and data quality
  • Guide and grow international teams across telemetry ingestion, real-time analytics, visualization, and agent management
  • Drive excellence in ingestion, storage, and analysis of massive network and telemetry data streams
  • Support team operations including backlog management, technical planning, and delivery execution
  • Foster a data-driven culture focused on rapid innovation, prototype-driven engineering, performance, and reliability
  • Collaborate across product, design, and platform engineering teams to ensure alignment

Requirements

  • Strong people leadership skills, with a track record of mentoring senior engineers
  • Experience leading cross-functional engineering teams across multiple international locations
  • 10+ years in engineering, including 5+ years in managing technical teams and 5+ years in a startup environment
  • Proven ability to move quickly through problem-solving cycles with strong judgment around where to be meticulous and where to optimize for speed
  • History of delivering and finishing projects rapidly, with clear examples of high-output execution and momentum
  • Excellent communication and leadership skills, with the ability to collaborate across engineering and go-to-market functions
  • Deep expertise in networking and infrastructure, distributed systems, observability, data pipelines, or network telemetry
  • Proficiency in cloud-native architectures, data storage, and streaming technologies
  • Nice to have: Experience with syslog, DPI, IPMI, DNS, BGP, SNMP, NetFlow/sFlow, gNMI, gRPC, eBPF, and streaming telemetry
  • Nice to have: Familiarity with agent-based architectures, remote updates, and fleet management
  • Nice to have: Familiarity with Grafana, Postgres, ClickHouse, Kafka, Elastic, or similar systems for telemetry and analytics
  • Nice to have: Familiarity with Golang, Python, and distributed data systems
  • Nice to have: Experience with AI/ML-driven observability or anomaly detection
  • Nice to have: Familiarity with OpenTelemetry and related standards