Lead Manager

Super Dispatch

Full-time

Posted on: 3/5/2026

Location Type: Hybrid

Location: Georgia

Job Level

Senior

Tech Stack

AWS, Cloud, Go, Google Cloud Platform, Grafana, Java, Kafka, Kubernetes, Postgres, Prometheus, Python, RabbitMQ

About the role

Own platform architecture and technical direction.
Own the technical roadmap for the Platform squad, including infrastructure modernization, reliability improvements, and cost optimization.
Make and document architecture decisions (ADRs) that affect the entire engineering organization.
Design and implement cross-cutting platform capabilities: identity/auth services, observability pipelines, deployment infrastructure, and security controls.
Drive API-first design practices across services.
Evaluate and adopt new technologies and tools.
Lead cost analysis and optimization of cloud infrastructure.
Be the go-to technical escalation point for platform-related questions from all product squads.
Own incident response for platform-level outages.
Conduct post-incident reviews and drive follow-up action items to prevent recurrence.
Set and track platform reliability metrics.
Contribute directly to platform services - writing production code in Python, Go, or other languages as needed.
Manage a team of ~4-5 platform engineers with regular 1:1s, career development conversations, and performance reviews.
Identify hiring needs and technical skill gaps; lead recruiting efforts for the platform squad.
Foster a collaborative culture where product squads feel supported.

Requirements

Advanced-level English skills, especially speaking and writing.
At least 7 years of experience as a software engineer, with at least 3 years in infrastructure, platform, or SRE roles.
At least 2 years of experience managing a team of 3-10 engineers.
Deep hands-on experience with cloud infrastructure (GCP or AWS), Kubernetes, and container orchestration.
Strong background in at least two of Java, Python, and Go, with willingness to work across all three.
Production experience with relational databases at scale (PostgreSQL), including replication, failover, and performance tuning.
Experience with message brokers (RabbitMQ, Kafka, or similar) and event-driven architectures.
Experience with observability and monitoring tools (Datadog, Grafana, Prometheus, or similar).
Track record of leading incident response for production systems and driving reliability improvements.
Experience with CI/CD pipeline design and deployment automation.
Demonstrated ability to make architectural decisions and communicate them clearly through ADRs or similar documentation.

Benefits

Hybrid working
Professional development opportunities
Flexible work hours

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools

Python, Go, Java, Kubernetes, cloud infrastructure, PostgreSQL, RabbitMQ, Kafka, CI/CD, event-driven architecture

Soft Skills

leadership, communication, collaboration, team management, incident response, technical decision making, career development, performance reviews, cost optimization, reliability improvements