
Lead Manager
Super Dispatch
full-time
Posted on:
Location Type: Hybrid
Location: Georgia
Visit company websiteExplore more
Job Level
About the role
- Own platform architecture and technical direction.
- Own the technical roadmap for the Platform squad, including infrastructure modernization, reliability improvements, and cost optimization.
- Make and document architecture decisions (ADRs) that affect the entire engineering organization.
- Design and implement cross-cutting platform capabilities: identity/auth services, observability pipelines, deployment infrastructure, and security controls.
- Drive API-first design practices across services.
- Evaluate and adopt new technologies and tools.
- Lead cost analysis and optimization of cloud infrastructure.
- Be the go-to technical escalation point for platform-related questions from all product squads.
- Own incident response for platform-level outages.
- Conduct post-incident reviews and drive follow-up action items to prevent recurrence.
- Set and track platform reliability metrics.
- Contribute directly to platform services - writing production code in Python, Go, or other languages as needed.
- Manage a team of ~4-5 platform engineers with regular 1:1s, career development conversations, and performance reviews.
- Identify hiring needs and technical skill gaps; lead recruiting efforts for the platform squad.
- Foster a collaborative culture where product squads feel supported.
Requirements
- Advanced-level English skills, especially speaking and writing.
- At least 7+ years of experience as a software engineer, with at least 3 years in infrastructure, platform, or SRE roles.
- At least 2 years of experience managing a team of 3-10 engineers.
- Deep hands-on experience with cloud infrastructure (GCP or AWS), Kubernetes, and container orchestration.
- Strong background in at least two of: Java, Python, Go, - with willingness to work across all three.
- Production experience with relational databases at scale (PostgreSQL), including replication, failover, and performance tuning.
- Experience with message brokers (RabbitMQ, Kafka, or similar) and event-driven architectures.
- Experience with observability and monitoring tools (Datadog, Grafana, Prometheus, or similar).
- Track record of leading incident response for production systems and driving reliability improvements.
- Experience with CI/CD pipeline design and deployment automation.
- Demonstrated ability to make architectural decisions and communicate them clearly through ADRs or similar documentation.
Benefits
- Hybrid working
- Professional development opportunities
- Flexible work hours
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PythonGoJavaKubernetescloud infrastructurePostgreSQLRabbitMQKafkaCI/CDevent-driven architecture
Soft Skills
leadershipcommunicationcollaborationteam managementincident responsetechnical decision makingcareer developmentperformance reviewscost optimizationreliability improvements