
Lead Platform Engineer
OnHires
full-time
Posted on:
Location Type: Remote
Location: Anywhere in Europe
Visit company websiteExplore more
Job Level
Tech Stack
About the role
- Architect and implement scalable infrastructure to support the deployment and management of our trading platform.
- Build and maintain internal tools to streamline developer workflows, including advanced CI/CD pipelines.
- Champion IaC practices using Terraform, CloudFormation, or Pulumi.
- Manage and optimize platform-critical services such as NATS Cluster, RabbitMQ, AWS RDS PostgreSQL, and Redis Cluster.
- Automate and optimize deployment processes to ensure seamless continuous integration and delivery.
- Manage and scale containerized workloads using Kubernetes and Docker.
- Define and maintain Service Level Objectives (SLOs) and Service Level Indicators (SLIs).
- Implement observability tools and dashboards (e.g., Prometheus, Datadog, Grafana) for real-time system monitoring.
- Lead incident response efforts, conduct root cause analysis, and implement actionable postmortem reviews.
- Architect and manage cloud-based systems to handle high-traffic, latency-sensitive applications.
- Implement robust disaster recovery and business continuity strategies, including backups and multi-region failover.
- Collaborate with security teams to enforce best practices for IAM, encryption, and compliance.
- Partner with software engineers to design infrastructure solutions tailored to their application needs.
- Help shape the engineering culture, promoting a philosophy of security, velocity, and reliability.
- Mentor junior engineers and document best practices to drive knowledge sharing and operational excellence.
- Contribute to evolving our backend microservices (currently NodeJS, with some Python and C#) towards Go and Rust.
- Evaluate and integrate critical third-party software and infrastructure, such as payment gateways and mobility stacks.
Requirements
- 5-8+ years of hands-on experience with cloud platforms, particularly AWS, including services like EC2, RDS, S3, Lambda, and VPC.
- Proficiency with Docker and Kubernetes (EKS) or ECS.
- Strong experience with Terraform, CloudFormation, or Pulumi.
- Proficiency in at least one programming language (e.g., Python, Go, TypeScript/JavaScript, Ruby, Java).
- Expertise in building and maintaining CI/CD workflows using tools like GitLab CI, Jenkins, or GitHub Actions.
- Experience with observability platforms (e.g., Prometheus, Datadog, Grafana).
- Proven ability to handle incident response, root cause analysis, and postmortem reviews.
- Ability to research, design, and deliver solutions to complex infrastructure challenges.
- Experience working directly with product engineers to improve workflows incrementally.
- Ownership mindset with the ability to mentor team members and advocate for best practices.
Benefits
- Competitive salary with future equity options
- Opportunities to work with cutting-edge technologies and evolve our platform.
- Flexible working hours and a remote-friendly environment.
- Professional growth through certifications, conferences, and internal training.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
TerraformCloudFormationPulumiKubernetesDockerAWS RDS PostgreSQLNATS ClusterRabbitMQCI/CDmicroservices
Soft Skills
incident responseroot cause analysismentoringknowledge sharingcollaborationownership mindsetadvocacy for best practicesengineering cultureproblem-solvingcommunication