OnHires

Lead Platform Engineer

OnHires

full-time

Posted on:

Location Type: Remote

Location: Anywhere in Europe

Visit company website

Explore more

AI Apply
Apply

Job Level

About the role

  • Architect and implement scalable infrastructure to support the deployment and management of our trading platform.
  • Build and maintain internal tools to streamline developer workflows, including advanced CI/CD pipelines.
  • Champion IaC practices using Terraform, CloudFormation, or Pulumi.
  • Manage and optimize platform-critical services such as NATS Cluster, RabbitMQ, AWS RDS PostgreSQL, and Redis Cluster.
  • Automate and optimize deployment processes to ensure seamless continuous integration and delivery.
  • Manage and scale containerized workloads using Kubernetes and Docker.
  • Define and maintain Service Level Objectives (SLOs) and Service Level Indicators (SLIs).
  • Implement observability tools and dashboards (e.g., Prometheus, Datadog, Grafana) for real-time system monitoring.
  • Lead incident response efforts, conduct root cause analysis, and implement actionable postmortem reviews.
  • Architect and manage cloud-based systems to handle high-traffic, latency-sensitive applications.
  • Implement robust disaster recovery and business continuity strategies, including backups and multi-region failover.
  • Collaborate with security teams to enforce best practices for IAM, encryption, and compliance.
  • Partner with software engineers to design infrastructure solutions tailored to their application needs.
  • Help shape the engineering culture, promoting a philosophy of security, velocity, and reliability.
  • Mentor junior engineers and document best practices to drive knowledge sharing and operational excellence.
  • Contribute to evolving our backend microservices (currently NodeJS, with some Python and C#) towards Go and Rust.
  • Evaluate and integrate critical third-party software and infrastructure, such as payment gateways and mobility stacks.

Requirements

  • 5-8+ years of hands-on experience with cloud platforms, particularly AWS, including services like EC2, RDS, S3, Lambda, and VPC.
  • Proficiency with Docker and Kubernetes (EKS) or ECS.
  • Strong experience with Terraform, CloudFormation, or Pulumi.
  • Proficiency in at least one programming language (e.g., Python, Go, TypeScript/JavaScript, Ruby, Java).
  • Expertise in building and maintaining CI/CD workflows using tools like GitLab CI, Jenkins, or GitHub Actions.
  • Experience with observability platforms (e.g., Prometheus, Datadog, Grafana).
  • Proven ability to handle incident response, root cause analysis, and postmortem reviews.
  • Ability to research, design, and deliver solutions to complex infrastructure challenges.
  • Experience working directly with product engineers to improve workflows incrementally.
  • Ownership mindset with the ability to mentor team members and advocate for best practices.
Benefits
  • Competitive salary with future equity options
  • Opportunities to work with cutting-edge technologies and evolve our platform.
  • Flexible working hours and a remote-friendly environment.
  • Professional growth through certifications, conferences, and internal training.
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
TerraformCloudFormationPulumiKubernetesDockerAWS RDS PostgreSQLNATS ClusterRabbitMQCI/CDmicroservices
Soft Skills
incident responseroot cause analysismentoringknowledge sharingcollaborationownership mindsetadvocacy for best practicesengineering cultureproblem-solvingcommunication