Keyrock

Site Reliability Engineer

Keyrock

full-time

Posted on:

Location Type: Remote

Location: Belgium

Visit company website

Explore more

AI Apply
Apply

About the role

  • Cloud Infrastructure Management: Design, deploy, and maintain scalable and resilient infrastructure on AWS using Infrastructure-as-Code (IaC).
  • Kubernetes Administration: Manage and optimize Kubernetes clusters for containerized applications, ensuring high availability and security.
  • Automation & CI/CD: Implement and manage CI/CD pipelines for efficient deployment, testing, and monitoring of applications.
  • Observability & Monitoring: Develop comprehensive monitoring solutions using Prometheus, Grafana, LGTM stack, or similar tools to improve system reliability.
  • Security & Compliance: Apply best practices for cloud security, IAM policies, and compliance frameworks (SOC2, ISO 27001, etc.).
  • Incident Response & Performance Optimization: Troubleshoot issues, perform root cause analysis, and implement fixes to optimize performance.
  • Infrastructure as Code (IaC): Utilize Terraform, Ansible, or similar tools to automate infrastructure provisioning and configuration management.
  • Collaboration & Knowledge Sharing: Work closely with software engineering, architecture and security teams to promote DevOps culture and best practices.
  • Disaster Recovery & Reliability Engineering: Design failover and backup strategies to ensure business continuity in the event of failures.

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • 5+ years of experience in cloud infrastructure, SRE, or DevOps roles.
  • Interest in or any exposure to trading or similar themes would be desirable (not essential)
  • AWS Certified SysOps Administrator - Associate: desired.
  • Strong expertise in AWS (EC2, S3, Lambda, RDS, VPC, IAM, etc.).
  • Hands-on experience with Kubernetes (EKS, K3s, or self-managed clusters).
  • Proficiency in scripting and automation using Python, Bash, or similar.
  • Experience with Infrastructure as Code (Terraform, CloudFormation, or Ansible).
  • Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, Datadog, etc.).
  • Strong understanding of networking concepts (VPC, Load Balancers, DNS, Firewalls).
  • Experience with DevOps methodologies, CI/CD pipelines, and GitOps practices.
  • Experience with high-performance and low-latency (sub millisecond) systems.
  • Familiarity with serverless architectures and event-driven computing.
  • Familiarity with Rust compilation processes and techniques.
  • Willing to collaborate and communicate asynchronously.
  • Actively look for opportunities to improve the availability and performance of the system by applying the learnings from monitoring and observation
  • Team spirit, ownership, critical thinking
  • Exposure to cloud cost optimization and FinOps strategies.
  • Previous exposure working with Crypto, Traditional Finance (Trad Fi) or Trading would be highly desirable but not essential
Benefits
  • A competitive salary package, with various benefits depending on method of engagement (Employee vs Contractor)
  • Autonomy in your time management thanks to flexible working hours and the opportunity to work remotely
  • The freedom to create your own entrepreneurial experience by being part of a team of people in search of excellence
  • Continuing Professional Development plan with learning and certification path in accordance with both the team objectives and areas of interests
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
AWSKubernetesTerraformAnsiblePythonBashCI/CDmonitoringobservabilitynetworking
Soft Skills
collaborationcommunicationteam spiritownershipcritical thinking
Certifications
AWS Certified SysOps Administrator - Associate