
Site Reliability Engineer
Keyrock
full-time
Posted on:
Location Type: Remote
Location: Belgium
Visit company websiteExplore more
About the role
- Cloud Infrastructure Management: Design, deploy, and maintain scalable and resilient infrastructure on AWS using Infrastructure-as-Code (IaC).
- Kubernetes Administration: Manage and optimize Kubernetes clusters for containerized applications, ensuring high availability and security.
- Automation & CI/CD: Implement and manage CI/CD pipelines for efficient deployment, testing, and monitoring of applications.
- Observability & Monitoring: Develop comprehensive monitoring solutions using Prometheus, Grafana, LGTM stack, or similar tools to improve system reliability.
- Security & Compliance: Apply best practices for cloud security, IAM policies, and compliance frameworks (SOC2, ISO 27001, etc.).
- Incident Response & Performance Optimization: Troubleshoot issues, perform root cause analysis, and implement fixes to optimize performance.
- Infrastructure as Code (IaC): Utilize Terraform, Ansible, or similar tools to automate infrastructure provisioning and configuration management.
- Collaboration & Knowledge Sharing: Work closely with software engineering, architecture and security teams to promote DevOps culture and best practices.
- Disaster Recovery & Reliability Engineering: Design failover and backup strategies to ensure business continuity in the event of failures.
Requirements
- Bachelor’s degree in Computer Science, Engineering, or a related field.
- 5+ years of experience in cloud infrastructure, SRE, or DevOps roles.
- Interest in or any exposure to trading or similar themes would be desirable (not essential)
- AWS Certified SysOps Administrator - Associate: desired.
- Strong expertise in AWS (EC2, S3, Lambda, RDS, VPC, IAM, etc.).
- Hands-on experience with Kubernetes (EKS, K3s, or self-managed clusters).
- Proficiency in scripting and automation using Python, Bash, or similar.
- Experience with Infrastructure as Code (Terraform, CloudFormation, or Ansible).
- Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, Datadog, etc.).
- Strong understanding of networking concepts (VPC, Load Balancers, DNS, Firewalls).
- Experience with DevOps methodologies, CI/CD pipelines, and GitOps practices.
- Experience with high-performance and low-latency (sub millisecond) systems.
- Familiarity with serverless architectures and event-driven computing.
- Familiarity with Rust compilation processes and techniques.
- Willing to collaborate and communicate asynchronously.
- Actively look for opportunities to improve the availability and performance of the system by applying the learnings from monitoring and observation
- Team spirit, ownership, critical thinking
- Exposure to cloud cost optimization and FinOps strategies.
- Previous exposure working with Crypto, Traditional Finance (Trad Fi) or Trading would be highly desirable but not essential
Benefits
- A competitive salary package, with various benefits depending on method of engagement (Employee vs Contractor)
- Autonomy in your time management thanks to flexible working hours and the opportunity to work remotely
- The freedom to create your own entrepreneurial experience by being part of a team of people in search of excellence
- Continuing Professional Development plan with learning and certification path in accordance with both the team objectives and areas of interests
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
AWSKubernetesTerraformAnsiblePythonBashCI/CDmonitoringobservabilitynetworking
Soft Skills
collaborationcommunicationteam spiritownershipcritical thinking
Certifications
AWS Certified SysOps Administrator - Associate