Platform Engineer
Casumo
Platform Engineer responsible for operating and improving production infrastructure at Casumo, an international online casino company. The role involves collaborating with development and infrastructure teams to ensure scalability and reliability.
Tech Stack
Tools & technologies: Cloud, Elasticsearch, Google Cloud Platform, Grafana, Java, Kafka, Kubernetes, Linux, Prometheus, Python, RabbitMQ, Terraform
About the role
Key responsibilities & impact
- Operate, scale, and continuously improve our production Kubernetes clusters running on Google Cloud Platform (GCP).
- Build, manage, and evolve cloud infrastructure using Infrastructure as Code (Terraform).
- Maintain and optimise critical messaging and event-streaming infrastructure, including RabbitMQ and Kafka.
- Manage edge networking, security, and traffic routing through Cloudflare.
- Automate operational processes and improve CI/CD pipelines to support safe, fast, and reliable releases.
- Collaborate with development teams to optimise Java services, JVM performance, container resource allocation, and connection pooling.
- Maintain and troubleshoot observability and logging platforms such as Elasticsearch and Kibana.
- Lead real-time incident response, structured triage, and mitigation during production outages.
- Coordinate cross-functional teams and communicate effectively with both technical and non-technical stakeholders during incidents.
- Design and execute load testing strategies to validate system resilience under peak traffic conditions.
- Conduct blameless postmortems, identify systemic risks, and implement long-term improvements to prevent recurring incidents.
- Improve monitoring, alerting, and observability using Prometheus and Grafana, helping reduce alert fatigue and improve Mean Time To Recovery (MTTR).
Requirements
What you’ll need
- 3+ years of experience in Platform Engineering, DevOps, or Site Reliability Engineering
- Strong hands-on experience operating and troubleshooting Kubernetes in production environments
- Experience running infrastructure on GCP or another major cloud provider
- Solid experience with Infrastructure as Code, particularly Terraform
- Experience managing message brokers and event-streaming platforms such as RabbitMQ and Kafka
- Experience managing production incidents, conducting load testing, and driving structured postmortems
- Strong Linux systems knowledge with excellent troubleshooting skills across infrastructure, networking, and application layers
- Scripting and automation experience using Bash and/or Python
Benefits
Comp & perks
- Private health insurance
- Wellness incentives, including a fitness allowance and mental well-being services
- Flexible national holidays: public holidays mean more time off, and you choose how and when to enjoy them!
- 2 weeks of Work From Anywhere (10 days), increasing to 4 weeks (20 days) after a longer period of employment with the company: explore the world while working remotely
- Gourmet lunches and healthy snacks prepared by our in-house chef
- Variety of discounts from local vendors
- Access to some of the greatest tools and platforms for developing your professional skills and building success within your role
- A range of training courses, known as Casumo College, for continuous learning and growth
- Social events for building strong relationships with colleagues from all across the organisation
ATS Keywords
Hard Skills & Tools
Kubernetes, Google Cloud Platform, Terraform, RabbitMQ, Kafka, CI/CD, Elasticsearch, Kibana, Prometheus, Grafana
Soft Skills
incident response, triage, communication, collaboration, troubleshooting, load testing, postmortems, system resilience, cross-functional coordination, process improvement