
Tech Lead – Cloud Reliability Engineering
Mendix
full-time
Location Type: Hybrid
Location: Rotterdam • 🇳🇱 Netherlands
Job Level
Senior
Tech Stack
AWS, Azure, Cloud, Go, Google Cloud Platform, Grafana, Kubernetes, Linux, Postgres, Prometheus, Python, SQL, Terraform, Unix
About the role
- Writing software/scripts to automate operations on our platform, reducing support requests and engineering time.
- Solving operational issues for our customers by investigating technically and liaising between Mendix Support (1st line) and other development teams in R&D.
- Providing out-of-hours support for critical customer issues on an on-call basis.
- Creating and maintaining monitoring & alerting systems to provide real-time visibility into the performance and availability of the platform (SRE).
- Developing and maintaining dashboards & reports to track key performance indicators and identify trends and issues.
- Delivering and supporting a high-quality, highly available public cloud platform where customers can run Mendix apps.
- Developing and running Mendix Cloud infrastructure and services that offer deployment, operations and monitoring.
Requirements
- You have experience with Site Reliability Engineering (SRE).
- You have coding skills, ideally in Python; it’s a plus if you also have experience with Golang.
- You have good knowledge of infrastructure (AWS).
- You have experience with Infrastructure as Code (IaC), preferably Terraform or OpenTofu.
- You have strong experience with containerization technologies, primarily Kubernetes.
- You're comfortable writing Python scripts that automate complex tasks and reduce manual effort.
- You have excellent communication and people skills, both written and verbal.
- You can spearhead, manage and explain complex technical issues, reducing them to a form that less technical customers and colleagues can understand.
- You have a deep understanding of cloud architecture/deployment and infrastructure services such as web servers, load balancing and SSL/TLS/X.509.
- You have experience with monitoring and logging tools such as CloudWatch, ELK, Grafana, Datadog or Prometheus.
- You have proven experience administering, developing against, or architecting on a cloud platform (AWS is preferred; GCP or Azure is acceptable).
- You have strong experience with containers and Linux/Unix systems.
- You are familiar with SQL/databases (primarily PostgreSQL).
- You have a passion for investigating complex issues and finding solutions on a platform with many distributed applications.
Applicant Tracking System Keywords
Hard skills
Python, Golang, AWS, Infrastructure as Code, Terraform, OpenTofu, Kubernetes, SQL, PostgreSQL, Linux
Soft skills
communication skills, people skills, problem-solving, technical explanation, management