Mendix

Tech Lead – Cloud Reliability Engineering

Mendix

full-time

Posted on:

Location Type: Hybrid

Location: Rotterdam • 🇳🇱 Netherlands

Visit company website
AI Apply
Apply

Job Level

Senior

Tech Stack

AWSAzureCloudGoGoogle Cloud PlatformGrafanaKubernetesLinuxPostgresPrometheusPythonSQLTerraformUnix

About the role

  • Writing software/scripts to automate operations on our platform, reducing support requests and engineering time.
  • Solving operational issues for our customers by investigating technically and liaising between Mendix Support (1st line) and other development teams in R&D.
  • Providing out of hours support for critical customer issues on an on-call basis.
  • Creating and maintaining monitoring & alerting systems to provide real-time visibility into the performance and availability of the platform (SRE).
  • Developing and maintaining dashboards & reports to track key performance indicators and identify trends and issues.
  • Delivering and supporting a high quality, highly available public cloud platform where customers can run Mendix apps.
  • Developing and running Mendix Cloud infrastructure and services that offer deployment, operations and monitoring.

Requirements

  • You have experience with Site Reliability Engineering (SRE).
  • You have coding skills, ideally in Python; it’s a plus if you also have experience with Golang.
  • You have good knowledge of infrastructure (AWS).
  • You have experience with Infrastructure as Code (IaC), preferably Terraform or OpenTofu.
  • You have strong experience with containerization technologies, primarily Kubernetes.
  • You're comfortable writing a Python script to automate complex tasks to reduce manual effort.
  • You have excellent communication and people skills, both written and verbal.
  • You have the ability to spearhead, manage and explain complex technical issues and reduce them to a form that less technical customers & colleagues can understand.
  • A deep understanding of Cloud architecture/deployment and infrastructure services like web servers, load balancing, SSL/TLS/X509, etc.
  • You have experience with monitoring and logging tools such as CloudWatch, ELK, Grafana, Datadog or Prometheus.
  • Proven experience administering, developing against, or architecting on a cloud platform (AWS is preferred; GCP or Azure acceptable).
  • You have strong experience with containers and Linux/Unix systems.
  • You are familiar with SQL/databases (primarily PostgreSQL).
  • A passion for investigating complex issues and finding out the solution in a platform with many distributed applications.

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
PythonGolangAWSInfrastructure as CodeTerraformOpenTofuKubernetesSQLPostgreSQLLinux
Soft skills
communication skillspeople skillsproblem-solvingtechnical explanationmanagement