Tech Stack
Apache, Azure, Cloud, Grafana, Jenkins, Kafka, Kubernetes, Microservices, Node.js, Prometheus, Python, Splunk, Terraform
About the role
- Design, build, and maintain CI/CD pipelines using Azure DevOps.
- Automate infrastructure provisioning using Terraform, ARM templates, or Bicep.
- Deploy, configure, and manage Confluent or Apache Kafka clusters (on-prem, Confluent Cloud, or Azure).
- Create and configure Kafka topics, partitions, consumer groups, and security settings (SSL/TLS, SASL, ACLs).
- Monitor and troubleshoot Kafka performance, lag, and reliability issues.
- Manage Azure resources including AKS, Azure Functions, Azure Event Hubs, and Azure Monitor.
- Integrate Kafka with downstream and upstream systems (databases, APIs, data lakes, event-driven microservices).
- Maintain robust monitoring, logging, and alerting using Prometheus, Grafana, ELK, or Azure Monitor.
- Collaborate with development, data engineering, and architecture teams to support event-driven systems.
- Ensure high availability, fault tolerance, and disaster recovery configurations for Kafka clusters.
Requirements
- 5+ years of experience in DevOps engineering with a focus on Azure Cloud
- Hands-on experience with Apache Kafka / Confluent Kafka (setup, administration, and troubleshooting)
- Strong skills in CI/CD tools (Azure DevOps, GitHub Actions, Jenkins, etc.)
- Proficiency in Infrastructure as Code (IaC) tools: Terraform, ARM templates, or Bicep
- Experience with container orchestration (Kubernetes, AKS)
- Knowledge of networking, load balancing, and security best practices in cloud environments
- Familiarity with monitoring and logging tools (Prometheus, Grafana, ELK, Azure Monitor, Splunk)
- Strong scripting skills in Bash, PowerShell, or Python
- Good understanding of event-driven architectures and real-time data streaming