Tech Stack
Ansible, AWS, Azure, Chef, Cloud, Consul, Docker, Google Cloud Platform, Jenkins, Kubernetes, Linux, Packer, Prometheus, Puppet, Python, Shell Scripting, Terraform, Unix, Vagrant, Vault
About the role
- Monitor and optimize cloud infrastructure for performance, scalability, and cost-efficiency.
- Manage and maintain CI infrastructure (GitLab CI and Jenkins).
- Manage, maintain, and improve our release and development environments.
- Support critical production infrastructure deployed across multiple clouds (AWS, Azure, and GCP).
- Develop and support the R&D toolchain, and implement best practices for code deployment, testing, and maintenance.
- Automate on-premises lab infrastructure by adopting IaC practices.
- Lead the development of production monitoring, telemetry, alerting, and logging services.
Requirements
- Proven hands-on experience with Docker and Kubernetes in production.
- Hands-on experience deploying and managing complex Kubernetes environments, including services, ingresses, load balancers, and Helm charts.
- Solid understanding of Linux/Unix internals and experience resolving complex performance and configuration problems in Linux/Unix environments.
- Deep familiarity with GCP and AWS for provisioning, networking, and cost-optimization strategies.
- Experience with DSL-based configuration management tools such as Ansible, Chef, or Puppet.
- Experience managing CI infrastructure (GitLab CI and Jenkins).
- Experience with programming languages (Python preferred) and shell scripting.
- Proficiency in SRE and monitoring methodologies (Prometheus and related monitoring stacks).
- Experience with CI/CD tools and frameworks (nice to have).
- Experience managing binary repositories (RPM, PyPI, npm) (nice to have).
- Experience developing Ansible collections, roles, and modules (nice to have).
- Experience managing GitLab and GitLab CI (nice to have).
- Experience with HashiCorp products: Terraform, Packer, Consul, Vault, and Vagrant (nice to have).
- Experience automating configuration and deployment of on-premises lab hardware (nice to have).