Tech Stack
AnsibleAWSAzureCloudDistributed SystemsDjangoDockerFlaskGoogle Cloud PlatformGrafanaJenkinsKubernetesLinuxOraclePrometheusPythonTerraformVagrantVMware
About the role
- DevOps Responsibilities
· Build and maintain CI/CD pipelines with Jenkins, GitLab, or similar tools.
· Automate infrastructure using Terraform, CloudFormation, and Ansible.
· Manage Docker and Kubernetes environments.
· Work with developers for efficient integration and deployment.
· Monitor and optimize cloud platforms (AWS, GCP, Azure).
SRE Responsibilities
· Establish and monitor SLOs, SLIs, and error budgets to ensure system reliability.
· Utilize Prometheus, Grafana, and Nagios for effective monitoring and alerting.
· Direct incident response efforts and conduct thorough root cause analysis.
· Implement automation to minimize manual intervention.
· Contribute to disaster recovery planning and facilitate regular testing.
· Serve as the weekly on-call specialist for managing system alerts.
Requirements
- · Over 6 years' experience in DevOps and SRE.
· Skilled in Python, Bash, and Linux administration.
· Knowledge of Python frameworks (Flask, Django).
· Experience with AWS, GCP, and Azure cloud platforms.
· Familiar with networking tools (Wireshark, NMAP, TCPDump).
· Worked with log management systems (ELK, Fluentd).
· Understanding of performance testing and chaos engineering.
· Familiar with virtualization tools: KVM, libvirt, VMware, Hyper-V, Oracle VirtualBox, Vagrant.