Pythian

Senior Zabbix Administrator

Pythian

full-time

Posted on:

Origin:  • 🇮🇳 India

Visit company website
AI Apply
Manual Apply

Job Level

Senior

Tech Stack

AnsibleAWSCloudGrafanaLinuxMySQLOraclePostgresPuppetPythonVMware

About the role

  • Design and deploy a highly available and scalable Zabbix architecture (Server, Proxies, Web Frontend, Database) to support a multi-tenant MSP environment.
  • Create, customize, and manage advanced Zabbix templates for a diverse technology stack, including Windows, Linux, VMware, NetApp, EMC, Cisco (Switches, Routers, ASA), databases, and various network appliances using SNMP, IPMI, agents, and API calls.
  • Spearhead the migration of over 2000+ hosts and 10000+ services from the existing legacy monitoring environment to Zabbix.
  • Develop robust custom monitoring scripts using Python and/or Shell to address unique monitoring requirements.
  • Utilize Ansible or Puppet to automate the deployment, configuration, and management of thousands of Zabbix agents across client environments.
  • Build, manage, and optimize insightful monitoring dashboards and reports in Grafana for both internal NOC teams and external clients.
  • Perform essential administration, performance tuning, and maintenance for the Zabbix backend database (PostgreSQL or MySQL).
  • Act as the in-house Zabbix expert, training and mentoring other team members to build their monitoring skills and ensure operational excellence.
  • Establish and maintain clear documentation for monitoring standards, configurations, and standard operating procedures (SOPs).

Requirements

  • Experience: A total of 5-8 years in IT Infrastructure roles with a strong focus on enterprise monitoring.
  • Zabbix Expertise: A minimum of 3-5 years of deep, hands-on experience designing, implementing, and administering Zabbix in a large-scale, mission-critical environment.
  • Linux Administration: Strong command of Linux administration (RHEL/CentOS, Debian/Ubuntu), including system performance tuning and troubleshooting.
  • Scripting Proficiency: Demonstrable skills in Python or Shell for creating custom monitoring checks and automation tasks.
  • Automation Mindset: Proven experience using Ansible, Puppet, or a similar configuration management tool for infrastructure automation.
  • Database Knowledge: Foundational skills in managing and optimizing PostgreSQL or MySQL databases.
  • Visualization Pro: Expertise in creating advanced, data-rich dashboards and custom panels in Grafana.
  • Core Monitoring Protocols: Solid understanding of monitoring protocols and methods, including SNMP, IPMI, WMI, and agent-based data collection.
  • Zabbix Certified Specialist (ZCS) or Professional (ZCP) certification is a huge plus.