Hewlett Packard Enterprise

Senior Linux System Administrator – Support Engineer, High Performance Computing

full-time

Location Type: Hybrid

Location: Canberra, Australia

About the role

  • Deploy, configure, maintain, and troubleshoot Linux servers and HPC cluster systems (Red Hat, CentOS, Ubuntu, or others) across physical (primarily), virtual, and cloud environments.
  • Support, maintain, and optimize HPC systems, including installation and servicing of the cluster manager, operating system, and network fabric, and advanced technical troubleshooting of hardware, software, and parallel file systems (e.g., Lustre, GPFS).
  • Monitor system performance, availability, and security using industry-standard tools and practices; ensure compliance with organizational policies and external regulations.
  • Plan and execute upgrades, patches, enhancements, and migrations to ensure systems are current, secure, and optimized.
  • Automate system administration tasks using scripting languages (Bash, Python, Perl, etc.) and configuration management tools (Ansible, Puppet, Chef, Terraform).
  • Implement and maintain backup/recovery strategies, disaster recovery plans, and system documentation.
  • Collaborate with development, network, and security teams to support application deployments and troubleshoot issues, particularly in multi-technology HPC environments.
  • Provide technical consulting, mentoring, and guidance to junior team members and contribute to internal knowledge sharing.
  • Ensure compliance with strict security protocols in sensitive environments (e.g., government, research); TSPV clearance will be required.
  • Participate in on-call rotation and respond to system incidents and outages.
  • Assist with technical proposals, solution design, and enterprise-level architecture for new projects and customer engagements.

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or related field, or equivalent work experience.
  • At least 5 years of hands-on experience managing Linux systems in production environments, including HPC systems.
  • Expertise in Linux/Unix operating systems, parallel file systems (Lustre, GPFS), and networking technologies.
  • Proficiency in scripting/programming languages (Bash, Python, Perl, C++).
  • Experience with automation/configuration management tools (Ansible, Puppet, Chef, Terraform).
  • Strong understanding of networking concepts (TCP/IP, DNS, DHCP, firewalls, VPNs).
  • Familiarity with monitoring/logging tools (Nagios, Grafana, ELK Stack).
  • Experience with containerization technologies (Docker, Kubernetes).
  • Excellent problem-solving, analytical, and communication skills; able to diagnose complex technical problems to root cause.
  • Demonstrated ability to work independently in multi-technology environments and collaborate across teams.
  • Relevant certifications (RHCE, LFCS, AWS Certified SysOps Administrator, etc.) are a plus.
  • TSPV Government Security clearance (mandatory).

Benefits

  • Health & Wellbeing: We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial, and emotional wellbeing.
  • Personal & Professional Development: We also invest in your career because the better you are, the better we all are. We have specific programs tailored to helping you reach any career goals you have, whether you want to become a knowledge expert in your field or apply your skills to another division.
  • Unconditional Inclusion: We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
Linux, HPC systems, parallel file systems, scripting languages, networking technologies, automation tools, monitoring tools, containerization technologies, system performance monitoring, disaster recovery

Soft skills
problem-solving, analytical skills, communication skills, collaboration, mentoring, technical consulting, independent work, knowledge sharing, technical troubleshooting, planning

Certifications
RHCE, LFCS, AWS Certified SysOps Administrator, TSPV clearance