Tech Stack
Ansible, AWS, Azure, Cloud, DNS, Docker, Grafana, Linux, MySQL, Postgres, Prometheus, Python, TCP/IP, Terraform
About the role
- Manage and scale global infrastructure; hands-on administration of Linux servers
- Automate operations, configure networks, harden systems, and ensure high availability and performance
- Infrastructure planning, security, compliance, and supporting mission-critical environments
- Install, configure, and harden Ubuntu Server environments (LTS releases)
- Implement automated provisioning using tools such as Ansible and Terraform
- Procure and provision new server hardware and design custom infrastructure solutions tailored to client requests, including complex network topologies across Asia and region-specific server configurations
- Evaluate hosting providers and collaborate on infrastructure rollout plans
- Conduct hosting expenditure audits and recommend cost-optimization strategies
- Coordinate cross-connect implementations and engage with network providers to ensure reliable connectivity and SLAs
- Apply OS patches and perform version upgrades with minimal downtime on Linux and Windows servers
- Apply firmware updates
- Monitor and maintain health of global network and critical infrastructure using Prometheus/Grafana; respond to alerts
- Enforce security policies, conduct vulnerability assessments, and remediate findings
- Administer database access using role-based access control (RBAC)
- Design and maintain backup strategies and disaster-recovery plans; verify backups and perform periodic restore drills
- Develop shell scripts to automate routine operations and maintain IaC repositories leveraging Ansible and Terraform
- Diagnose and resolve server, storage, and network issues; provide level-2/level-3 support
- Create and maintain runbooks, run-level diagrams, and SOPs; participate in on-call rotation
Requirements
- Bachelor’s degree in Computer Science, Information Technology, or equivalent experience
- 5+ years of hands-on experience administering Ubuntu Server (20.04, 22.04, 24.04), including Ubuntu Pro
- Strong command of Linux internals: filesystems, process management, systemd, networking, firewalls, AppArmor, SELinux
- Proficiency with automation/configuration-management tools (Ansible, Terraform)
- Solid scripting skills (Bash, Python, or similar)
- Experience with virtualization/containerization (Docker)
- Hands-on with monitoring/observability stacks (Prometheus, Grafana, ELK)
- Solid understanding of networking fundamentals (TCP/IP, BGP, OSPF, MPLS, DNS, DHCP)
- Hands-on experience with PostgreSQL, MySQL, and ClickHouse administration (Patroni, pg_basebackup/pg_restore, point-in-time recovery, or similar)
- Ability to read and interpret network diagrams, logs, and protocol traces (e.g., Wireshark)
- Good understanding of security best practices and compliance frameworks (e.g., CIS Benchmarks, GDPR)
- Certifications such as Ubuntu Professional, RHCE, or LPIC (preferred)
- Experience with cloud platforms (AWS, Azure, Google Cloud) and hybrid-cloud architectures (preferred)
- Familiarity with CI/CD tools such as GitHub Actions (preferred)