Site Reliability Engineer – APAC

Site Reliability Engineer improving and scaling the reliability of the Pod platform, focusing on incident response and operational tooling.

Posted 6/19/2026 • Full-time • Remote • 🇲🇾 Malaysia • Mid-Level, Senior • 💰 $100,000 per year

Tools & technologies

Cloud, Distributed Systems, Docker, Grafana, Linux, Prometheus, Python, Rust

Key responsibilities & impact

What you’ll need

Strong experience with Linux and cloud infrastructure
Experience operating and supporting production systems
Experience with Docker and containerized environments
Experience with observability and incident-management tools such as Grafana, Prometheus, PagerDuty, or similar
Ability to automate workflows using Rust, Python, Bash, or similar languages
Strong troubleshooting and debugging skills
A high degree of ownership and the ability to make sound decisions independently
Nice to have: experience with distributed systems, high-availability and low-latency services, CI/CD systems, deployment automation, and designing secure operational workflows and access controls

Comp & perks

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to improve ATS matching.

Hard Skills & Tools

Linux, cloud infrastructure, Docker, containerized environments, Rust, Python, Bash, troubleshooting, debugging, CI/CD

Soft Skills

ownership, decision making, problem solving