Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
pod network

Site Reliability Engineer – APAC

pod network

Site Reliability Engineer improving and scaling the reliability of the Pod platform, focusing on incident response and operational tooling.

Posted 6/19/2026full-timeRemote • 🇲🇾 MalaysiaMid-LevelSenior💰 $100,000 per yearWebsite

Tech Stack

Tools & technologies
CloudDistributed SystemsDockerGrafanaLinuxPrometheusPythonRust

About the role

Key responsibilities & impact
  • Monitor the health and performance of the platform
  • Respond to production incidents and drive them through to resolution
  • Investigate failures, identify root causes, and coordinate fixes
  • Ensure issues are detected, understood, and addressed quickly
  • Identify recurring operational pain points and eliminate them
  • Improve software, deployment processes, and operational workflows
  • Participate in incident reviews and help drive preventative improvements
  • Contribute reliability-focused changes directly to production systems
  • Design and maintain dashboards, metrics, alerting, and monitoring systems
  • Improve signal quality while reducing alert fatigue
  • Build automation and internal tools that make the platform easier to operate
  • Help establish reliability best practices across the engineering organization

Requirements

What you’ll need
  • Strong experience with Linux and cloud infrastructure
  • Experience operating and supporting production systems
  • Experience with Docker and containerized environments
  • Experience with observability and incident-management tools such as Grafana, Prometheus, PagerDuty, or similar
  • Ability to automate workflows using Rust, Python, Bash, or similar languages
  • Strong troubleshooting and debugging skills
  • A high degree of ownership and the ability to make sound decisions independently
  • Nice to Have: Experience with distributed systems, high-availability, low-latency services, CI/CD systems, deployment automation, designing secure operational workflows and access controls

Benefits

Comp & perks
  • Competitive compensation (~$100k USD/year)
  • Meaningful token/equity allocation
  • Real ownership and responsibility from day one
  • Work from wherever you are within the target timezone range (UTC+7 to UTC+1)
  • Occasional travel to Europe and elsewhere for team meetups

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Linuxcloud infrastructureDockercontainerized environmentsRustPythonBashtroubleshootingdebuggingCI/CD
Soft Skills
ownershipdecision makingproblem solving