FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Site Reliability Engineer – APAC
pod networkSite Reliability Engineer improving and scaling the reliability of the Pod platform, focusing on incident response and operational tooling.
Tech Stack
Tools & technologiesCloudDistributed SystemsDockerGrafanaLinuxPrometheusPythonRust
About the role
Key responsibilities & impact- Monitor the health and performance of the platform
- Respond to production incidents and drive them through to resolution
- Investigate failures, identify root causes, and coordinate fixes
- Ensure issues are detected, understood, and addressed quickly
- Identify recurring operational pain points and eliminate them
- Improve software, deployment processes, and operational workflows
- Participate in incident reviews and help drive preventative improvements
- Contribute reliability-focused changes directly to production systems
- Design and maintain dashboards, metrics, alerting, and monitoring systems
- Improve signal quality while reducing alert fatigue
- Build automation and internal tools that make the platform easier to operate
- Help establish reliability best practices across the engineering organization
Requirements
What you’ll need- Strong experience with Linux and cloud infrastructure
- Experience operating and supporting production systems
- Experience with Docker and containerized environments
- Experience with observability and incident-management tools such as Grafana, Prometheus, PagerDuty, or similar
- Ability to automate workflows using Rust, Python, Bash, or similar languages
- Strong troubleshooting and debugging skills
- A high degree of ownership and the ability to make sound decisions independently
- Nice to Have: Experience with distributed systems, high-availability, low-latency services, CI/CD systems, deployment automation, designing secure operational workflows and access controls
Benefits
Comp & perks- Competitive compensation (~$100k USD/year)
- Meaningful token/equity allocation
- Real ownership and responsibility from day one
- Work from wherever you are within the target timezone range (UTC+7 to UTC+1)
- Occasional travel to Europe and elsewhere for team meetups
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Linuxcloud infrastructureDockercontainerized environmentsRustPythonBashtroubleshootingdebuggingCI/CD
Soft Skills
ownershipdecision makingproblem solving