
Senior Site Reliability Engineer, Observability
Woven Planet
full-time
Posted on:
Location Type: Hybrid
Location: Tokyo • Japan
Visit company websiteExplore more
Job Level
About the role
- Design and develop observability features for customers of an engineering platform product, and support them in adopting the observability platform
- Provide technical leadership by guiding technical decision‑making, contributing to roadmap planning, enabling effective cross‑team collaboration, and mentoring team members
- Participate in an 8‑hour on‑call rotation—covering weekdays, weekends, and holidays—to monitor and respond to incidents. On‑call duties can be performed remotely
- Provide L2 support for user requests, assisting customers with troubleshooting and product usage issues
- Conduct blameless post‑mortems and address service reliability issues through hands‑on coding
- Reduce operational toil and improve team efficiency by implementing automation
Requirements
- 7+ years of experience in SRE, DevOps, cloud engineering, observability engineering, or backend development
- Intermediate to advanced proficiency in Go, Python, or similar programming languages, with strong knowledge of data structures, algorithms, and software design principles
- Intermediate to advanced expertise with APM solutions and monitoring systems such as Prometheus, Grafana, and GCP Monitoring
- Intermediate to advanced expertise with public cloud technologies, Kubernetes, and Infrastructure as Code
- Business‑level English proficiency
Benefits
- Competitive Salary - Based on experience
- Work Hours - Flexible working time
- Paid Holiday - 20 days per year (prorated)
- Sick Leave - 6 days per year (prorated)
- Holiday - Sat & Sun, Japanese National Holidays, and other days defined by our company
- Japanese Social Insurance - Health Insurance, Pension, Workers’ Comp, and Unemployment Insurance, Long-term care insurance
- Housing Allowance
- Retirement Benefits
- Rental Cars Support
- In-house Training Program (software study/language study)
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
GoPythondata structuresalgorithmssoftware design principlesAPM solutionsmonitoring systemsKubernetesInfrastructure as Codeautomation
Soft Skills
technical leadershipcross-team collaborationmentoringtroubleshootingincident responsepost-mortemsservice reliabilityefficiency improvement