
Site Reliability Engineer
Pythian
Full-time
Location Type: Remote
Location: India
About the role
- Improve reliability, availability, and visibility for customer platforms; plan/execute documented maintenance; produce RCAs and drive remediation/automation
- Implement and evolve SLIs/SLOs, error budgets, capacity plans, and reliability reviews; harden observability (metrics/logs/traces) and keep alerts actionable
- Build and extend infrastructure as code (Terraform/CloudFormation), configuration management (Ansible/Puppet/Chef/Salt), and GitOps (Helm, Argo CD/Flux) pipelines
- Collaborate with support/app/data teams; reduce toil via scripting (Bash/Python) and small tools
- Lead technical discussions with customers; advise on platform roadmaps and architecture trade‑offs
- Mentor and cross‑train teammates; uphold documentation quality and internal knowledge sharing
- Participate in on‑call, escalations, and change reviews
Requirements
- Strong Linux administration/troubleshooting
- Kubernetes fundamentals (workloads, networking, storage, upgrades, security, HA)
- Experience with at least one major cloud (GCP/AWS/Azure)
- IaC + config management experience
- CI/CD/GitOps familiarity
- Scripting with Bash and/or Python
- Clear written and spoken English; strong estimation, stakeholder communication, and documentation skills
- Depth in one data platform:
  - Elasticsearch/OpenSearch (ops, tuning, ILM/snapshots, security/EFK), or
  - Kafka–Hadoop–Spark (production exposure; Confluent/MSK/Strimzi; Dataproc/EMR/CDP familiarity), or
  - Redis (OSS/managed operations; data structures, eviction policies, RDB/AOF persistence)
- Nice to have: Security/IAM basics, budget/capacity awareness; Prometheus/Grafana experience; diagramming/runbook excellence
Benefits
- Competitive total rewards package
- Time to blog during work hours
- Work flexibly and remotely from home; there is no daily commute to an office!
- Substantial training allowance; participate in professional development days, attend training, become certified, whatever you like!
- All the equipment you need to work from home including a laptop with your choice of OS, and an annual budget to personalize your work environment!
- Annual wellness budget to make yourself a priority (use it on gym memberships, massages, fitness and more)
- Generous amount of paid vacation and sick days, as well as a day off to volunteer for your favourite charity
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Linux administration, Kubernetes, Terraform, CloudFormation, Ansible, Puppet, Chef, Salt, Bash, Python
Soft Skills
communication, documentation quality, mentoring, stakeholder communication, technical discussions, estimation