FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Site Reliability Engineer, SRE – Engineering Productivity
Arista NetworksSite Reliability Engineer in Arista Networks' Engineering Productivity Team. Focused on maintaining, scaling, and improving organization-wide systems and tools in a hybrid cloud environment.
Tech Stack
Tools & technologiesGoLinuxPythonShell ScriptingUnix
About the role
Key responsibilities & impact- Build, deploy safely and incrementally, and operate critical production systems with focus on scalability, reliability, observability, performance and security.
- Build automation to remove toil and proactively monitor, respond to, and enhance alerts with automated handling.
- Create and maintain incident response runbooks, triage platform and infrastructural issues, and write postmortem documents to prevent recurring incidents.
- Plan and communicate maintenance windows on production systems while engaging with 3rd party vendor support as needed.
- Work with Arista's product development teams to identify infrastructural bottlenecks and design solutions to enhance developer experience and workflow efficiency.
- Survey and adopt best practices around infrastructure and platform design to maintain secure, scalable and fault-tolerant systems, including studying OSS system implementations for better triage and resolution.
Requirements
What you’ll need- At least BSc Computer Science or Engineering + 3 years’ experience, MS Computer Science or Engineering + 3 years’ experience, or equivalent work experience.
- Knowledge of one or more of Go, Python, shell scripting to be able to implement medium complexity automation workflows.
- Knowledge of Linux (or UNIX) from administration and debugging perspective
- Hands-on experience in operating software systems (infrastructure, complex applications etc) at scale
- Experience in server provisioning (esp from storage and networking perspective).
- Strong problem solving and software troubleshooting skills
- Experience with infrastructure-as-code
Benefits
Comp & perks- Health insurance
- Retirement plans
- Paid time off
- Flexible work arrangements
- Professional development opportunities
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
GoPythonshell scriptingLinuxinfrastructure-as-codeserver provisioningsoftware troubleshootingautomation workflowsproduction systemsincident response
Soft Skills
problem solvingcommunicationcollaborationengagement with vendorsworkflow efficiency
Certifications
BSc Computer ScienceBSc EngineeringMS Computer ScienceMS Engineering