
Senior Systems/Reliability Engineer
nDeavour Consulting
full-time
Posted on:
Location Type: Remote
Location: United States
Visit company websiteExplore more
Job Level
About the role
- Operational Stability & Reliability: Own the health and performance of our hybrid (AWS/On-prem) estate.
- Infrastructure Maturation: Lead the effort to document our environment—creating the architecture diagrams and runbooks necessary to eliminate single points of failure.
- Pragmatic Kubernetes Management: Operate and improve our existing production Kubernetes clusters.
- Technical Authority & Peer Leadership: Serve as a senior technical sounding board and mentor.
- Developer Enablement: Partner with development teams to provide the automation, standards, and guardrails that allow them to own their own deployments safely.
- Sustainable On-Call: Participate in a sustainable on-call rotation.
- Security & Compliance Support: Own the operational application of ISO 27001 controls within your remit.
Requirements
- 5+ years in a Senior SysOps, DevOps, or SRE role with proven experience managing high-pressure production environments.
- Strong Linux Administration: Confident troubleshooting of production issues (services, logs, performance, and networking).
- Hybrid Infrastructure Experience: Practical knowledge of AWS (primary) and Azure, with a comfort level managing both cloud-native and physical/legacy infrastructure.
- Automation & IaC: Proficiency with Terraform and configuration management (e.g., Ansible).
- Kubernetes Competency: Experience operating and improving Kubernetes in a production setting.
- Reliability Mindset: A track record of identifying risks and fixing root causes to improve system monitoring and quality.
- Familiarity with physical data center environments or Cisco networking (Desirable).
- Experience in regulated environments (Healthcare, Finance, or similar) (Desirable).
- Experience supporting ISO 27001 or SOC2 audits (Desirable).
Benefits
- Remote Office – Flexible hybrid form of working
- Parking Space – Free parking spots provided
- Fun Office Space – Game zone and relaxation area available
- Health Insurance – Additional private health insurance, including dental care plan
- Personal Development – Company-sponsored training budget to further develop your skills
- Employee Referral Program – Receive a bonus for referring a friend
- Holidays – Extra 5 days after your 1st and 5th year with the company
- Social Events – We love to celebrate success together
- Family Insurance – Option to add insurance for family members
- Sports Cards – 100% sponsored by the company
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Linux AdministrationKubernetesTerraformAnsibleAWSAzureInfrastructure as Code (IaC)System MonitoringTroubleshootingRoot Cause Analysis
Soft Skills
Technical AuthorityPeer LeadershipMentoringCollaborationProblem SolvingRisk IdentificationCommunicationDeveloper EnablementOperational StabilityReliability Mindset