nDeavour Consulting

Senior Systems/Reliability Engineer

full-time

Location Type: Remote

Location: United States

About the role

  • Operational Stability & Reliability: Own the health and performance of our hybrid (AWS/On-prem) estate.
  • Infrastructure Maturation: Lead the effort to document our environment, creating the architecture diagrams and runbooks needed to eliminate single points of failure.
  • Pragmatic Kubernetes Management: Operate and improve our existing production Kubernetes clusters.
  • Technical Authority & Peer Leadership: Serve as a senior technical sounding board and mentor.
  • Developer Enablement: Partner with development teams to provide the automation, standards, and guardrails that let them own their deployments safely.
  • Sustainable On-Call: Participate in a sustainable on-call rotation.
  • Security & Compliance Support: Own the operational application of ISO 27001 controls within your remit.

Requirements

  • 5+ years in a Senior SysOps, DevOps, or SRE role with proven experience managing high-pressure production environments.
  • Strong Linux Administration: Confident troubleshooting of production issues (services, logs, performance, and networking).
  • Hybrid Infrastructure Experience: Practical knowledge of AWS (primary) and Azure, and comfort managing both cloud-native and physical/legacy infrastructure.
  • Automation & IaC: Proficiency with Terraform and configuration management (e.g., Ansible).
  • Kubernetes Competency: Experience operating and improving Kubernetes in a production setting.
  • Reliability Mindset: A track record of identifying risks and fixing root causes to improve system monitoring and quality.
  • Familiarity with physical data center environments or Cisco networking (Desirable).
  • Experience in regulated environments (Healthcare, Finance, or similar) (Desirable).
  • Experience supporting ISO 27001 or SOC 2 audits (Desirable).

Benefits

  • Remote Office – Flexible hybrid form of working
  • Parking Space – Free parking spots provided
  • Fun Office Space – Game zone and relaxation area available
  • Health Insurance – Additional private health insurance, including dental care plan
  • Personal Development – Company-sponsored training budget to further develop your skills
  • Employee Referral Program – Receive a bonus for referring a friend
  • Holidays – Extra 5 days after your 1st and 5th year with the company
  • Social Events – We love to celebrate success together
  • Family Insurance – Option to add insurance for family members
  • Sports Cards – 100% sponsored by the company

Applicant Tracking System Keywords

Hard Skills & Tools
Linux Administration, Kubernetes, Terraform, Ansible, AWS, Azure, Infrastructure as Code (IaC), System Monitoring, Troubleshooting, Root Cause Analysis
Soft Skills
Technical Authority, Peer Leadership, Mentoring, Collaboration, Problem Solving, Risk Identification, Communication, Developer Enablement, Operational Stability, Reliability Mindset