Own the reliability, performance, capacity, and lifecycle of VMware vSphere (vCenter/ESXi) and Microsoft Hyper‑V/Failover Clustering platforms.
Design and implement automation‑first workflows for provisioning, configuration, patching, upgrades, and compliance.
Build and evolve observability (metrics, logs, traces, SLOs/error budgets) to drive data‑informed operations.
Improve resilience and security (HA/DR patterns, backup/restore validation, hardening, least privilege, segmentation).
Partner with networking, storage, and security teams to deliver performance and availability at scale across datacenters and edge sites.
Lead incident response and post‑incident learning; reduce toil through runbooks and self‑healing.
Evaluate and adopt AI‑assisted operations and emerging platform capabilities.
Document architecture, standards, and runbooks; mentor engineers and champion best practices.
Requirements
Bachelor's Degree in Computer Engineering, Computer Science, Mathematics, Engineering OR a combination of equivalent education and experience
5 or more years of related experience
Experience in automation, specifically related to deployment, recovery, or other manual processes
Experience with n-tier solutions
Experience with resilience modeling (failure mode analysis) and ability to simulate service outages for point solutions (automated) and platforms (manually)
Experience influencing software engineering team members in translating customer and technical requirements into service architecture to meet Quality of Service Expectations
Experience leading and representing the Service in backlog discussions and standups to establish appropriate prioritization of Live Site requirements
Experience leveraging data and metrics to drive behavior, process, and priority decisions. Using logic and reasoning to identify the strengths and weaknesses of alternative solutions, conclusions, or approaches to problems. Experience in problem solving using systematic procedures and investigating problems utilizing root cause analysis.
Benefits
Health care benefits (medical, dental, vision)
401(k) Savings Plan with employer matching
Life insurance
Disability insurance
Time off benefits (paid parental leave, vacations, holidays, health issues)
Voluntary benefits
Well-being resources
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.