
Senior Reliability Engineer
CyrusOne
full-time
Posted on:
Location Type: Remote
Location: United States
Visit company websiteExplore more
Salary
💰 $140,000 - $170,000 per year
Job Level
About the role
- Serve as a subject-matter expert and strategic technical authority for infrastructure reliability across a portfolio of mission-critical data center sites.
- Lead the design, governance, and continuous improvement of reliability strategies for power, cooling, and control systems.
- Independently evaluate complex reliability risks, prioritize initiatives under uncertainty, and influence operational, maintenance, and capital decisions that materially impact uptime, safety, and lifecycle cost.
- Architect and govern portfolio-level, risk-based asset strategies for mission-critical power and cooling infrastructure.
- Apply advanced RCM principles to define maintenance and inspection strategies aligned to failure risk, system criticality, and redundancy posture.
- Evaluate and balance tradeoffs between maintenance investment, operational risk, spares coverage, redundancy, and capital replacement.
- Establish and maintain enterprise PM quality standards, including audits, task effectiveness reviews, and elimination of low-value maintenance.
- Serve as a final technical authority for high-risk SOPs, MOPs, EOPs, and operational change packages.
- Perform system-level risk assessments for planned work, incidents, and abnormal operating conditions.
- Guide site teams in CMMS data integrity, work management maturity, and adherence to approved operating procedures.
Requirements
- 10+ years of experience in reliability engineering, maintenance engineering, or facilities engineering within mission-critical environments.
- Demonstrated leadership of complex, multi-system reliability programs with measurable business impact.
- Expert-level knowledge of RCM, FMEA, RCA, and maintenance optimization methodologies.
- Deep technical understanding of mission-critical infrastructure, including UPS, generators, switchgear, chillers, cooling towers, CRAH/CRAC, and BMS/EPMS.
- Proven experience governing SOP/MOP/EOP programs and assessing operational change risk in live environments.
- Advanced ability to analyze condition-monitoring, CMMS, and operational datasets and convert insights into strategic actions.
- Proficiency in data analysis and visualization tools (Excel, Power BI, or similar).
- Ability to apply statistical techniques or reliability modeling to support risk-informed decision-making under uncertainty.
- Strong executive-level communication skills; able to influence senior leaders and defend technical positions.
Benefits
- Health insurance
- Retirement plans
- Paid time off
- Flexible work arrangements
- Professional development opportunities
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
reliability engineeringmaintenance engineeringfacilities engineeringRCMFMEARCAmaintenance optimizationdata analysisstatistical techniquesreliability modeling
Soft skills
leadershipcommunicationinfluencestrategic thinkingrisk assessmentdecision-makingproblem-solvingteam guidanceauditingtask effectiveness