Equinix

Senior Manager, Product Software Architecture and Engineering

full-time

Location Type: Hybrid

Location: Dallas, California, Texas, United States

Salary

💰 $177,000 - $319,000 per year

About the role

  • Define and execute the SRE strategy for server operations, aligned with organizational objectives and SLAs.
  • Build and mentor a high-performing team of SREs from the ground up, fostering a culture of accountability, innovation, and continuous improvement.
  • Develop the team’s roadmap for automation, monitoring, reliability engineering, and operational maturity.
  • Partner with engineering, infrastructure, and security teams to ensure seamless delivery and operations.
  • Collaborate with cross-functional teams including procurement, supply chain, IBX, and compliance to support operational excellence.
  • Manage CAPEX/OPEX expenditures to optimize resource allocation and drive cost-efficiency across infrastructure initiatives.
  • Oversee the health, performance, and capacity of server infrastructure, ensuring optimal uptime and efficiency.
  • Lead implementation of monitoring, observability, and alerting systems to detect and resolve issues proactively.
  • Establish incident response and postmortem practices, ensuring root cause analysis and prevention.
  • Develop and enforce operational standards, playbooks, and change management processes.
  • Drive automation for provisioning, configuration, patching, scaling, and recovery.
  • Identify and implement tools that reduce toil, improve reliability, and increase operational speed.
  • Oversee patching, vulnerability mitigation, and compliance for server hardware and OS layers.
  • Ensure adherence to security and data protection requirements.
  • Manage vendor relationships for hardware, monitoring tools, and infrastructure services.
  • Develop and manage the SRE/server operations budget.

Requirements

  • 10+ years in infrastructure, systems engineering, or SRE roles, with 3+ years in a leadership position.
  • Strong technical background in server infrastructure (hardware, OS, virtualization, cloud/hybrid environments).
  • Expertise in automation tools, scripting languages, monitoring platforms, and incident management.
  • Deep understanding of capacity planning, disaster recovery, and high-availability architectures.
  • Excellent leadership, communication, and cross-functional collaboration skills.

Benefits

  • Employee Assistance Program
  • Insurance: You may enroll in health, life, disability and voluntary plans that are designed for you and your eligible family members.
  • Retirement: You and Equinix may contribute to a retirement plan to help you plan for your financial future.
  • Paid Time Off (PTO) and Paid Holidays: You will receive an accrued amount of PTO each pay period along with various paid holidays for you to rest and recharge.

Applicant Tracking System Keywords

Hard Skills & Tools
SRE strategy, server operations, automation, monitoring, reliability engineering, incident management, capacity planning, disaster recovery, high-availability architectures, scripting languages

Soft Skills
leadership, communication, cross-functional collaboration, accountability, innovation, continuous improvement