Tech Stack
ITSMKubernetesLinuxPythonTCP/IP
About the role
- Reporting to the Sr Manager, IT Systems, provide technical and team leadership for daily systems operations for Corporate IT and Lightspeed Data Centres.
- Manage the IT Systems team consisting of systems analysts, systems administrators, and infrastructure engineers; oversee priorities, project deliverables, staffing and related network budgets.
- Work closely with the IT PMO for project reporting and deliverables and engage with IT Service Management on requests, incidents, change requests and incident root cause analysis (RCA).
- Lead and mentor a team of Systems Engineers, including workforce planning, performance evaluations, coaching, development plans, and managing on-call rotations.
- Collaborate with IT PMO and project managers to align infrastructure engineering deliverables with project milestones and release schedules; allocate resources across initiatives.
- Take ownership of critical incidents, outages, and escalations; ensure timely resolution and drive continuous improvement in incident response and recovery processes.
- Design and implement proactive monitoring systems, performance metrics, and automation strategies to enhance reliability and efficiency.
- Own and maintain the infrastructure services for Lightspeed and related documentation; develop and manage a long-term systems technology roadmap and cost forecasting.
- Manage relationships with systems vendors and service providers; oversee hardware device support contracts, renewals, lifecycle management, and review/approve quotes and purchases.
- Lead systems budget planning, forecasting, cost tracking and ensure financial accountability across infrastructure initiatives.
- Ensure changes and implementations comply with Telesat’s Change Management procedures and governance controls; optimize service delivery processes and turnaround times.
- Develop operational metrics and presentations on results and trends and explain them to both technical and non-technical audiences.
Requirements
- Must be eligible for Canadian Security Clearance and be a PR or Canadian citizen.
- Candidates must live in the Ottawa area or be willing to relocate to Ottawa.
- Candidates must be willing to be onsite in office 5 days per week.
- Bachelor’s degree in computer science or related field.
- Proven significant experience in managing and leading infrastructure and data centre teams.
- Experience in troubleshooting and resolution of system issues as a member of an Operations team.
- Strong verbal and written communication skills to collaborate with team members and stakeholders.
- In-depth technical experience with Linux and Linux Clustering, MS Active Directory (Entra), Server and Storage devices and related hardware.
- Advanced automation and scripting (e.g. bash, PowerShell, python).
- Knowledge of HPC (High Performance Computing) principles, Hypervisors and High-availability design.
- Previous Enterprise and Data Centre experience with clustering and management of clusters including storage, DCIM (Data Centre Infrastructure Management), operational monitoring and alerting, backup and recovery, basic TCP/IP networking and routing.
- Experience with service level agreements and vendor contracts.
- Experience with systems monitoring and alerting tools.
- Experience with financial analysis and budgeting for data centre investment and ongoing operating costs.
- Ability to manage multiple demands with time related constraints in a fast-paced environment.
- Knowledge of Agile and DevOps methodologies.
- Knowledge of Linux-based systems with containerization (Kubernetes).
- Ability to attain Canadian CGP and Secret Clearance.