Controller

• Maintain 24×7 situational awareness of network health (availability, latency, loss, jitter, capacity, link/path status)
• Triage, contain, and restore service during incidents using documented runbooks/playbooks; coordinate swarming with Routing/Switching, Boundary Security, Platform, and Cyber teams
• Execute initial impact assessment, user/stakeholder communications, workarounds, and continuity steps; capture timelines and decisions for post-incident review
• Operate dashboards, alerts, and synthetic/active tests; tune thresholds to reduce noise while protecting SLOs
• Correlate telemetry (syslog, SNMP/streaming telemetry, NetFlow/IPFIX, route/adjacency state) and tie findings to actionable escalation paths
• Validate monitoring for new or changed network services (health checks and alerts in place before go-live)
• Enforce CAB/CCB decisions, maintenance windows, freeze periods, and back-out plans for network changes
• Verify pre-change readiness (peer review, approvals, rollback tested, comms prepared) and confirm post-change health
• Keep CMDB/CIs and topology/dependency maps current; record changes and relationships for traceability
• Lead problem investigations (recurring incidents, trend spikes); run root cause analysis (RCA) and define durable corrective actions
• Track availability, MTTR/MTBF, error budgets, and capacity trends; recommend scaling, policy tuning, or resiliency patterns (path diversity, ECMP, QoS adjustments)
• Drive backlog items with owners and verify closure via effectiveness checks
• Execute patch/vulnerability windows for boundary devices per plan; validate policy deployments and change results
• Operate within RMF/DISA STIG constraints; preserve audit trails and evidence for assessments and ATO/cATO sustainment
• Coordinate with Cyber/Blue Team on detections, containment steps, and after-action improvements
• Measure and report SLAs/SLOs (site availability, ticket KPIs, change success, incident-induced change rate) with daily/weekly/monthly executive rollups
• Maintain stakeholder communications for incidents, maintenance, and changes throughout the event lifecycle
• Author/maintain runbooks, troubleshooting guides, operational standards, KEDB (Known Error DB), and service catalogs; keep knowledge articles current
• Contribute to readiness reviews, go-live checklists, and lessons learned; coach junior controllers and cross-train peers
• Other duties as assigned

Network Operations Controller