FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Advisor, Infrastructure Engineering
FiservCloud Operations Tech Lead providing technical leadership for multi-cloud operations across AWS, Azure, and GCP at Fiserv. Ensuring operational continuity and mentoring teams for incident resolution.
Tech Stack
Tools & technologiesAWSAzureCloudGoogle Cloud Platform
About the role
Key responsibilities & impact- The Cloud Operations Tech Lead provides technical leadership, operational continuity, and escalation support for Fiserv’s enterprise multi-cloud operations across AWS, Azure, and GCP.
- Serve as the technical lead and escalation point for Cloud Operations Engineers (L1) and SREs.
- Manage resource scheduling and rotational coverage to ensure Shift continuity and operational objectives are met.
- Provide shift to shift continuity, ensuring clear handoffs, consistent decision making, and risk awareness.
- Act as the final technical authority during complex or high severity incidents.
- Ensure operational consistency across regions, shifts, and teams.
- Lead and coordinate response to major incidents, service outages, and degraded service conditions.
- Drive real-time troubleshooting, impact assessment, and recovery efforts.
- Ensure incidents are escalated appropriately and resolved efficiently.
- Facilitate and review post-incident reviews, root cause analysis, and corrective actions.
- Track recurring issues and ensure they are addressed through automation or engineering fixes.
- Mentor Cloud Operations Engineers and SREs on:
- Cloud infrastructure fundamentals.
- Incident response and escalation best practices.
- SRE concepts (reliability, toil reduction, automation).
- Support skill development and readiness for progression from Cloud Operations Engineer → Cloud Site Reliability Engineer SRE.
- Promote operational excellence, accountability, and a blameless culture.
- Enforce adherence to runbooks, SOPs, change management, and security policies.
- Review and approve operational procedures and runbook updates.
- Identify gaps in operational coverage, tooling, or documentation.
- Drive standardization and continuous improvement across cloud operations.
- Partner with SRE and Platform Engineering teams to:
- Reduce manual toil.
- Expand automation and auto remediation.
- Improve monitoring, alerting, and observability.
- Identify recurring operational patterns suitable for automation.
- Ensure automation is safe, documented, and consistently executed.
- Collaborate closely with:
- Application and product engineering teams.
- Platform engineering.
- Security and compliance teams.
- Vendors and cloud service providers.
- Act as the operations representative in technical discussions impacting reliability and supportability.
- Communicate operational risks, trends, and improvement opportunities to leadership.
Requirements
What you’ll need- 8+ years of experience in Cloud Operations, SRE, Infrastructure Engineering, or Production Support
- Hands-on experience operating enterprise environments in AWS, Azure, and/or GCP
- Strong background in:
- Incident management and escalation
- Cloud infrastructure (compute, networking, storage, IAM)
- Monitoring and observability
- Proven ability to lead technical teams during incidents
- Experience mentoring engineers in operational and reliability practices
- Strong written and verbal communication skills
- Ability to operate effectively in a 24×7 global operations model
Benefits
Comp & perks- Health insurance
- 401(k) matching
- Flexible work hours
- Paid time off
- Remote work options
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
AWSAzureGCPincident managementcloud infrastructuremonitoringobservabilityautomationroot cause analysisSRE concepts
Soft Skills
technical leadershipmentoringcommunicationoperational excellencerisk awarenesscollaborationdecision makingaccountabilitycontinuous improvementblameless culture