Citi

DevOps Lead, SRE

full-time

Location Type: Hybrid

Location: Jacksonville, Florida; Texas; United States

Salary

💰 $113,840 - $170,760 per year

About the role

  • Provides expertise related to various Distributed Consumer Applications across multiple Lines of Business in North America
  • Enable production management processes in non-production environments to provide environment stability
  • Execute robust service readiness
  • Facilitate standard toolset adoption for all services in the domain
  • Works as an L2 expert supporting incident management, problem management, risk management, change management, and the CI/CD enablement pipeline for the identified SRE function
  • Has overall accountability for non-production stability in their area/domain
  • Partners with Level 3 support teams to improve resolution rates, efficiency targets, and organizational Service Level Agreements
  • Performs SRE analysis, remediates identified issues with stakeholders, and holds them accountable during release sign-offs
  • Partners with SRE enablement, and eventually works as an SRE, to identify key areas and provide SRE recommendations from UAT through PERF and PROD for the key business transactions supported
  • Identifies and leads the implementation of service automation to reduce cost, reduce risk, improve efficiency, and enable Service Management to keep up with the ever-increasing volume and fast pace of newer technologies
  • Continually evolve the working practices within, and the services provided by, Production Management to improve efficiency and productivity
  • Ability to conduct the blameless problem management/post-mortem phase of major incidents, develop executive briefings, assess major incident impacts, and drive service improvements to prevent incidents from recurring
  • Create PMRs for P1/P2 incidents and close out the resulting actions
  • Identify and classify risks in the non-production estate, work with peers and team members to create service improvement plans, and drive them to closure
  • Create operational readiness documents for major initiatives and provide a seamless handover to the production team
  • Work with the SRE team to create a proactive analysis of the UAT and PERF view before handing over to production management
  • Accountable for end-to-end service health of the NAM Core space
  • Has overall accountability for patching, changes, infrastructure changes, certificates, and other KTLO activities in their assigned domain
  • Has overall accountability for monitoring and its usage by stakeholders
  • Work with the monitoring team on setup and overall accountability
  • Represent the DevOps team in various digital forums and facilitate the generation of reports and presentations
  • Be proficient in technologies such as OSE, Apigee, AWS, and other new-age technologies
  • Adopt automation laid down by Production management automation and AIOps
  • Support and achieve successful internal audits

Requirements

  • 8+ years development or production support experience with North America Consumer applications
  • Experience or familiarity with cloud technology is a plus
  • Solid ITIL Foundation understanding
  • Engineering background in system administration, development, DevOps, or an equivalent field, preferably with experience in Distributed Consumer applications
  • Experience or familiarity with automation technologies, advanced analytics, and predictive modelling
  • Ability to develop and manage relationships at all levels
  • Experience with databases, e.g. Oracle, DB2
  • Programming experience in at least one language, such as Unix shell scripting or Java
  • Competent with cloud concepts, e.g. APIs, web services, and microservices

Benefits

  • medical, dental & vision coverage
  • 401(k)
  • life, accident, and disability insurance
  • wellness programs
  • planned time off (vacation)
  • unplanned time off (sick leave)
  • paid holidays

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
SRE, Incident Management, Problem Management, Change Management, CI/CD, Service Automation, Unix Shell Scripting, Java, Oracle, DB2
Soft Skills
relationship management, blameless problem management, executive briefings, service improvement, collaboration, communication, organizational skills, accountability, leadership, efficiency improvement
Certifications
ITIL Foundation