
Site Reliability Engineer
Equifax
full-time
Posted on:
Location Type: Hybrid
Location: Trivandrum • India
Visit company websiteExplore more
About the role
- Manage system(s) uptime across cloud-native (AWS, GCP) and hybrid architectures
- Build infrastructure as code (IAC) patterns that meet security and engineering standards using one or more technologies (Terraform, scripting with cloud CLI, and programming with cloud SDK)
- Build CI/CD pipelines for build, test, and deployment of application and cloud architecture patterns, using platform (Jenkins) and cloud-native toolchains
- Build automated tooling to deploy service requests to push a change into production
- Build runbooks that are comprehensive and detailed to manage, detect, remediate, and restore services
- Solve problems and triage complex distributed architecture service maps
- On call for high severity application incidents and improving runbooks to improve MTTR
- Lead availability blameless postmortem and own the call to action to remediate recurrences
Requirements
- BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics)
- 7-10 years of experience in software engineering, systems administration, database administration, and networking
- 4+ years of experience developing and/or administering software in public cloud
- Experience in monitoring infrastructure and application uptime and availability to ensure functional and performance objectives
- Hands-on experience working in GCP infrastructure services and provisioning automated infrastructure using Terraform
- Rebuilding GCP VM instances using automated Terraform and Jenkins pipelines
- Provisioning GCP Resources (e.g., GCE, GKE resources, Storage, network components) using automated pipelines
- Configuring the monitoring (alerting and dashboards) for the microservices using Stackdriver, Datadog, and AppDynamics
- Creating new automation scripts and updating the existing automation framework using Terraform and Shell/Python scripting
- Set up IAM policy bindings through automation and troubleshoot issues with IAM policies in GCP and AWS
- Implementing custom roles and binding custom roles in GCP projects
- Automate blue/green deployment strategies in GCP environments
- On-call with pager duty for production incidents
- Hands-on experience in setting up CI/CD pipelines using Git and Jenkins
- Good experience in creating and updating helm charts for GKE resources
- Demonstrable cross-functional knowledge with systems, storage, networking, security, and databases
Benefits
- Health insurance
- Comprehensive compensation package
- Attractive paid time off
- Organizational growth potential through online learning platform
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
infrastructure as codeTerraformcloud CLIcloud SDKCI/CD pipelinesJenkinsmonitoringautomation scriptingGCPAWS
Soft skills
problem solvingtriageleadershipcommunicationcollaborationincident managementblameless postmortemcall to actiondetailed documentationcross-functional knowledge