Salary
💰 $110,000 - $186,000 per year
Tech Stack
Ansible, Grafana, Java, Python, Splunk, Terraform
About the role
- Automate operational tasks and health checks to create sustainable systems and services
- Monitor the production environment, using observability tools for system health and creating dashboards for alerts
- Map business processes to identify reliability gaps and analyze performance metrics
- Collaborate in system design consulting, platform management, and capacity planning
- Drive the creation and maintenance of documentation including SOPs, configurations, and infrastructure maps
Requirements
- 5+ years of experience in Site Reliability Engineering (SRE) within a Fintech or product organization
- 4+ years of experience automating tasks using Python, Java, Ansible, or PowerShell
- 4+ years of experience with observability and monitoring tools like Dynatrace, Splunk, Moogsoft, or Grafana
- Experience managing CI/CD pipelines and automation tools like GitLab, Harness, Nexus, Terraform, or SonarQube
- Bachelor's degree in computer science or related technical field and/or 7+ years of relevant work experience
- Incentive-eligible associates can receive an annual incentive opportunity in the form of a cash bonus and equity awards
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
Site Reliability Engineering, Python, Java, Ansible, PowerShell, observability tools, monitoring tools, CI/CD pipelines, automation tools, capacity planning
Soft skills
collaboration, documentation, process mapping, performance analysis