
Senior Site Reliability Engineer – US Working Hours
ABBYY
full-time
Location Type: Hybrid
Location: Bangalore • India
About the role
- Co-own critical production service designs to ensure high reliability is achievable and measurable
- Drive reliability and observability improvements in the services within the engineering verticals
- Using monitoring and telemetry data, help teams make informed decisions on where reliability challenges may exist and help design and build solutions to improve them
- Build SRE dashboards from SLIs to measure SLO adherence
- Support production applications hosted in the Azure cloud
- Build and improve internal tools and automation software to make maintaining production services easier and safer
- Lead reliability-focused practices such as Failure Analysis, Load and Capacity Planning, Service Reviews, Architecture Designs, Incident Postmortems, and others
- Develop infrastructure as code
- Define (from design to implementation details) necessary auto-healing and fault-tolerant systems
- Serve as the point of contact for production application issues, working closely with engineering leadership
Requirements
- 7-10 years of IT experience
- Proven experience with at least one cloud platform (Azure or AWS), preferably Azure
- Proficient in Kubernetes, AKS, Azure Functions, Storage accounts, and others
- Proven experience in Microsoft technologies: Windows Server, IIS (preferred)
- Distributed monitoring experience in Grafana: logging, metrics, tracing, etc.
- Years of experience matching the level in an Infrastructure, SRE, DevOps, or CloudOps role
- Experience working in an SRE team in a dynamic, fast-paced environment
- Experience programming in one or more of the following: C#, Java, Python, .NET, Node.js, Go
- Experience with Terraform, Ansible, or a similar infrastructure-as-code tool
- Experience with cloud-performant microservices and event-driven architectures
- Experience with Kubernetes administration is an added advantage
- Experience with database performance monitoring tools (e.g., Percona Toolkit, SQL Profiler)
- Proven experience in diagnosing and resolving indexing issues in relational databases (e.g., PostgreSQL, MySQL, SQL Server, Oracle).
- Strong understanding of query optimization, execution plans, and database internals.
- Understanding of information security concepts and terminology
- Strong knowledge of software development methodologies and passion for creating high-standard tool sets for infrastructure-as-code
- Ability to analyze problems quickly and find suitable solutions based on available resources
- A proactive and open-minded individual with a clear client focus and a structured approach
- Experience in leading and managing a small team
- Comfortable working US hours aligned to the PST time zone.
Benefits
- We provide remote and hybrid working options to fit all lifestyles.
- We use flexible hours across most of our teams to allow you to find your own definition of balance.
- Encouraging a culture of giving, we provide two paid volunteering days off every year so you can take time to contribute to the causes you care about.
- To ensure your family is cared for, we offer paid parental leave in all our locations.