FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.
Tech Stack
Tools & technologiesCloudGrafanaKubernetesOpenShiftPrometheus
About the role
Key responsibilities & impact- Monitor and manage alerts on Cloud platforms, middleware, and container environments.
- Perform end-to-end troubleshooting for incident analysis and resolution.
- Diagnose issues across multiple layers: infrastructure, network, middleware, and application.
- Execute operational procedures and standard changes according to established processes.
- Escalate complex incidents to specialized teams when necessary.
- Provide support for production and non-production environments.
- Administer and monitor APIs and integrations.
- Analyze performance, latency, and availability issues of services.
- Use observability tools for investigation and diagnosis (metrics, logs, and traces).
- Collaborate with global teams on continuous improvement initiatives and operational stability.
- Document procedures, incidents, and operational knowledge in runbooks and knowledge bases (KB).
Requirements
What you’ll need- Experience supporting and maintaining high-availability, mission-critical environments.
- Knowledge of Cloud platforms, middleware, APIs, and container environments (Kubernetes/OpenShift).
- Experience in advanced troubleshooting and incident analysis in distributed environments.
- Knowledge of infrastructure, networking, operating systems, and application servers.
- Experience with observability and monitoring tools (Dynatrace, Grafana, Prometheus, ELK, or similar).
- Strong technical analytical skills for root cause identification.
- Familiarity with ITIL processes (Incident, Problem, and Change Management).
- Good communication skills for working with global, multidisciplinary teams.
- Analytical, proactive, and problem-solving oriented profile.
- Ability to work autonomously in mission-critical production environments.
Benefits
Comp & perks- Company-subsidized health plan for the employee.
- Option to include dependents in the health plan with payroll deduction.
- Dental assistance (optional).
- Option to include dependents in the dental assistance with payroll deduction.
- Meal voucher or food allowance.
- Transportation voucher (optional).
- Impact & Care - Personal support program offering confidential emotional support and counseling in psychological, legal, financial, social, and pet-related areas at no cost for the employee and legal dependents.
- Gympass - Wellhub (access to more than 700 gyms across Brazil with plans from R$ 29.90 deducted via payroll).
- Option to include dependents in Gympass - Wellhub (up to 3 dependents - paid via credit card).
- Access to Udemy through our intranet.
- Partnerships with major consumer brands.
- SESC agreement for employee and dependents.
- Discount agreements with educational institutions (undergraduate and postgraduate) and language / certification schools.
- Group life insurance.
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Cloud PlatformsMiddlewareContainer EnvironmentsAdvanced TroubleshootingIncident AnalysisInfrastructure KnowledgeNetworking KnowledgeOperating Systems KnowledgeApplication Servers KnowledgeAPI Administration
Soft Skills
Good Communication SkillsAnalytical SkillsProactive AttitudeProblem-Solving OrientationAbility to Work Autonomously
Certifications
ITIL Certification
