FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.

Senior ITSMA Observability Engineer
HedgeServSenior ITSMA Observability Engineer at HedgeServ designing and developing observability solutions for critical applications. Collaborating with IT teams and engaging in system performance enhancements while maintaining infrastructure tools.
Tech Stack
Tools & technologiesAnsibleAWSCloudDockerElasticSearchETLGrafanaKubernetesLogstashPrometheusPythonTerraform
About the role
Key responsibilities & impact- collaborate with the ITSMA Monitoring and Analytics Team to design, build, secure, maintain, optimize, and document solutions utilizing Elastic Cloud Stack and AWS-managed Prometheus
- Proficiency with Elasticsearch, Logstash, Kibana, Beats, APM with X-Pack, Prometheus, Grafana, AWS CloudWatch, and other observability tools
- Engage closely with application owners, engineers, and development teams to evaluate requirements, architect, and support an Elasticsearch Stack solution, as well as structure queries to enhance system performance and efficiency
- Design and configure ETL data pipelines using Elastic Common Schema for onboarding application logs and metrics
- Configure index templates and manage data lifecycle (ILM) for effective data retention
- Develop Ansible playbooks for automated deployment of Beat agents across on-premises and AWS systems; utilize Terraform for safe management of production infrastructure, employing methodologies such as Infrastructure as Code within AWS environments
- Create Elastic alerting solutions via Watcher and Kibana Alerts integrated with existing ticketing tools and MS Teams
- Develop Machine Learning jobs to dynamically monitor and provide alerts based on specific metrics and KPIs
- Build Elastic and AWS observability AI solutions that enable infrastructure engineering and operations teams to address production issues efficiently
- Adhere to lifecycle processes for transitioning solutions from Development to QA to Production
- Actively participate in collaborative group sessions, attend agile sprint daily meetings, and share progress to ensure solution development aligns with organizational requirements
Requirements
What you’ll need- Technical Degree in Information Technology
- Experience with Elastic Cloud and AWS Managed Prometheus
- Knowledge of installation, system tasks, data collection, network troubleshooting, data pipelines, and cluster administration
- Proficient in Python, Bash, PowerShell, Painless, and other scripting languages
- Extensive ELK Stack expertise, including Elasticsearch, Logstash, Kibana, Beats, Machine Learning, APM, X-Pack, and REST API integration
- Skilled in evaluating and tuning Elastic clusters, configurations, indexing, search performance, security, and administration
- Proficient with Prometheus, Grafana, AWS observability tools, and their performance, security, and management
- Experienced with security integrations (Windows SAML, LDAP, Kerberos) in Elasticsearch
- Adept with AWS services: CloudWatch, CloudTrail, Kubernetes, Docker, Lambda
- Integrated Elastic alerting with third-party ticketing tools
- Experienced in implementing and integrating observability AI agents and frameworks for automated analysis, incident detection, and proactive resolution across complex systems
Benefits
Comp & perks- fully paid comprehensive health and well-being benefits
- remote and hybrid working arrangements
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
ElasticsearchLogstashKibanaBeatsPrometheusGrafanaPythonBashPowerShellMachine Learning
Soft Skills
collaborationcommunicationproblem-solvingagile methodologyteamworkrequirements evaluationsolution architectureperformance tuningdocumentationincident detection
Certifications
Technical Degree in Information Technology