Data Analysis Incorporated

Senior Monitoring & Observability Engineer

The Senior Monitoring & Observability Engineer designs and supports monitoring platforms, collaborating with teams to improve system reliability and incident response in a dynamic environment.

Posted 6/11/2026 • Full-time • Los Angeles • California • 🇺🇸 United States • Senior • 💰 $115,000 - $125,000 per year

Tech Stack

Tools & technologies
Ansible, AWS, Azure, Cloud, Cyber Security, DNS, Google Cloud Platform, Linux, Python, Splunk, TCP/IP

About the role

Key responsibilities & impact
  • Monitor and manage IT infrastructure, network systems, and business applications using enterprise monitoring tools, aligned with the TOC Sr. Engineer scope.
  • Serve as the first point of escalation for TOC Engineers, providing advanced troubleshooting, guidance, and root cause analysis.
  • Lead or support incident response, root cause analysis, escalation, and post-incident review processes; ensure issues are properly classified, escalated, and resolved efficiently.
  • Take key roles in ITIL Incident, Problem, and Change Management processes.
  • Build and tune monitoring and observability tooling (instrumentation, integrations, dashboards, alert logic, synthetic checks, log pipelines, and APM configuration) rather than just consuming it.
  • Develop and implement automation scripts and tooling to improve operational efficiency, alerting quality, and response times (Python, PowerShell, Bash, Ansible, or similar).
  • Analyze system logs, network traffic, event data, and performance metrics to identify trends, reduce alert noise, and prevent outages.
  • Document monitoring standards, troubleshooting steps, system configurations, dashboards, and runbooks for knowledge sharing.
  • Collaborate with IT, Security, and DevOps teams to maintain system reliability and security posture.
  • Work with vendors and service providers to resolve tool, platform, and infrastructure issues.
  • Participate in 24/7 on-call rotations and provide leadership during major incidents, helping coordinate cross-functional resolution efforts.
  • Mentor junior TOC/NOC engineers on monitoring tools, dashboards, alert handling, and incident response practices.

Requirements

What you’ll need
  • Bachelor’s degree in IT, Computer Science, Networking, or a related field (or equivalent work experience).
  • 3+ years of experience in IT operations, network monitoring, or system administration, with hands-on experience implementing and tuning enterprise monitoring/observability platforms.
  • Demonstrated experience building or implementing (not just using) one or more of: Datadog, Dynatrace, AppDynamics, Splunk, SolarWinds Orion, Orion DPA, Nagios, PRTG, or Zabbix.
  • Advanced understanding of network protocols (TCP/IP, BGP, OSPF, VLANs, VPN, DNS, DHCP).
  • Proficiency in Windows/Linux environments and at least one major cloud platform (AWS, Azure, or GCP).
  • Familiarity with ITIL best practices for incident, problem, and change management.
  • Scripting and automation experience using Python, PowerShell, Bash, Ansible, or similar tools.
  • Working knowledge of cybersecurity best practices, firewall configurations, and SIEM tools.
  • Strong leadership, communication, and collaboration skills, including the ability to translate monitoring data into clear operational action across cross-functional teams.
  • Ability to work in a high-stress, dynamic environment while handling multiple high-priority incidents.

About the company

Data Analysis Incorporated (DAI) is a parent company of a diverse group of private firms specializing in equity investment services and data-driven customer communications for the Healthcare, Insurance, and Financial Services industries. With over 60 years of experience, DAI utilizes its expertise in managing and leveraging data to focus on institutional equity markets, digital investment news, research, and customer experience solutions. DAI provides strategic direction and oversight to its subsidiaries while offering shared services like administration, finance, and human resources.
  • Company size: 501 - 1000 employees
  • Industries: Finance, Healthcare Insurance, B2B
  • Work arrangement: Onsite, Los Angeles
  • H1B visa sponsor

ATS Keywords

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Python, PowerShell, Bash, Ansible, network monitoring, system administration, network protocols, automation scripting, monitoring tools, observability platforms
Soft Skills
leadership, communication, collaboration, troubleshooting, root cause analysis, incident response, mentoring, problem-solving, documentation, high-stress management