ScienceLogic

Senior Site Reliability Engineer, Observability

ScienceLogic

full-time

Posted on:

Location Type: Remote

Location: VirginiaUnited States

Visit company website

Explore more

AI Apply
Apply

Job Level

About the role

  • Be a key contributor on an Agile development team, collaboratively realizing business value through iterative software development lifecycle
  • Build and execute the monitoring strategy for ScienceLogic SaaS infrastructure
  • Define, deploy, and maintain system and service monitors
  • Be the authority for various monitoring technologies like Prometheus, AWS Cloudwatch, Scylla manager, New Relic to provide next generation monitoring solutions for ScienceLogic SaaS
  • Employ advanced monitoring practices and technologies to detect and automatically resolve platform issues before they impact the customer’s experience.
  • Participate in architecture and operations reviews
  • Identify and automate measurement of operations SLAs, SLOs using SLIs
  • Triage incident response, document SOPs, Runbooks and train NOC team members
  • Participate in shared on-call manager rotation for escalations during incidents and outages, occasionally during off hours
  • Provide dash boarding and analytics solutions to internal teams based on requirements

Requirements

  • 8+ years of software development or site reliability engineering or equivalent experience
  • Skilled at problem solving, algorithms, and data structures
  • Building tools and scripting frameworks from scratch
  • Working with Cloud Automation tools like CloudFormation, Terraform, CDK, aws-cli
  • Scripting languages like Python, Groovy, PowerShell, Bash, Perl etc.
  • Configuration automation using Ansible or equivalent tools
  • Exposure to Windows and Linux administration skills
  • Project management tools like Jira, Trello
  • Prior experience in dealing with Datastore technologies like Postgres, MySQL, SQL, DynamoDB is desirable
  • Familiarity with basic networking, security and cloud engineering concepts
  • Team player who is eager to help others to succeed through mentoring and leading by example
  • Highly collaborative with effective written and verbal communication skills.
Benefits
  • Comprehensive medical, dental and vision plans
  • 401(k) plan with employer match
  • Flexible Paid Time Off (FTO) so that you can take the time that you need to re-energize
  • Volunteer Time Off (VTO) - take two days off per calendar year to volunteer with your preferred charitable organization
  • 5-year Service Milestone Sabbatical
  • Paid parental leave
  • Generous employee referral bonus program
  • Pet insurance
  • HQ Office centrally located in Reston Town Center featuring a well-stocked kitchen with rotating snacks and beverages, and catered lunch on Thursdays
  • Regular virtual company-wide events, including cooking classes, yoga, meditation and more
  • Mentorship and professional development opportunities with experienced product marketing leaders
  • The opportunity to learn and develop from some of the best and brightest minds in the industry!
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Agile developmentmonitoring strategysystem monitorsmonitoring technologiesSLAsSLOsincident responsedash boardingdata structuresconfiguration automation
Soft Skills
problem solvingteam playermentoringcollaborativeeffective communication