Jellyvision

Manager, Site Reliability Engineering

full-time

Location Type: Remote

Location: Remote • California, Colorado, Florida, Illinois, Kentucky, Minnesota, Missouri, New York, North Carolina, Ohio, Oregon, Pennsylvania, South Carolina, Tennessee, Texas, Utah, Virginia, Washington, Wisconsin • 🇺🇸 United States

Salary

💰 $160,000 - $170,000 per year

Job Level

Mid-Level, Senior

Tech Stack

AWS, Azure, Cloud, Distributed Systems, Docker, Google Cloud Platform, Kubernetes, Microservices

About the role

  • Directly manage a team of onshore and offshore software engineers
  • Lead and elevate our existing SRE team to world-class performance standards, advancing career development and technical excellence
  • Optimize and mature our established SRE practices, enhancing SLO/SLI frameworks, error budget management, and incident response effectiveness
  • Strengthen our culture of reliability and observability, driving higher standards for continuous improvement across all engineering teams
  • Refine existing on-call processes, escalation procedures, and post-incident reviews to accelerate learning and prevent recurring issues
  • Drive an AI-first agenda, leveraging AI tooling to address key pain points and improve speed to market
  • Partner with Product and Engineering leadership to help deliver the core product technology roadmap, balancing feature delivery with reliability and scalability requirements
  • Drive strategic decisions on technology consolidation and simplification to reduce operational overhead and costs
  • Lead technology platform evaluations and migrations that align with business objectives and cost optimization goals
  • Implement comprehensive monitoring, alerting, and observability solutions across all systems
  • Establish reliability engineering practices, including load testing and capacity planning
  • Drive automation initiatives that reduce manual toil and improve system reliability
  • Create and maintain disaster recovery and business continuity plans
  • Work closely with Platform & Infrastructure, Product Development, and Security teams to ensure aligned priorities
  • Collaborate with Finance and Operations teams on cost optimization and resource planning initiatives
  • Present technical strategies and progress to executive leadership

Requirements

  • 6+ years of software engineering experience with 2+ years in SRE, DevOps, or infrastructure leadership roles
  • Proven experience building and scaling SRE teams at high-growth technology companies
  • Deep expertise in cloud platforms (AWS, GCP, Azure), containerization (Kubernetes, Docker), and Infrastructure as Code
  • Strong background in distributed systems, microservices architecture, and database technologies
  • Experience with monitoring and observability tools (Dynatrace, Datadog, New Relic, etc.)
  • Experience with AI automation and workflow optimization tools
  • Demonstrated success leading engineering teams composed of FTEs and offshore contractors
  • Track record of driving significant cost reductions through technology optimization and consolidation
  • Experience managing complex technical roadmaps with competing priorities and resource constraints
  • Strong analytical skills with the ability to make data-driven decisions on technology investments
  • Excellent written and verbal communication skills with the ability to present to executive audiences
  • Experience translating technical concepts into business impact and ROI metrics
  • Proven ability to influence cross-functional teams and drive consensus on technical decisions

Benefits

  • Check out our benefits here!

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
software engineering, site reliability engineering (SRE), DevOps, cloud platforms, containerization, Infrastructure as Code, distributed systems, microservices architecture, database technologies, automation
Soft skills
leadership, communication, analytical skills, collaboration, influence, strategic decision-making, problem-solving, team management, presentation skills, continuous improvement