Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
Dropbox

Staff Site Reliability Engineer – Production Engineering

Dropbox

Site Reliability Engineer at Dropbox focused on advancing stability and operational excellence in software delivery. Collaborating across teams to define reliability goals with AI technologies.

Posted 6/3/2026full-timeRemote • 🇺🇸 United StatesLead💰 $223,400 - $302,200 per yearWebsite

Tech Stack

Tools & technologies
Distributed Systems

About the role

Key responsibilities & impact
  • Define and evolve Dropbox’s company-wide technical reliability strategy to support the changing engineering environment created by AI-assisted and agentic software development.
  • Set multi-year reliability goals, standards, and roadmaps across observability, debugging, incident management, service health, and operational readiness.
  • Lead cross-team initiatives that reduce reliability risk as software delivery velocity, pull request volume, service complexity, and incident volume increase.
  • Partner with engineering leaders and platform teams to improve monitoring, alerting, debugging, SLOs, SLAs, and incident response systems at company scale.
  • Identify emerging reliability risks introduced by AI-enabled development workflows and design scalable systems, processes, and guardrails to mitigate them.
  • Provide technical leadership and mentorship to engineers across teams, raising engineering quality, reliability judgment, and operational excellence.
  • Drive clear communication and alignment with senior stakeholders on reliability priorities, tradeoffs, risks, and execution progress.

Requirements

What you’ll need
  • BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent technical experience.
  • 12+ years of experience in software engineering, site reliability engineering, infrastructure engineering, or related technical roles.
  • Proven ability to define and deliver multi-year, multi-team reliability, infrastructure, or platform strategies with measurable business and customer impact.
  • Deep experience with distributed systems, production operations, observability, incident response, SLOs/SLAs, debugging, and reliability risk management.
  • Demonstrated ability to diagnose complex technical problems, debug production systems, automate operational workflows, and design resilient software components.
  • Experience influencing engineering roadmaps across multiple teams and making technical decisions that optimize for the broader engineering organization.
  • Strong communication and collaboration skills, with the ability to align cross-functional stakeholders through ambiguity and drive execution across teams.

Benefits

Comp & perks
  • 🌐 Worldwide ❌ Jobs You've Hidden ⭐️ Saved Jobs ✅ Applied Jobs ✉️ Email Alerts 👤 Account Dropbox Website LinkedIn All Job Openings 1001 - 5000 employees Founded 2007 🏢 Enterprise ⚡ Productivity Cloud Storage
  • Enterprise
  • Productivity Dropbox is a cloud-based service that provides tools for storing, sharing, and accessing files across devices. It offers features such as document sharing, video review, automatic backups, and AI-driven scheduling. Dropbox also provides solutions for different sectors like teams, sales, marketing, and education, and industries including construction, media, technology, and manufacturing. With a focus on security, Dropbox ensures files are encrypted and protected against tampering. It offers integrations with various productivity tools and is trusted by major companies for efficient file management and collaboration. Staff Site Reliability Engineer – Production Engineering Job not on LinkedIn 🔥 1 hour ago 🇺🇸 United States – Remote 💵 $223.4k - $302.2k / year ⏰ Full Time 🔴 Lead 🏭 Production Engineer 🦅 H1B Visa Sponsor Distributed Systems Apply Now Find Hiring Managers Customize resume + cover letter Report problem ☆ Save ☑️ Mark as applied ❌ Hide 📋 Description
  • Define and evolve Dropbox’s company-wide technical reliability strategy to support the changing engineering environment created by AI-assisted and agentic software development.
  • Set multi-year reliability goals, standards, and roadmaps across observability, debugging, incident management, service health, and operational readiness.
  • Lead cross-team initiatives that reduce reliability risk as software delivery velocity, pull request volume, service complexity, and incident volume increase.
  • Partner with engineering leaders and platform teams to improve monitoring, alerting, debugging, SLOs, SLAs, and incident response systems at company scale.
  • Identify emerging reliability risks introduced by AI-enabled development workflows and design scalable systems, processes, and guardrails to mitigate them.
  • Provide technical leadership and mentorship to engineers across teams, raising engineering quality, reliability judgment, and operational excellence.
  • Drive clear communication and alignment with senior stakeholders on reliability priorities, tradeoffs, risks, and execution progress. 🎯 Requirements
  • BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent technical experience.
  • 12+ years of experience in software engineering, site reliability engineering, infrastructure engineering, or related technical roles.
  • Proven ability to define and deliver multi-year, multi-team reliability, infrastructure, or platform strategies with measurable business and customer impact.
  • Deep experience with distributed systems, production operations, observability, incident response, SLOs/SLAs, debugging, and reliability risk management.
  • Demonstrated ability to diagnose complex technical problems, debug production systems, automate operational workflows, and design resilient software components.
  • Experience influencing engineering roadmaps across multiple teams and making technical decisions that optimize for the broader engineering organization.
  • Strong communication and collaboration skills, with the ability to align cross-functional stakeholders through ambiguity and drive execution across teams. Apply Now 📊 Check your resume score for this job Improve your chances of getting an interview by checking your resume score before you apply. Check Resume Score Similar Jobs Principal Software Engineer – AI Platform, Production Engineering, Reliability 🕒 5 days ago CVS Health 10,000+ employees ⚕️ Healthcare Insurance 🛒 Retail 🧘 Wellness Website LinkedIn All Job Openings Principal Software Engineer leading production excellence for AI Platform at CVS Health. Drive operational readiness and observability for AI services with high availability and performance standards. 🇺🇸 United States – Remote 💵 $144.2k - $288.4k / year ⏰ Full Time 🔴 Lead 🏭 Production Engineer AWS Azure Cloud Distributed Systems Google Cloud Platform Engineering Manager – DGX Cloud Production 🕒 6 days ago NVIDIA 10,000+ employees 🤖 Artificial Intelligence 🎮 Gaming Website LinkedIn All Job Openings Engineering Manager leading a team focused on Kubernetes-based operations and automation for DGX Cloud infrastructure at NVIDIA. Collaborating with various teams to enhance production readiness and reliability. 🇺🇸 United States – Remote 💵 $224k - $356.5k / year ⏰ Full Time 🟠 Senior 🔴 Lead 🏭 Production Engineer 🦅 H1B Visa Sponsor Cloud Kubernetes Principal Software Engineer, DGX Cloud Production Engineering 🕒 May 19 NVIDIA 10,000+ employees 🤖 Artificial Intelligence 🎮 Gaming Website LinkedIn All Job Openings Principal Software Engineer for NVIDIA DGX Cloud shaping technical direction and leading production engineering efforts. Focus on automation, reliability, and Kubernetes-based operations for large-scale GPU infrastructure. 🇺🇸 United States – Remote 💵 $272k - $431.3k / year ⏰ Full Time 🔴 Lead 🏭 Production Engineer 🦅 H1B Visa Sponsor Cloud Distributed Systems Kubernetes Linux Python Go Principal Engineer, Software, Production Engineering 🕒 March 28 Palo Alto Networks 10,000+ employees 🔒 Cybersecurity 🏢 Enterprise Website LinkedIn All Job Openings Infrastructure Engineer developing internal tools and optimizing platform components for Palo Alto Networks' Chronosphere. Collaborating globally with teams to enhance developer productivity and system reliability. 🇺🇸 United States – Remote 💵 $147k - $237.5k / year 💰 $1M Seed Round - Morta Security on 2013-02 ⏰ Full Time 🔴 Lead 🏭 Production Engineer 🦅 H1B Visa Sponsor AWS Cloud Distributed Systems Google Cloud Platform Java Kubernetes Linux Rust Go View More Production Engineer Jobs 🌐 Worldwide Built by Lior Neu-ner. I'd love to hear your feedback — Get in touch via DM or support@remoterocketship.com Search Search Jobs by country Search jobs by city Search jobs by job title Search entry-level jobs Search junior-level jobs Search senior-level jobs Search jobs by tech stack Search jobs by contract type Search remote internships Search remote part-time jobs Remote jobs Anywhere in the World Companies Hiring Anywhere in the World Companies Hiring Sales People Anywhere in the World Companies Hiring Software Engineers Anywhere in the World Resources Advice Tips for finding remote jobs Interview questions and answers Resume examples Cover letter examples Post a job Affiliates Privacy policy Terms of service Job board SEO course AI Apply Copilot OpenClaw job finder Jobs by Country Remote jobs anywhere in the world (Worldwide remote jobs) Remote jobs United States Remote jobs Australia Remote jobs Brazil Remote jobs Canada Remote jobs France Remote jobs Ireland Remote jobs Germany Remote jobs Netherlands Remote jobs Spain Remote jobs UK Popular Jobs Remote data analyst jobs Remote customer support jobs Remote executive assistant jobs Remote marketing jobs Remote product designer jobs Remote product manager jobs Remote project manager jobs Remote recruiter jobs Remote sales jobs Remote software engineer jobs Jobs by Type Remote full-time jobs Remote part-time jobs Remote contract jobs Remote internship jobs Remote entry-level jobs Remote jobs with no experience required Remote junior jobs (1-3 years of experience) Digital nomad jobs Remote jobs with no degree required Freelance remote jobs Temporary remote jobs Remote jobs hiring now Stay at home mom jobs

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
software engineeringsite reliability engineeringinfrastructure engineeringdistributed systemsobservabilityincident responseSLOsSLAsdebuggingreliability risk management
Soft Skills
technical leadershipmentorshipcommunicationcollaborationalignmentinfluenceproblem diagnosisexecutioncross-functional teamworkambiguity management
Certifications
BS degree in Computer Scienceequivalent technical experience