DevOps Engineer

• Acts as a project or system leader, coordinating the activities of other engineers on the project or within the system
• Determines the technical tasks that other engineers will follow
• Proactively improves existing structures & processes and exercises judgement in reconciling diverse priorities
• Define mobile-specific SLIs and SLOs (e.g., crash-free sessions, ANRs, app startup time, network success rates, battery/memory usage)
• Establish best practices for observability, alerting, and incident response in Datadog
• Lead development of automation and tools for mobile reliability (automated regression detection, performance benchmarking, crash/ANR triage, release health dashboards, instrumentation libraries)
• Ensure tooling aligns with existing systems (Harness for CI/CD, Gradle/Bazel for builds)
• Act as primary liaison with backend/web SRE leadership for incident response and shared visibility
• Partner with Release Engineering, QA, and Product to ensure operational readiness of new features
• Influence architecture and design decisions to prioritize mobile reliability
• Lead cultural change: define and roll out on-call model for mobile teams and champion a blameless postmortem culture
• Mentor and guide a distributed team of senior Mobile SREs and provide technical leadership in complex incidents
• Help recruit and onboard new SREs and set technical and cultural standards
• Partner with infrastructure and developer productivity teams to integrate Bazel and Gradle builds into reliable CI/CD pipeline and establish long-term roadmaps for mobile reliability

Mobile Site Reliability Engineer

Senior Site Reliability Engineer – Networking

Senior DevOps Engineer – Cloud Platform

Senior DevOps Engineer

Site Reliability Engineer

Site Reliability Engineer