
Staff Operations Engineer – DevOps/MLOps
Adobe
full-time
Posted on:
Location Type: Office
Location: San Jose • California • United States
Visit company websiteExplore more
Salary
💰 $159,200 - $301,600 per year
Job Level
About the role
- Build and automate cloud infrastructure provisioning, scaling, and deployments using industry-standard tools and infrastructure-as-code practices
- Architect and implement end-to-end MLOps pipelines for packaging, deploying, and monitoring large-scale ML services
- Build and integrate telemetry agents to capture operational, performance, and inference metrics across distributed ML services
- Build backend dashboards and observability workflows that surface quality, performance, traffic, and reliability insights for ML services
- Lead the development of Agentic Ops solutions to optimize large-scale ML production workflows, reduce MTTR, and increase service engineering productivity
- Develop and maintain robust CI/CD pipelines (e.g., GitLab CI, GitHub Actions, Jenkins) enabling automated model conversion, optimization (PTQ/QAT), and artifact packaging
- Drive standards in reliability, cost optimization, and operational readiness across service deployments
Requirements
- 8+ years of experience in DevOps, SRE, or cloud infrastructure engineering roles
- Demonstrated experience designing and managing MLOps lifecycles, including model deployment, inference optimization, and production monitoring
- Strong knowledge of CI/CD methodologies and tools such as GitOps, Docker, Terraform, GitHub Actions, GitLab CI, or Jenkins
- Hands-on expertise with Kubernetes orchestration, including frameworks such as Kubeflow, Argo Workflows, or similar systems
- Strong programming skills in Python, with experience building automation tooling for ML or DevOps workflows
- Proficiency with observability and monitoring platforms (e.g., Prometheus, Grafana, Splunk, New Relic) for building reliable production systems
- Experience optimizing distributed architectures for cost efficiency, reliability, and performance
- Familiarity with deep learning frameworks (e.g., PyTorch, TensorFlow) and model optimization tools such as ONNX, TensorRT, TFLite, AOT, etc., is a strong plus.
Benefits
- Health insurance
- 401(k) matching
- Flexible work arrangements
- Professional development opportunities
- Paid time off
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
cloud infrastructure provisioningMLOps pipelinesCI/CD pipelinesautomation toolingKubernetes orchestrationprogramming in Pythonmodel deploymentinference optimizationdeep learning frameworksmodel optimization tools
Soft Skills
leadershipcommunicationorganizational skillsproblem-solvingcollaboration