
Staff Software Engineer
DataRobot
full-time
Posted on:
Location Type: Remote
Location: California • Massachusetts • United States
Visit company websiteExplore more
Job Level
About the role
- Build a system that ensures micro-services are secure, performant, reliable, and can go from idea to production in an hour.
- Build a system that continuously provides recommendations to right-size computing resources for Kubernetes to ensure efficient cloud spending for ourselves and our customers.
- Design and architect automated quality platforms to go from Enterprise-Grade releases from once-a-quarter to once-a-week to once-per-day to once-per-hour without sacrificing performance, security, or reliability
- Work with Product, Legal and Security to ensure the continuous delivery processes you build are compliant and secure
- Work with the team to ensure pipelines have clear playbooks and can operate 24/7 without you
- Work with a diverse group of architects and platform engineers across our R&D department to set continuous delivery and performance requirements for all production services
- Work with internal product managers to set roadmaps and define milestones to deliver innovative and simple solutions to our many teams’ continuous delivery and platform engineering issues
Requirements
- 8+ years of experience
- Expert in developing a wide variety of software with Python (5+ years)
- Experience designing and operating diverse CI/CD pipelines with Harness.io
- Experience designing and innovating large-scale horizontal and vertically-scaled build, testing, and deployment systems for Kubernetes environments and familiarity with Helm charts
- Preferred: Golang, Terraform and Terragrunt
- Nice to Have: Direct experience with modern distributed compute frameworks (e.g., Ray, Dask) and large-scale job schedulers (e.g., Slurm, Kueue).
- CKAD (Certified Kubernetes Application Developer) certification
- Publicly reviewable contributions to interesting development projects.
- Agentic AI experience
- Experience working with NVIDIA infrastructure in managing (NIM Operator, NVIDIA GPU Operator)
Benefits
- Medical, Dental & Vision Insurance
- Flexible Time Off Program
- Paid Holidays
- Paid Parental Leave
- Global Employee Assistance Program (EAP) and more!
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PythonCI/CD pipelinesKubernetesHelm chartsGolangTerraformTerragruntdistributed compute frameworksjob schedulersautomated quality platforms
Soft Skills
collaborationcommunicationproblem-solvingleadershiporganizational skills
Certifications
CKAD