
Maintenance Engineer
Illumina
full-time
Posted on:
Location Type: Hybrid
Location: San Mateo • California • United States
Visit company websiteExplore more
Salary
💰 $170,000 - $250,000 per year
Tech Stack
About the role
- Monitor, maintain, and improve the reliability of production AI systems and workflow infrastructure
- Proactively identify, diagnose, and resolve system issues across application, integration, and cloud infrastructure layers
- Own incident response processes, including root cause analysis and long-term remediation
- Implement monitoring, alerting, and observability tooling to ensure system health and uptime
- Collaborate with Engineering to harden deployments and improve system architecture for resilience and scalability
- Support customer-facing teams by troubleshooting and resolving technical issues in live environments
- Document system configurations, operational procedures, and recovery protocols
- Continuously improve reliability standards, deployment practices, and operational safeguards
Requirements
- 3+ years of experience in support engineering, site reliability engineering, or infrastructure maintenance
- Strong proficiency in Python or scripting languages
- Experience managing cloud infrastructure (AWS, GCP, or Azure)
- Strong problem-solving skills and a proactive, preventative mindset
- Clear communication skills and ability to collaborate across engineering and customer-facing teams
- High ownership and accountability in high-reliability environments
Benefits
- Offers Equity 📊 Check your resume score for this job Improve your chances of getting an interview by checking your resume score before you apply. Check Resume Score
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Pythonscripting languagescloud infrastructure managementincident responseroot cause analysismonitoring toolsalerting toolsobservability toolingsystem architecturedeployment practices
Soft Skills
problem-solvingproactive mindsetclear communicationcollaborationownershipaccountability