
Hardware Sustaining Engineer
DigitalOcean
full-time
Posted on:
Location Type: Remote
Location: California • Colorado • United States
Visit company websiteExplore more
Salary
💰 $83,000 - $104,000 per year
About the role
- Act a member of the Sustaining Engineering team in the Infra::Machines::Design Organization
- Support server hardware, cabling, and networking hardware throughout its operational lifecycle
- Monitor the #machines channel and MACHINES JIRA project for issues and drive them to resolution
- Participate in 24/7 on-call rotation with other members of the team
- Act as Tier 2 escalation for Datacenter Operations (DCOPS) and Cloud Operations (CloudOps) regarding hardware and firmware components
- Develop and maintain standards and practices for DigitalOcean hardware operations
- Work closely with the Qualification team, Firmware team, Fleet Lifecycle Engineering team (FLE), Foresight team, and Infrastructure Services team to resolve issues in tooling, firmware packages, hardware components, and other operational concerns
- Help with development of tooling and associated runbooks to address gaps in operational capabilities around hardware and firmware operations
- Coordinate with Ops teams on monitoring thresholds, failure modes and alerting
- Assist in troubleshooting cause of failures and work to prevent them in the future
- Raise the quality bar in the delivery of our cloud infrastructure by identifying industry best practices and working to adopt them
Requirements
- Technical Degree (BS Computer Science/Engineering) or equivalent practical experience
- Hands-on experience operating a cloud infrastructure at mid-tier scale or better
- An in-depth understanding of server hardware, firmware, and infrastructure
- Strong knowledge in troubleshooting techniques, Python and BASH
- Extra points for JTAG debugging / Firmware troubleshooting / Wire sniffing experience
- Clear communication and collaboration across key stakeholders
- An insatiable passion for constant improvement.
Benefits
- We prioritize career development.
- We care about your well-being.
- We reward our employees.
- Employee Assistance Program.
- Local Employee Meetups.
- Flexible time off policy.
- Reimbursement for relevant conferences, training, and education.
- Access to LinkedIn Learning's 10,000+ courses.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PythonBASHserver hardwarefirmwarecloud infrastructuretroubleshooting techniquesJTAG debuggingFirmware troubleshootingWire sniffing
Soft Skills
clear communicationcollaborationpassion for improvement
Certifications
BS Computer ScienceBS Engineering