Infrastructure Engineer

• Design, deploy, and manage the compute infrastructure powering Fluidstack's GPU clusters
• Design and implement GPU/ASIC infrastructure at the server, rack, and system level
• Troubleshoot complex failures in GPU and compute systems
• Develop and maintain hardware/firmware management services
• Automate all aspects of the server lifecycle
• Own the end-to-end compute lifecycle, including partnering with vendors on RMAs
• Serve as the main point of contact for hardware escalations and troubleshooting
• Monitor system performance to identify and resolve bottlenecks
• Automate deployment and management tasks to improve efficiency
• Collaborate with storage and network teams to ensure cohesive infrastructure operations
• Work closely with hardware and software teams to support AI workloads

Senior / Staff Infrastructure Engineer, Compute

About the role

Requirements