Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
Montauk Capital

Head of Infrastructure

Montauk Capital

Head of Infrastructure leading design and implementation of GPU infrastructure. Overseeing supply chain decisions and technical execution for Edge AI compute platform.

Posted 5/6/2026full-timeNew York City • New York • 🇺🇸 United StatesLeadWebsite

Tech Stack

Tools & technologies
LinuxShell Scripting

About the role

Key responsibilities & impact
  • Own GPU infrastructure design and implementation details from planning through deployment
  • Own hardware selection, configuration, and deployment across early compute infrastructure
  • Help turn early technical groundwork into a functioning deployed system
  • Own the GPU roadmap we use to entice customers and build partnerships
  • Deploy, operate, and tune GPU clusters for both bare-metal and internal software stack
  • Own resilient networking implementation from each site to the cluster, including a robust OOB network for constant monitoring and management
  • Manage deployments at production scale
  • Interface with site ops on power, cooling, and connectivity
  • Build the automation and monitoring stack for distributed edge nodes
  • Own the supply chain for all infrastructure gear
  • Manage third party hardware vendors on provisioning, maintenance and break-fix support

Requirements

What you’ll need
  • Strong infrastructure engineering experience and systems-level technical judgment
  • Experience deploying or managing compute infrastructure in real-world environments
  • Experience with data center, hardware, or GPU-based systems implementation
  • Experience owning GPU provisioning, hardware selection, and systems configuration
  • GPU scheduling and orchestration specifics: GPU type awareness, memory management, topology considerations, placement strategies for multi-GPU jobs, and fragmentation minimization
  • Bare-metal provisioning lifecycle: IPMI/Redfish, BMC-based remote management, PXE boot, and automated OS deployment workflows
  • On-board storage
  • Observability stack: distributed configuration and troubleshooting, plus monitoring, alerting, and tracing
  • Deployment planning, Hardware configuration, Operational troubleshooting
  • Linux systems depth: RHEL/Ubuntu, low-level troubleshooting, shell scripting
  • Security and operational best practices for bare metal
  • Deployment tooling at production scale
  • Networking fundamentals for inference workloads and OOB management
  • Startup / 0→1 DNA: You ship fast and communicate clearly.

Benefits

Comp & perks
  • Competitive compensation + equity: True ownership over what you build

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
GPU infrastructure designhardware selectionGPU provisioningsystems configurationGPU schedulingbare-metal provisioningLinux systemsshell scriptingobservability stacknetworking fundamentals
Soft Skills
infrastructure engineering experiencesystems-level technical judgmentdeployment planningoperational troubleshootingcommunicationstartup mindset