DigitalOcean

Senior Engineer – Inference Optimizations

DigitalOcean

full-time

Posted on:

Location Type: Remote

Location: CaliforniaColoradoUnited States

Visit company website

Explore more

AI Apply
Apply

Salary

💰 $167,200 - $209,000 per year

Job Level

Tech Stack

About the role

  • Lead the technical strategy for benchmarking and performance optimizations
  • Engineer solutions for complex performance issues
  • Implement cutting-edge optimization techniques
  • Act as the subject matter expert on modern GPU families
  • Lead by example through high-quality code and design reviews
  • Partner with Product Management and TPMs to translate theoretical limits into shippable product features
  • Maintain a strong presence in the GPU infrastructure and optimization communities

Requirements

  • 5+ years of experience in high-performance computing or AI infrastructure
  • Deep familiarity with the Gen AI (LLM, VLM, LMM) landscape
  • Hands-on experience with attention-layer optimizations
  • Comprehensive understanding of NVIDIA and AMD GPU architectures
  • Extensive experience integrating, building with, and contributing to open-source software projects
  • Excellent system design skills related to low-level GPU programming
  • Experience acting as a technical lead
Benefits
  • Competitive salary
  • Flexible working hours
  • Professional development opportunities
  • Employee Assistance Program
  • Equity compensation
  • Reimbursement for relevant conferences, training, and education
  • Access to LinkedIn Learning's 10,000+ courses
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
high-performance computingAI infrastructureattention-layer optimizationsNVIDIA GPU architectureAMD GPU architecturelow-level GPU programmingopen-source softwareperformance optimization techniquesbenchmarkingtechnical leadership
Soft Skills
leadershipcommunicationcollaborationproblem-solvingdesign review