
Senior Performance Architect
quadric.io
full-time
Posted on:
Location Type: Hybrid
Location: Burlingame • California • United States
Visit company websiteExplore more
Job Level
Tech Stack
About the role
- Analyze application performance across the full stack: C++/Python source, compiler output, assembly, and hardware execution
- Identify and localize performance bottlenecks to specific code regions, assembly sequences, or architectural limitations
- Implement proof-of-concept fixes and optimizations to validate proposed solutions before broader rollout
- Develop and maintain profiling infrastructure, benchmarks, and performance regression tests
- Collaborate with compiler engineers to improve code generation and optimization passes
- Work with hardware architects to identify microarchitectural improvements and validate performance models
- Create performance models that predict workload behavior and guide optimization priorities
- Document findings and communicate performance insights to both technical and non-technical stakeholders
- Support customer engagements by analyzing their workloads and recommending optimizations
Requirements
- BS/MS in Computer Science, Computer Engineering, or Electrical Engineering with 5+ years of performance analysis experience
- Strong proficiency in C++ and Python; ability to read, reason about, and write optimized code at the assembly level
- Hands-on mentality: comfortable implementing proof-of-concept solutions, not just identifying problems
- Deep understanding of computer architecture: pipelines, caches, memory hierarchies, SIMD/vector execution
- Experience with profiling tools (perf, VTune, custom trace analysis) and performance debugging methodologies
- Ability to trace performance issues from application behavior down to microarchitectural root causes
- Strong analytical and problem-solving skills with attention to detail
- Excellent communication skills; ability to explain complex performance issues to diverse audiences
- Experience working cross-functionally with compiler, runtime, and hardware teams
- Nice to Have
- Experience with ML/AI workloads and frameworks (PyTorch, TensorFlow, ONNX)
- Background in compiler development or code generation
- Experience with GPU, DSP, or custom accelerator architectures
- Familiarity with cycle-accurate simulation and performance modeling tools
Benefits
- Competitive salary and meaningful equity
- Medical, dental, and vision coverage starting on day one
- 401(k) retirement plan
- Flexible paid time off (unlimited, non-accrual) to support work-life balance
- When working in-office, enjoy company-provided lunches and a stocked kitchen
- Convenient office location within walking distance of the Caltrain station
- Support for commuting, including monthly parking or Caltrain passes
- Downtown Burlingame office location, close to shops, cafes, and local amenities
- A politics-free, highly collaborative environment where talented people can do their best work and make an immediate impact
- The opportunity to build long-term career relationships in a company that values strong personal connections alongside professional excellence
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
C++Pythonassemblyperformance analysisprofiling toolsperformance debuggingcomputer architectureSIMDperformance modelingoptimization
Soft Skills
analytical skillsproblem-solving skillsattention to detailcommunication skillscollaborationcross-functional teamworkhands-on mentality
Certifications
BS in Computer ScienceMS in Computer ScienceBS in Computer EngineeringMS in Computer EngineeringBS in Electrical EngineeringMS in Electrical Engineering