FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.
Tech Stack
Tools & technologiesAssemblyCloudDockerJMeterKubernetesLinuxOpen SourceRust
About the role
Key responsibilities & impact- Make independent technical contributions to help their team deliver its commitments.
- Optimize AI, HPC, and other workloads and benchmarks for our RISC-V silicon with vector extensions.
- Identify bottlenecks with workload characterization and drive implementation of solutions.
- Decompose applications and how they run on the underlying hardware.
- Understand performance at a system level, from CPU, memory, storage, power utilization, etc.
- Profile and optimize on our pre-silicon (simulators, emulators, FPGAS), and silicon environments.
- Troubleshoot and root cause analysis, to resolve performance issues.
- Optimize HPC and AI libraries and runtimes.
- Work with customers to optimize their important workloads for our architecture.
- Contribute to instrumentation of performance and power at the operating system level.
- Integrate with CI/CD to automate performance benchmarking.
- Present findings to senior technical leaders at our company and at industry conferences.
- Contribute upstream where appropriate.
- Interact with colleagues across projects to unblock issues, or to reach consensus on technical topics.
- Work with hardware teams, influencing hardware/software co-design.
- Work with teams developing compilers, firmware, AI stacks, middleware, etc., and contribute as necessary.
- Collaborate with colleagues through code reviews, bug triaging, design documents,...
- Contribute to shared team responsibilities (CI failure triaging, documentation, code fixits, rotations...).
Requirements
What you’ll need- Bachelor or Master’s degree in a relevant field.
- Experience in the relevant field (1-3yr).
- Solid knowledge about computer architecture (CPU, cache, memory hierarchy, buses, etc.).
- Proven experience with performance optimization, and experience with performance profiling and analysis tools (Linux perf, iperf, lmbench, sysbench, flamegraphs, JMeter, LoadRunner, Gatling, Intel (R) VTUNE, ARM SPE, etc.).
- Solid knowledge of low-level development languages (C, C++, Rust, etc.).
- Experience with assembly language (any architecture).
- Experience with performance benchmarks (CoreMark, FPMark, LMBench, Stream, SPEC CPU, SPEC INT, graph500, CORAL, HPGMG, CoMD, Linpack, SuiteSparse Matrix Collection, MLPERF, ResNet, BERT etc.).
- Solid understanding of operating systems, databases, networking, and cloud technologies.
- Strong problem-solving skills with the ability to troubleshoot and identify complex issues.
- Experience with bare metal (without OS) development for embedded systems and microcontrollers.
- Experience with standard tools for source control (git), continuous integration, task management (Jira).
- Experience with machine learning and/or HPC performance optimization.
- Experience with Fortran.
- Experience with RISC-V development at the assembly-language level.
- Experience with statistical analysis.
- Experience with emulators and simulators (Qemu, Gem5, Spike).
- Experience in containerized environments (Docker, Kubernetes).
- Experience with hypervisor technology (KVM, Xen).
- Experience with open source software development.
Benefits
Comp & perks- Flexible work arrangements
- Professional development opportunities
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
CC++RustAssembly LanguagePerformance BenchmarkingBare Metal DevelopmentStatistical AnalysisMachine Learning OptimizationRISC-V DevelopmentEmulators and Simulators
Soft Skills
Problem-SolvingCollaborationCommunication
