
Senior Deep Learning Engineer
NVIDIA
full-time
Location Type: Remote
Location: Germany
Salary
💰 PLN 292,500 - PLN 650,000 per year
About the role
- Improve inference speed for Cosmos WFMs on GPU platforms.
- Deploy Cosmos WFMs to production.
- Profile and analyze deep learning workloads to identify and remove bottlenecks.
Requirements
- 8+ years of experience
- MSc or PhD in CS, EE, or CSEE, or equivalent experience
- Strong background in Deep Learning
- Strong programming skills in Python and PyTorch
- Experience with inference optimization techniques (such as quantization) and with one of the following inference optimization frameworks: TensorRT, TensorRT-LLM, vLLM, SGLang
- Familiarity with deploying Deep Learning models in production settings (e.g., Docker, Triton Inference Server)
- CUDA programming experience
- Familiarity with diffusion models
- Proven experience in analyzing, modeling, and tuning the performance of GPU workloads, both inference and training.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
Deep Learning, Python, PyTorch, inference optimization techniques, quantization, TensorRT, TensorRT-LLM, vLLM, SGLang, CUDA programming
Certifications
MSc in Computer Science, PhD in Computer Science, MSc in Electrical Engineering, PhD in Electrical Engineering, MSc in Computer Systems Engineering, PhD in Computer Systems Engineering