
Senior Deep Learning Engineer
NVIDIA
full-time
Location Type: Remote
Location: Poland
Salary
💰 PLN 221,250 - PLN 507,000 per year
About the role
- Improve inference speed for Cosmos World Foundation Models (WFMs) on GPU platforms.
- Deploy Cosmos WFMs to production.
- Profile and analyze deep learning workloads to identify and remove bottlenecks.
Requirements
- 5+ years of experience.
- MSc or PhD in CS, EE, or CSEE, or equivalent experience.
- Strong background in Deep Learning.
- Strong programming skills in Python and PyTorch.
- Experience with inference optimization techniques (such as quantization) and with one of the following inference optimization frameworks: TensorRT, TensorRT-LLM, vLLM, or SGLang.
- Familiarity with deploying Deep Learning models in production settings (e.g., Docker, Triton Inference Server).
- CUDA programming experience.
- Familiarity with diffusion models.
- Proven experience in analyzing, modeling, and tuning the performance of GPU workloads, both inference and training.
Applicant Tracking System Keywords
Hard Skills & Tools
- Deep Learning
- Python
- PyTorch
- Inference optimization techniques
- Quantization
- TensorRT
- TensorRT-LLM
- vLLM
- SGLang
- CUDA programming
Certifications
- MSc in Computer Science
- PhD in Computer Science
- MSc in Electrical Engineering
- PhD in Electrical Engineering
- MSc in Computer Systems Engineering
- PhD in Computer Systems Engineering