NVIDIA

Solutions Architect, Scalability of Agentic Pipelines

NVIDIA

full-time

Posted on:

Location Type: Remote

Location: Remote • 🇬🇧 United Kingdom

Visit company website
AI Apply
Apply

Job Level

SeniorLead

About the role

  • Work directly with key customers to understand their technology and provide the best solutions
  • Develop and demonstrate solutions based on NVIDIA’s and open-source NLP and LLM technology and integrate them into agentic pipelines
  • Perform in-depth analysis and optimisation to ensure the best performance on GPU based systems
  • Partner with Engineering, Product and Sales teams to develop, plan best suitable solutions for customers
  • Enable development and growth of product features through customer feedback and proof-of-concept evaluations

Requirements

  • MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering fields
  • 8+ years of experience
  • Work experience and knowledge of modern LLM, VLM, diffusion architectures with emphasis on MoE
  • Understanding of key libraries used for DNN inference (e.g. TRT-LLM, Dynamo, RedHat Inference Server) as well as agentic pipeline development
  • Strong analytical and problem-solving skills
Benefits
  • Health insurance
  • Professional development

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
NLPLLM technologyDNN inferenceMoEanalysisoptimisationagentic pipeline developmentGPU systemsdiffusion architecturesproof-of-concept evaluations
Soft skills
analytical skillsproblem-solving skillscustomer engagementcollaborationcommunication
Certifications
MS in Computer SciencePhD in Computer Scienceequivalent experience