
Solutions Architect, Scalability of Agentic Pipelines
NVIDIA
full-time
Posted on:
Location Type: Remote
Location: Remote • 🇬🇧 United Kingdom
Visit company websiteJob Level
SeniorLead
About the role
- Work directly with key customers to understand their technology and provide the best solutions
- Develop and demonstrate solutions based on NVIDIA’s and open-source NLP and LLM technology and integrate them into agentic pipelines
- Perform in-depth analysis and optimisation to ensure the best performance on GPU based systems
- Partner with Engineering, Product and Sales teams to develop, plan best suitable solutions for customers
- Enable development and growth of product features through customer feedback and proof-of-concept evaluations
Requirements
- MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering fields
- 8+ years of experience
- Work experience and knowledge of modern LLM, VLM, diffusion architectures with emphasis on MoE
- Understanding of key libraries used for DNN inference (e.g. TRT-LLM, Dynamo, RedHat Inference Server) as well as agentic pipeline development
- Strong analytical and problem-solving skills
Benefits
- Health insurance
- Professional development
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
NLPLLM technologyDNN inferenceMoEanalysisoptimisationagentic pipeline developmentGPU systemsdiffusion architecturesproof-of-concept evaluations
Soft skills
analytical skillsproblem-solving skillscustomer engagementcollaborationcommunication
Certifications
MS in Computer SciencePhD in Computer Scienceequivalent experience