FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.
Tech Stack
Tools & technologiesCloudDistributed SystemsDockerKubernetesMicroservicesPythonSpark
About the role
Key responsibilities & impact- Building and shipping generative AI features end to end: from model selection and fine-tuning through to the inference path that serves them.
- Designing multi-agent and RAG architectures, and anomaly detection, that are accurate, observable, and cost-aware at scale.
- Owning the real-time inference layer on Triton and TensorRT, optimising for latency, throughput, and GPU efficiency.
- Standing up the surrounding microservices in Python and FastAPI, containerised and orchestrated for reliability.
- Setting the technical bar: making architecture decisions, raising code quality, and mentoring engineers around you.
- Partnering with research and product teams to take ideas from experiment to a service customers depend on.
Requirements
What you’ll need- 5 to 10 years of relevant experience, including a proven track record as a technical lead who mentors others.
- Strong, current Python engineering, with production services built and shipped (FastAPI or similar).
- Genuine hands-on GenAI depth: LLMs, RAG, agentic or multi-agent workflows, anomaly detection, and fine-tuning (for example LoRA or PEFT).
- Real-time inference experience with NVIDIA Triton and TensorRT, with real attention to latency, throughput, and cost.
- A microservices mindset, with services built in Python and FastAPI, containerised and orchestrated for reliability.
- Solid grounding in Docker and Kubernetes, and large-scale distributed systems on a major cloud.
- Right to work in the UK. We welcome applications from all backgrounds and are committed to equal opportunity.
- Nice to have
- Experience with vLLM or other serving frameworks alongside Triton and TensorRT.
- Experience in security, networking, or other high-reliability domains.
- Big-data tooling (Spark, Databricks, Snowflake) and modern MLOps practice.
Benefits
Comp & perks- Fully remote working anywhere in the UK, built around delivery rather than presence.
- A clear path to grow into staff and principal-level technical influence.
- Full support from C-Serv across the hiring process and beyond, with full-cycle accountability.
- A values-led, woman-owned delivery partner built on empathy, integrity, collaboration, and growth.
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Generative AIPythonFastAPINVIDIA TritonTensorRTAnomaly DetectionFine-TuningDockerKubernetesBig-Data Tooling
Soft Skills
MentoringTechnical Leadership
