FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.
About the role
Key responsibilities & impact- Evaluate and deploy open-source LLMs via vLLM, Ollama, or TGI on GPU infrastructure
- Design and maintain a REST API with auth, rate limiting, versioning, and API documentation
- Optimize inference: quantization, batching, and concurrency tuning for latency / cost trade-offs
- Own observability - latency, throughput, GPU utilization, and cost-per-call dashboards
- Integrate AI features into our web apps and document processing platforms
- Build AI-powered UI components: chat interfaces, inline suggestions, smart search, and summarization widgets
- Collaborate with frontend engineers on UX patterns specific to async and streaming AI responses
- Manage API keys, usage metering, and per-user AI feature flags on the client layer
- Design multi-step orchestration pipelines for document ingestion, parsing, classification, and extraction
- Integrate OCR, layout detection, and chunking strategies for PDFs, Word docs, spreadsheets, and scanned images
- Build RAG pipelines with hybrid retrieval (dense + sparse) over document corpora
- Design and optimize prompts for extraction, summarization, classification, and conversational AI workflows
- Implement task queues and async workers to handle high-volume document processing reliably
- Define evals and quality benchmarks for extraction accuracy, classification, and generation quality
- Write technical docs, so the platform is maintainable by the whole team and accessible for technology partners
- Partner with product, data science, and DevOps/MLOps to ship AI features reliably and at scale
- Stay current on the open-source model ecosystem and advise on model selection and upgrades.
Requirements
What you’ll need- 3+ years of software engineering experience spanning backend, API design, and web technologies
- Proven experience deploying or integrating LLMs in production - open-source model experience strongly preferred
- Hands-on knowledge of REST API design principles and at least one API framework
- Experience building or integrating AI features in a web app, including streaming/async response handling
- Familiarity with document processing workflows - parsing, OCR, extraction, or classification at scale
- Comfort designing multi-step orchestration pipelines with task queues, retries, and failure handling
- Strong communication skills; able to reason about trade-offs across latency, cost, and accuracy with stakeholders.
Benefits
Comp & perks- A fair compensation for your value
- Bonus program, paid vacation leave and competitive corporate benefits
- We are committed to maximizing your potential and ensuring your professional development
- Interaction with local and international teams
- A friendly and collaborative work environment, where authenticity and well-being are a priority
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
open-source LLMsvLLMOllamaTGIREST APIquantizationbatchingconcurrency tuningOCRRAG pipelines
Soft Skills
strong communication skillscollaborationreasoning about trade-offs
