
Senior AI Engineer
FourKites, Inc.
full-time
Posted on:
Location Type: Remote
Location: India
Visit company websiteExplore more
Job Level
About the role
- Design and implement production-scale AI agent systems and orchestration frameworks (LangGraph, LangChain, similar architectures)
- Lead architecture for multi-agent systems handling complex business workflows
- Optimize deployment strategies using both LLMs and SLMs based on use case requirements
- Build natural language-configurable business process automation frameworks
- Implement multi-modal AI systems for document understanding (tables, charts, layouts)
- Deploy and optimize LLMs/SLMs in production with fine-tuning techniques (LoRA, QLoRA, DPO)
- Implement quantization strategies (INT8, INT4) and model distillation for edge deployment
- Build evaluation frameworks including LLM-as-judge systems and regression testing
- Design streaming architectures for real-time LLM responses (SSE, WebSockets)
- Create semantic caching and embedding-based retrieval systems
- Develop GraphRAG and long-context handling strategies (100k+ tokens)
- Design scalable microservices with comprehensive observability (LangSmith, Arize, custom telemetry)
- Build secure multi-tenant systems with prompt injection prevention and output validation
- Implement cost optimization through intelligent model routing and fallback strategies
- Develop document processing pipelines with OCR and layout understanding
- Create event-driven architectures for real-time shipment tracking and exception handling
- Build data pipelines for training data curation, synthetic generation, and PII masking
- Implement RLHF/RLAIF feedback loops for continuous improvement
- Design experiment tracking and model registry systems (MLflow, DVC)
- Optimize inference costs through batch processing and spot instance utilization
- Establish model governance, audit trails, and compliance frameworks
Requirements
- 4+ years software engineering, 2+ years in production AI/ML systems
- Expertise in Python, PyTorch/JAX, and AI frameworks (LangChain, Transformers, PEFT)
- Experience with LLMs (GPT-4, Claude, Gemini) and SLMs (Phi, Llama, Mistral)
- Hands-on experience with:
- Fine-tuning techniques (LoRA, QLoRA, DPO, RLHF)
- Model optimization (quantization, distillation, pruning)
- Vector databases and RAG architectures
- Streaming systems and real-time processing
- Security measures (prompt injection prevention, jailbreak detection)
- Strong background in distributed systems, Kubernetes, and cloud platforms
Benefits
- Medical benefits start on first day of employment
- 36 PTO days( Sick, Casual and Earned), 5 recharge days, 2 volunteer days
- Home Office set ups and Technology reimbursement
- Lifestyle & Family benefits
- Mental Wellness support and guidance
- Ongoing learning & development opportunities (Professional development program)
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PythonPyTorchJAXLangChainTransformersPEFTLLMsSLMsfine-tuning techniquesmodel optimization
Soft Skills
leadershipcommunicationorganizational