
AI Architect
Expleo Group
full-time
Posted on:
Location Type: Hybrid
Location: Pune • 🇮🇳 India
Visit company websiteJob Level
SeniorLead
Tech Stack
AWSCloudDockerGoogle Cloud PlatformJenkinsKubernetesMicroservicesPythonTerraform
About the role
- Architect multi-agent ecosystems and tool orchestration frameworks using LangChain, OpenAI Agents SDK, or Google ADK.
- Design system-level LLMOps and MLOps frameworks (model lifecycle, retraining, observability, feedback loops).
- Establish CI/CD, monitoring, and scalability strategies for AI microservices.
- Define model optimization pipelines (quantization, distillation, pruning, caching).
- Integrate MLflow and Vertex AI for experiment tracking and production deployment.
- Drive cloud infrastructure strategy — compute scaling, GPU optimization, container orchestration.
- Ensure AI safety, interpretability, and compliance across models and agents.
Requirements
- BTech, BE, MCA
- Architecture Expertise: Agentic systems, reasoning pipelines, and orchestration layers.
- MLOps / LLMOps: MLflow, Vertex AI Pipelines, Weights & Biases, model monitoring.
- DevOps Infrastructure: Kubernetes, Docker, Terraform, Jenkins, GCP/AWS CI/CD.
- Optimization Techniques: Finetuning (LoRA, QLoRA), model quantization, distillation, caching.
- System Design: Scalable APIs, message queues, observability, and fault tolerance.
- Programming: Python
- Strong understanding of AI safety, governance, and trust frameworks.
- Experience implementing MCP, multi-agent orchestration, or custom reasoning layers.
- Proven success in leading enterprise-scale AI transformation initiatives.
Benefits
- Health insurance
- 401(k) matching
- Flexible work hours
- Paid time off
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
PythonMLOpsLLMOpsmodel optimizationquantizationdistillationcachingCI/CDcloud infrastructureAI safety
Soft skills
leadershipcommunicationorganizational
Certifications
BTechBEMCA