
Data Scientist – Tech Lead
State Street
full-time
Posted on:
Location Type: Office
Location: Bangalore • 🇮🇳 India
Visit company websiteJob Level
Senior
Tech Stack
AWSAzureCloudMicroservicesPyTorchSQLTensorflow
About the role
- Lead the development and experimentation of GenAI capabilities across key initiatives like RAG-as-a-Service, Dynamic SQL as Service, Fine-Tuning Studio, Agent Marketplace and new upcoming projects
- Design and evaluate solutions for tasks such as summarization, question-answering, classification, text-to-SQL, information extraction, and semantic search
- Engineer high-quality prompts, retrieval pipelines, and evaluation datasets for enterprise use cases
- Fine-tune LLMs (e.g., Mistral, LLaMA, Qwen, Gemma, Phi, Falcon) using LoRA/QLoRA, DPO, RLHF, and other continual learning techniques
- Analyze and mitigate model risks such as hallucination, prompt injection, bias, and toxicity using standard RAI frameworks
- Collaborate with engineering teams to productionize models and embed them in microservices and pipelines
- Engage with business stakeholders to understand requirements and demonstrate model effectiveness through rapid prototyping and storytelling
- Contribute to internal knowledge sharing via workshops, research papers, and training sessions
Requirements
- 8+ years of experience building and deploying ML/AI solutions in production environments
- Strong grasp of modern machine learning and NLP methods including transformers, retrieval-augmented generation (RAG), embedding models, and attention mechanisms
- Hands-on experience with Hugging Face, LangChain/Lang Graph, PyTorch/TensorFlow, Weights & Biases, OpenAI/Anthropic APIs, and vector stores (e.g., Pinecone, FAISS, Chroma, CosmosDB)
- Solid software engineering practices: version control (Git), CI/CD, testing, and working in containerized/cloud environments (Azure, AWS)
- Familiarity with observability and evaluation frameworks (e.g., LangSmith, TruLens, ReLM, ELO rating, hallucination metrics)
- M.S. or B.tech in Computer Science, Data Science, Applied Mathematics, or related field
- Publications, patents, or open-source contributions in GenAI or NLP are a plus
Benefits
- Inclusive development opportunities
- Flexible work-life support
- Paid volunteer days
- Vibrant employee networks
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
machine learningnatural language processingtransformersretrieval-augmented generationembedding modelsattention mechanismsfine-tuningprompt engineeringmodel evaluationrisk analysis
Soft skills
collaborationcommunicationstorytellingknowledge sharingprototyping
Certifications
M.S. in Computer ScienceB.Tech in Computer ScienceM.S. in Data ScienceM.S. in Applied Mathematics