State Street

Data Scientist – Tech Lead

State Street

full-time

Posted on:

Location Type: Office

Location: Bangalore • 🇮🇳 India

Visit company website
AI Apply
Apply

Job Level

Senior

Tech Stack

AWSAzureCloudMicroservicesPyTorchSQLTensorflow

About the role

  • Lead the development and experimentation of GenAI capabilities across key initiatives like RAG-as-a-Service, Dynamic SQL as Service, Fine-Tuning Studio, Agent Marketplace and new upcoming projects
  • Design and evaluate solutions for tasks such as summarization, question-answering, classification, text-to-SQL, information extraction, and semantic search
  • Engineer high-quality prompts, retrieval pipelines, and evaluation datasets for enterprise use cases
  • Fine-tune LLMs (e.g., Mistral, LLaMA, Qwen, Gemma, Phi, Falcon) using LoRA/QLoRA, DPO, RLHF, and other continual learning techniques
  • Analyze and mitigate model risks such as hallucination, prompt injection, bias, and toxicity using standard RAI frameworks
  • Collaborate with engineering teams to productionize models and embed them in microservices and pipelines
  • Engage with business stakeholders to understand requirements and demonstrate model effectiveness through rapid prototyping and storytelling
  • Contribute to internal knowledge sharing via workshops, research papers, and training sessions

Requirements

  • 8+ years of experience building and deploying ML/AI solutions in production environments
  • Strong grasp of modern machine learning and NLP methods including transformers, retrieval-augmented generation (RAG), embedding models, and attention mechanisms
  • Hands-on experience with Hugging Face, LangChain/Lang Graph, PyTorch/TensorFlow, Weights & Biases, OpenAI/Anthropic APIs, and vector stores (e.g., Pinecone, FAISS, Chroma, CosmosDB)
  • Solid software engineering practices: version control (Git), CI/CD, testing, and working in containerized/cloud environments (Azure, AWS)
  • Familiarity with observability and evaluation frameworks (e.g., LangSmith, TruLens, ReLM, ELO rating, hallucination metrics)
  • M.S. or B.tech in Computer Science, Data Science, Applied Mathematics, or related field
  • Publications, patents, or open-source contributions in GenAI or NLP are a plus
Benefits
  • Inclusive development opportunities
  • Flexible work-life support
  • Paid volunteer days
  • Vibrant employee networks

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
machine learningnatural language processingtransformersretrieval-augmented generationembedding modelsattention mechanismsfine-tuningprompt engineeringmodel evaluationrisk analysis
Soft skills
collaborationcommunicationstorytellingknowledge sharingprototyping
Certifications
M.S. in Computer ScienceB.Tech in Computer ScienceM.S. in Data ScienceM.S. in Applied Mathematics