EXL

Data Scientist

EXL

full-time

Posted on:

Location Type: Hybrid

Location: PuneIndia

Visit company website

Explore more

AI Apply
Apply

About the role

  • Design and develop data models and analytical solutions.
  • Optimize data analysis processes and methodologies.
  • Ensure data solutions meet business and technical requirements.
  • Provide technical support for data science projects.
  • Collaborate with stakeholders to address data science needs.
  • Develop and maintain advanced Python-based applications in the Generative AI domain, ensuring high performance, reliability, and scalability.
  • Implement and optimize Generative AI models, including GPT, LLAMA, Mistral, FLAN T5 and other cutting-edge AI technologies, to create innovative solutions and knowledge graph.
  • Development of advanced RAG pipelines with proper embeddings, indexing, chunking, reranking, prompts and evaluation
  • Collaborate with cross-functional teams to integrate AI functionalities into broader systems and applications.
  • Utilize AWS/Azure/Databricks GPU machines to manage GPU memory effectively, maximizing performance and efficiency.
  • Stay updated on the latest advancements in Generative AI, Python development practices, and cloud services to continually enhance our AI capabilities.
  • Assist delivery leads in delivering Generative AI solutions to clients in a timely manner, ensuring client satisfaction and project success.

Requirements

  • 4+ years of experience as a NLP and Python developer.
  • Experience with Pandas, NumPy, Scikit, NLP a must have
  • Key fundamentals in object-oriented design, data structures and systems.
  • Ability to integrate multiple data sources into a single system.
  • Familiarity with testing tools.
  • Ability to collaborate on projects and work independently when required.
  • Working knowledge of GitHub and Jira
  • Ability to document requirements and specifications.
  • Bachelor's or Master's degree in a quantitative field (CS, machine learning, mathematics, statistics) or equivalent experience.
  • 4+ years of experience in data science, building hands-on ML models.
  • Candidate must be aware of entire evolution history of NLP (Traditional Language Models to Modern Large Language Models), training data creation, training set-up and finetuning
  • Knowledge of advanced RAG pipelines with proper embeddings, indexing, chunking, reranking, prompts and evaluation
  • Excellent programming skills in Python. Strong working knowledge of Pythons numerical, data analysis, or AI frameworks such as NumPy, Pandas, Scikit-learn, Jupyter, etc
  • SQL skills with SQL Server and Spark experience is preferred but not necessary.
  • Knowledge of predictive/prescriptive analytics including Machine Learning algorithms (Supervised and Unsupervised) and deep learning algorithms and Artificial Neural Networks
  • Experience with Natural Language Processing (NLTK) and text analytics for information extraction, parsing and topic modeling.
  • Excellent verbal and written communication. Strong troubleshooting and problem-solving skills. Thrive in a fast-paced, innovative environment
  • Experience with cloud platforms such as Azure, AWS, Databricks is preferred.
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
PythonNatural Language ProcessingMachine LearningGenerative AIRAG pipelinesSQLPandasNumPyScikit-learnDeep Learning
Soft Skills
collaborationproblem-solvingcommunicationindependencetroubleshootingclient satisfactionproject successadaptabilityinnovationdocumentation
Certifications
Bachelor's degreeMaster's degree