Mistral AI

Applied Scientist/Research Engineer

full-time

Location Type: Office

Location: Palo Alto • California, New York • 🇺🇸 United States

Job Level

Mid-Level, Senior

Tech Stack

Open Source, Python, PyTorch

About the role

  • Develop SOTA models across different modalities (text, image, speech) and apply them across diverse use cases and domains
  • Run pre-training and post-training, and deploy state-of-the-art models, on clusters with thousands of GPUs
  • Generate and curate data for pre-training and post-training and work on model evaluations to ensure performance exceeds expectations
  • Develop tools and frameworks to facilitate data generation, model training, evaluation and deployment
  • Collaborate with cross-functional teams to tackle complex use cases using agents and RAG pipelines
  • Manage research projects and communications with client research teams
  • Deliver high-impact AI solutions by working with external and internal science, engineering, and product teams

Requirements

  • Fluent in English with excellent communication skills
  • Expert with PyTorch or JAX
  • Proficient in writing clean, readable, high-performance, fault-tolerant Python code
  • Comfortable contributing to and navigating a large codebase independently
  • Experience running pre-training/post-training and deploying models on large GPU clusters (handling OOM, NCCL issues)
  • Experience generating and curating data for pre-training and post-training, and performing evaluations
  • Ability to develop tools and frameworks for data generation, model training, evaluation and deployment
  • Experience collaborating with cross-functional teams and managing client research communications
  • Track record of success through personal projects, professional projects or academia
  • Preferred: PhD or Master's degree in Mathematics, Physics, Machine Learning, or Computer Science & Engineering (exceptional candidates from other backgrounds are encouraged to apply)
  • Nice-to-have: research experience in agents, multi-modality, robotics, diffusion, time-series
  • Nice-to-have: contributions to a large codebase (open source or industry)
  • Nice-to-have: publications in top academic journals or conferences
  • Nice-to-have: experience improving code quality (fixing typing issues, adding tests, improving CI pipelines)

Benefits

  • 💰 Competitive salary and bonus structure
  • 🚀 Generous Equity
  • 🧑‍⚕️ Health: Competitive healthcare program (medical provider: Blue Shield of California; 100% coverage for employee, 75% for dependents)
  • 👴🏻 Pension: 401(k) (6% matching)
  • 🏝️ PTO: 18 days
  • 🚗 Transportation: Reimbursement of office parking charges, or $120/month for public transport
  • 🤝 Coaching: BetterUp coaching on a voluntary basis
  • 🏀 Sport: $120/month reimbursement for gym membership
  • 🥕 Meal stipend: $400 monthly allowance for meals
  • 🌎 Visa sponsorship

Applicant Tracking System Keywords

Hard skills
PyTorch, JAX, Python, model evaluation, data generation, pre-training, post-training, GPU clusters, fault-tolerant coding, high-performance coding
Soft skills
communication skills, collaboration, project management, independent work, problem-solving, cross-functional teamwork, client communication, adaptability, leadership, creativity
Certifications
PhD, Master in Mathematics, Master in Physics, Master in Machine Learning, Master in Computer Science, Master in Engineering