Mistral AI

Applied Scientist/Research Engineer

full-time

Location Type: Office

Location: Palo Alto • California, New York • 🇺🇸 United States

Job Level

Mid-Level, Senior

Tech Stack

Open Source, Python, PyTorch

About the role

  • Develop SOTA models across different modalities (text, image, speech) and apply them across diverse use cases and domains
  • Run pre-training and post-training, and deploy state-of-the-art models on clusters with thousands of GPUs
  • Generate and curate data for pre-training and post-training and work on model evaluations to ensure performance exceeds expectations
  • Develop tools and frameworks to facilitate data generation, model training, evaluation and deployment
  • Collaborate with cross-functional teams to tackle complex use cases using agents and RAG pipelines
  • Manage research projects and communications with client research teams
  • Deliver high-impact AI solutions by working with external and internal science, engineering, and product teams

Requirements

  • Fluent in English with excellent communication skills
  • Expert in PyTorch or JAX
  • Proficient in writing clean, readable, high-performance, fault-tolerant Python code
  • Comfortable contributing to and navigating a large codebase independently
  • Experience running pre-training/post-training and deploying models on large GPU clusters (handling OOM, NCCL issues)
  • Experience generating and curating data for pre-training and post-training, and performing evaluations
  • Ability to develop tools and frameworks for data generation, model training, evaluation and deployment
  • Experience collaborating with cross-functional teams and managing client research communications
  • Track record of success through personal projects, professional projects or academia
  • Preferred: PhD or Master's in Mathematics, Physics, Machine Learning, or Computer Science & Engineering (exceptional candidates from other backgrounds are encouraged to apply)
  • Nice-to-have: research experience in agents, multi-modality, robotics, diffusion, time-series
  • Nice-to-have: contributions to a large codebase (open source or industry)
  • Nice-to-have: publications in top academic journals or conferences
  • Nice-to-have: experience improving code quality (fixing typing issues, adding tests, improving CI pipelines)

Benefits

  • 💰 Competitive salary and bonus structure
  • 🚀 Generous Equity
  • 🧑‍⚕️ Health: Competitive healthcare program (medical provider: Blue Shield of California; 100% coverage for employees, 75% for dependents)
  • 👴🏻 Pension: 401(k) (6% matching)
  • 🏝️ PTO: 18 days
  • 🚗 Transportation: Reimbursement of office parking charges, or $120/month for public transport
  • 🤝 Coaching: BetterUp coaching on a voluntary basis
  • 🏀 Sport: $120/month reimbursement for gym membership
  • 🥕 Meal stipend: $400 monthly allowance for meals
  • 🌎 Visa sponsorship

ATS Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
PyTorch, JAX, Python, model evaluation, data generation, pre-training, post-training, GPU clusters, fault-tolerant coding, high-performance coding
Soft skills
communication skills, collaboration, project management, independent work, problem-solving, cross-functional teamwork, client communication, adaptability, leadership, creativity
Certifications
PhD, Master in Mathematics, Master in Physics, Master in Machine Learning, Master in Computer Science, Master in Engineering