
Applied Scientist/Research Engineer
Mistral AI
full-time
Posted on:
Location Type: Office
Location: Palo Alto • California, New York • 🇺🇸 United States
Visit company websiteJob Level
Mid-LevelSenior
Tech Stack
Open SourcePythonPyTorch
About the role
- Develop SOTA models across different modalities (text, image, speech) and apply them across diverse use cases and domains
- Run pre-training, post-training and deploy state-of-the-art models on clusters with thousands of GPUs
- Generate and curate data for pre-training and post-training and work on model evaluations to ensure performance exceeds expectations
- Develop tools and frameworks to facilitate data generation, model training, evaluation and deployment
- Collaborate with cross-functional teams to tackle complex use cases using agents and RAG pipelines
- Manage research projects and communications with client research teams
- Deliver high-impact AI solutions by working with external and internal science, engineering, and product teams
Requirements
- Fluent in English with excellent communication skills
- Expert with PyTorch or JAX
- Proficient in writing clean, readable, high-performance, fault-tolerant Python code
- Comfortable contributing to and navigating a large codebase independently
- Experience running pre-training/post-training and deploying models on large GPU clusters (handling OOM, NCCL issues)
- Experience generating and curating data for pre-training and post-training, and performing evaluations
- Ability to develop tools and frameworks for data generation, model training, evaluation and deployment
- Experience collaborating with cross-functional teams and managing client research communications
- Track record of success through personal projects, professional projects or academia
- Preferred: PhD or Master in Mathematics, Physics, Machine Learning, Computer Science & Engineering (but exceptional candidates from other backgrounds encouraged to apply)
- Nice-to-have: research experience in agents, multi-modality, robotics, diffusion, time-series
- Nice-to-have: contributions to a large codebase (open source or industry)
- Nice-to-have: publications in top academic journals or conferences
- Nice-to-have: experience improving code quality (fixing typing issues, adding tests, improving CI pipelines)
Benefits
- 💰 Competitive salary and bonus structure
- 🚀 Generous Equity
- 🧑⚕️ Health : Competitive Healthcare program (Medical Provider: Blueshield of California 100% coverage for employee, 75% for dependents)
- 👴🏻 Pension : 401K (6% matching)
- 🏝️ PTO : 18 days
- 🚗 Transportation: Reimburse office parking charges, or $120/month for public transport
- 🤝 Coaching: Betterup coaching on a voluntary basis
- 🏀 Sport: $120/month reimbursement for gym membership
- 🥕 Meal stipend: $400 monthly allowance for meals
- 🌎 Visa sponsorship
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
PyTorchJAXPythonmodel evaluationdata generationpre-trainingpost-trainingGPU clustersfault-tolerant codinghigh-performance coding
Soft skills
communication skillscollaborationproject managementindependent workproblem-solvingcross-functional teamworkclient communicationadaptabilityleadershipcreativity
Certifications
PhDMaster in MathematicsMaster in PhysicsMaster in Machine LearningMaster in Computer ScienceMaster in Engineering