
Machine Learning Engineer – AI Platform Lead
Artera.net
full-time
Location Type: Remote
Location: United States
Salary
💰 $180,000 - $220,000 per year
About the role
- Develop the long-term vision and roadmap for Artera’s AI platform so that the company can continue to scale both inference volume and development workloads.
- Own Artera’s ML compute infrastructure, including scaling up Artera’s Foundation Model development by building distributed training infrastructure and developer libraries.
- Build and evolve the core libraries used by AI scientists to develop, launch, and monitor AI products.
- Work with model developers to optimize GPU and CPU efficiency and data throughput of large-scale foundation models and downstream model training runs.
- Optimize Artera’s ability to store and serve terabytes of digital pathology data efficiently for use in large-scale training runs.
- Ensure that Artera’s observability infrastructure provides a clear picture of how to continue to optimize performance across our model landscape.
Requirements
- 8+ years of industry software engineering experience
- 4+ years of industry experience using ML orchestration frameworks such as Flyte, Ray, Kubeflow, Metaflow, MLflow, Dagster, Argo Workflows, or Prefect
- 4+ years of industry experience using one of PyTorch, TensorFlow, or JAX in Python
- 3+ years of industry experience building with AWS, Docker, and Kubernetes
- 1+ years of industry experience optimizing large-scale, high data-throughput, distributed machine learning training pipelines
- Experience using Terraform and SQLAlchemy
- Experience in multi-node and multi-GPU training
- Experience deploying and maintaining infrastructure for machine learning training and production inference
- Familiarity with TorchScript, ONNX Runtime, DeepSpeed, AWS Neuron, or similar approaches to inference optimization
- Work authorization requirement: Candidates must be authorized to work in the US or Canada without visa sponsorship
Benefits
- 401(k) matching
- Unlimited paid time off (PTO)
- Equity is a core component of our compensation
Applicant Tracking System Keywords
Hard Skills & Tools
machine learning, distributed training, GPU optimization, CPU optimization, data throughput, inference optimization, multi-node training, multi-GPU training, Python, SQL