
Member of Technical Staff – ML Engineering
Latent Labs
full-time
Posted on:
Location Type: Hybrid
Location: London • United Kingdom
Visit company websiteExplore more
Job Level
About the role
- Deploy, maintain, and optimize production and research compute clusters
- Design and implement scalable and efficient ML inference solutions
- Develop dynamic / heterogeneous compute solutions for balancing research and production needs
- Contribute to productizing model APIs for external use
- Develop infrastructure observability and monitoring solutions
Requirements
- Deep experience with Kubernetes and containerized workflows
- Experience with major cloud platforms (AWS, GCP, Azure)
- Knowledge of DevOps and related tools (Terraform, etc)
- Knowledge of HPC frameworks (Slurm, Ray, etc)
- Production engineering & reliability experience
- PyTorch & distributed computing experience
Benefits
- Private health insurance
- Pension/401(K) contributions
- Generous leave policies (including gender neutral parental leave)
- Hybrid working
- Travel opportunities and more
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Kubernetescontainerized workflowsAWSGCPAzureTerraformHPC frameworksSlurmRayPyTorch
Soft Skills
production engineeringreliability