
Founding ML Engineer – Flower Frontier Model Team
Flower Labs
full-time
Location Type: Remote
Location: Remote • 🇬🇧 United Kingdom
Job Level
Senior
Tech Stack
Distributed Systems, Docker, Linux, Node.js, Python, PyTorch
About the role
- Join as one of the founding members of the Flower Frontier Model Team, a new group at Flower Labs charged with building category-defining models.
- Build SOTA LLMs and foundation models within a small, high-impact team.
- Design, implement and optimize core components across the full spectrum of stages relevant to frontier model building: data curation, evals, pre-training, post-training.
- Collaborate on the debugging of training instabilities and related issues.
- Devise surrounding infrastructure, tooling, monitoring, and observability for large-scale LLM development.
Requirements
- Exceptional software engineering skills (Python, deep learning frameworks, testing, profiling, refactoring, reproducibility)
- Expertise with modern ML training stacks: PyTorch, JAX or equivalent; experience implementing model architectures from scratch and working within libraries like DeepSpeed, Megatron or equivalent
- Ability to tune, debug, and profile large-scale training runs
- Hands-on experience working with large GPU clusters, including job orchestration, scheduling, multi-node runs, NCCL/RDMA issues, and GPU performance optimization
- Ability to collaborate effectively with both research-oriented and engineering-oriented colleagues; comfortable turning research ideas into robust, maintainable implementations
- Good engineering hygiene: modular design, code reviews, documentation, reproducibility, versioning of data/models/configurations
- Familiarity with common tools (Linux command line, git, Docker, …)
- Openness to adopting new tooling
- Solid understanding of distributed systems and networking
- Strong written English
- Open, honest and transparent communication skills
Benefits
- Professional development opportunities
- Flexible working hours
Applicant Tracking System Keywords
Hard skills
Python, deep learning frameworks, PyTorch, JAX, DeepSpeed, Megatron, GPU performance optimization, data curation, model architectures, large-scale training
Soft skills
collaboration, communication, engineering hygiene, modular design, code reviews, documentation, reproducibility, versioning, openness to new tooling, strong written English