Flower Labs

Founding ML Engineer – Flower Frontier Model Team

Flower Labs

full-time

Posted on:

Location Type: Remote

Location: Remote • 🇬🇧 United Kingdom

Visit company website
AI Apply
Apply

Job Level

Senior

Tech Stack

Distributed SystemsDockerLinuxNode.jsPythonPyTorch

About the role

  • Join as one of the founding members of the Flower Frontier Model Team, a new group at Flower Labs charged with building category-defining models.
  • Build SOTA LLMs and foundation models within a small, high-impact team.
  • Design, implement and optimize core components across the full spectrum of stages relevant to frontier model building: data curation, evals, pre-training, post-training.
  • Collaborate on the debugging of training instabilities and related issues.
  • Devise surrounding infrastructure, tooling, monitoring, and observability for large-scale LLM development.

Requirements

  • Exceptional software engineering skills (Python, deep learning frameworks, testing, profiling, refactoring, reproducibility)
  • Expertise with modern ML training stacks: PyTorch, JAX or equivalent; experience implementing model architectures from scratch and working within libraries like DeepSpeed, Megatron or equivalent
  • Ability to tune, debug, and profile large-scale training runs
  • Hands-on experience working with large GPU clusters, including job orchestration, scheduling, multi-node runs, NCCL/RDMA issues, and GPU performance optimization
  • Ability to collaborate effectively with both research-oriented and engineering-oriented colleagues; comfortable turning research ideas into robust, maintainable implementations
  • Good engineering hygiene: modular design, code reviews, documentation, reproducibility, versioning of data/models/configurations
  • Familiarity with common tools (Linux command line, git, Docker, …)
  • Openness to adopting new tooling
  • Solid understanding of distributed systems and networking
  • Strong written English
  • Open, honest and transparent communication skills
Benefits
  • Professional development opportunities
  • Flexible working hours

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
Pythondeep learning frameworksPyTorchJAXDeepSpeedMegatronGPU performance optimizationdata curationmodel architectureslarge-scale training
Soft skills
collaborationcommunicationengineering hygienemodular designcode reviewsdocumentationreproducibilityversioningopenness to new toolingstrong written English