
Software Engineer, Evaluation Frontend
Cohere
full-time
Location Type: Remote
Location: Remote • 🇬🇧 United Kingdom
Job Level
Mid-Level, Senior
Tech Stack
Python, PyTorch, TensorFlow
About the role
- Design tools and visualizations that enable researchers and engineers to compare and analyse hundreds of model evaluations. This includes both data visualization tools and statistical tools for extracting signal from noisy evaluation results.
- Develop an understanding of the relative merits and limitations of each of our model evaluations, and suggest new facets of model evaluation.
Requirements
- Extremely strong software engineering skills.
- Strong statistical skills and experience evaluating scientific experiments related to data collection and model performance.
- Prior experience building front-end visualization systems and dashboards.
- Familiarity with ML systems evaluations.
- Proficiency in programming languages such as Python and in ML frameworks (e.g., PyTorch, TensorFlow, JAX).
- Excellent communication skills to collaborate effectively with cross-functional teams and present findings.
- One or more papers at top-tier venues (such as NeurIPS, ICML, ICLR, AISTATS, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP).
Benefits
- An open and inclusive culture and work environment
- Work closely with a team on the cutting edge of AI research
- Weekly lunch stipend, in-office lunches & snacks
- Full health and dental benefits, including a separate budget to take care of your mental health
- 100% Parental Leave top-up for up to 6 months
- Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
- Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
- 6 weeks of vacation (30 working days!)
Applicant Tracking System Keywords
Hard skills
software engineering, statistical skills, data visualization, model evaluation, Python, PyTorch, TensorFlow, JAX, front-end visualization systems, dashboards
Soft skills
communication, collaboration