About the role
Key responsibilities & impact
- Conduct fundamental LLM research using our SOTA story engine.
- Create a benchmark for evaluating LLM behavior by defining tasks, success criteria, and auxiliary metrics within our story engine.
- Deliver a benchmark library for performing evaluations on SOTA LLMs and a written report of your compiled results for posting to our blog and potential publication.
Requirements
What you’ll need
- Current grad student in computer science or similar
- Previous publications or research projects creating LLM benchmarks
- Demonstrated software engineering experience via open-source projects or employment
- Evidence of written communication skills via previous reports or publications
Benefits
Comp & perks
- Medical, dental, and vision coverage, with the company paying 100% of premiums for employees and eligible dependents (subject to plan terms and eligibility; benefits vary by location).
- Company-provided laptop (Mac by default) and the tools you need to do your best work.
- Flexible PTO, encouraged and supported. Many teammates take around 2 days per month plus additional time each quarter, depending on role and team planning.
- Access to the latest AI models to accelerate your work.
- $100/month Learning Fund for books, courses, coaching, conferences, and video games.
