Salary
💰 $50 - $125 per hour
About the role
- Build advanced datasets and evaluation tools to improve LLM interactions with real-world software repositories
- Work on diverse projects improving LLM reasoning across software engineering tasks
- Deliver end-to-end use cases (automation agents, coding copilots, creative assistants)
- Review and rank multiple model-generated code responses per task
- Evaluate code diffs for correctness, maintainability, style, and efficiency
- Provide structured feedback and clear rationales for evaluation decisions
- Collaborate with AI researchers and engineers on frontier challenges
Requirements
- 5–13 years overall software engineering experience
- 4+ years of hands-on Rust development experience
- Prior full-time experience at a top-tier tech company in the US, Canada, Australia, or Western Europe (contractor/part-time experience not eligible)
- Proven expertise in full-stack development, software architecture, debugging, and code review practices
- Strong ability to assess and evaluate code diffs for quality and efficiency
- LinkedIn profile link (must be a verified account)
- Eligibility: citizens/GC/valid work permit holders in the US, Canada, Australia, or Western Europe; H-1B not eligible
- Partial Pacific Standard Time overlap required
- Flexible — minimum 10 hrs/week, up to 40 hrs/week (with partial PST overlap required)
- Remote work (Remote — US, Canada, Australia, Western Europe)
- Contract duration: 1 month (extensions possible)
- Contractor (independent, no benefits; no medical/paid leave)
ATS Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
Rustfull-stack developmentsoftware architecturedebuggingcode reviewLLM interactionsevaluation toolsautomation agentscoding copilotscreative assistants
Soft skills
collaborationstructured feedbackclear rationalesevaluation decisions