FREE ACCESS
5,000–10,000 jobs/day

See all jobs on JobTailor
Search thousands of fresh jobs every day.
Discover
- Fresh listings
- Fast filters
- No subscription required
Create a free account and start exploring right away.
Tech Stack
Tools & technologiesAWSCloudDockerPythonSQL
About the role
Key responsibilities & impact- Ship the hardest implementation work yourself — the human-in-the-loop routing, the public/private gateway access controls, the early agent harnesses
- Design and implement the human-in-the-loop routing system: queue mechanics, reviewer assignment, back-pressure handling, run resumption semantics
- Implement the execution wrapper that enforces human-in-the-loop polices at execution time
- Build the safeguards — refusal policies, prompt-injection protections, public/private MCP exposure controls — that make our agents safe to deploy at scale
- Review PRs (human- and code-agent-authored) at a depth that builds shared judgment about what good agent code looks like
- Mentor engineers through hard implementation problems; close gaps in the team's shared knowledge
Requirements
What you’ll need- 6+ years of professional Python with deep production experience operating services, not just shipping them
- 2+ years operating LLM systems in production: prompt/context engineering, tool/function calling, structured outputs, RAG, evaluation, observability
- Demonstrated experience implementing oversight mechanisms — human-in-the-loop routing, refusal policies, autonomy boundaries — in systems where the cost of an agent error is real
- Strong written communication: you'll be authoring implementation specs that other engineers (and code agents) build against, and the spec is the work
- Extensive knowledge of LangChain/LangGraph — or a comparable framework like AgentCore Strands, CrewAI, or Semantic Kernel — and a clear view of when to use which
- Experience with LLM observability tools: Amazon CloudWatch, LangSmith, Langfuse, MLflow, or OpenTelemetry
- Experience designing evaluation frameworks (RAGAS, DeepEval, LLM-as-judge, multi-turn regression)
- Solid SQL, fluency with at least one cloud platform (AWS preferred), Git, Docker, and modern API frameworks
- A hands-on disposition — you want to ship the hard parts yourself, not just write specs about them
- Experience reviewing code authored by junior engineers, contractors, or AI agents — and giving feedback that produces better code next time
- A considered view on the failure modes of overusing AI — cognitive offloading, organizational skill loss, agent-mediated drift in decision-making — and the conviction to design against them
Benefits
Comp & perks- highly competitive benefits
- comprehensive health coverage
- well-being perks
- flexible time off
- prioritizing work-life balance
ATS Keywords
✓ Tailor your resumeApplicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
PythonLLM systemsprompt engineeringtool callingstructured outputsRAGSQLGitDockerAPI frameworks
Soft Skills
written communicationmentoringteam collaborationproblem-solvingfeedback delivery
