Tech Stack
AWSDockerGoogle Cloud PlatformKubernetesPython
About the role
- Build multi-step, tool-using agents that plan, retrieve, act, and verify.
- Design agent architectures, integrate tools/APIs, and deliver robust execution with safety and observability.
- Architect agent loops (planning, memory, retrieval, tool use, self-verification).
- Implement tools (functions/APIs/DB queries/files) and MCP-style interfaces.
- Combine RAG with agents: chunking, embeddings, retrieval, reranking, grounding.
- Add guardrails: execution sandboxes, permissions, rate limits, PII policies.
- Build evaluation for agents (task success, autonomy depth, recovery rate).
- Optimize for reliability, determinism where needed, and cost.
Requirements
- Production experience with LangChain/CrewAI (or similar) and function/tool calling.
- Strong Python engineering; async patterns; robust error handling.
- Practical RAG skills: vector DBs, indexing pipelines, and retrieval quality tuning.
- Systems thinking: queues, retries, idempotency, caching, concurrency control.
- Security & safety for agents (prompt injection, tool scoping, least privilege).
- Nice-to-Haves
- GCP/AWS, Docker/Kubernetes; message buses/streams.
- LLM evaluation frameworks; synthetic data generation.
- Graph-based planning, constraint solvers, or program-of-thought techniques.
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
PythonLangChainCrewAIRAGvector DBsindexing pipelineserror handlingasync patternsgraph-based planningconstraint solvers
Soft skills
systems thinkingreliabilitydeterminismcost optimization