unstructured.io

AI Engineer – Public Sector

full-time

Location Type: Remote

Location: United States

About the role

  • Own the lifecycle of AI solutions from initial research to AWS deployment.
  • Design and implement production-grade RAG pipelines and agentic workflows using Python.
  • Build systems that handle real-world "messy" data (PDFs, scanned documents, images, full-motion video) and ensure they are performant and scalable.
  • Evaluate new models (LLMs, embedding models, object detection) and run experiments to prove what actually works.
  • Partner with the team to document architectures, contribute to technical reports for contract deliverables, and participate in pre-sales calls to architect solutions for complex client needs.

Requirements

  • Proven experience deploying production RAG pipelines against real-world, messy datasets.
  • Deep expertise in agentic system design (tool use, multi-agent orchestration).
  • Strong Python engineering skills, including writing clean, scalable, and maintainable code.
  • Experience operating within AWS/GovCloud environments.
  • Experience fine-tuning NLP or object detection models (nice to have).
  • Familiarity with LLM evaluation frameworks (hallucination detection, drift monitoring).
  • Knowledge of government security standards and experience working across different classification environments, including on-premises deployments.
  • Existing Secret/TS clearance or eligibility is a significant plus.

Benefits

  • Opportunity to join a dynamic team working on cutting-edge machine learning projects.
  • Collaborative and innovative work environment with a focus on learning and growth.
  • Impactful role in shaping the company's direction and driving innovation in unstructured data processing.
  • Competitive compensation package, including benefits and stock options.

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
Python, RAG pipelines, NLP models, object detection models, multi-agent orchestration, data handling, scalable systems, model evaluation, clean code, maintainable code
Soft Skills
documentation, technical reporting, collaboration, communication
Certifications
Secret clearance, TS clearance