AI Engineer

ellowtech

contract

Posted on: 2/17/2026

Location Type: Remote

Location: India

✨ AI Apply

About the role

Design and build end-to-end pipelines for processing long-form, OCR-heavy documents
Own PDF ingestion, layout-aware parsing, and multi-page document assembly
Implement robust chunking, segmentation, and metadata tracking across long documents
Handle exception detection, retries, and deterministic failure handling
Optimize systems to reliably process 200+ page documents at scale
Work with OCR engines (Tesseract, PaddleOCR, layout-aware models, vision-language models)
Build layout-aware extraction systems using bounding boxes and structural metadata
Implement deterministic schema validation and cross-field consistency checks
Reduce reliance on manual QA through rule-based validation layers
Deploy and operate open-source LLMs using: vLLM Hugging Face TGI
Monitor and optimize GPU utilization and cost per request
Design validation layers outside the LLM
Create measurable correctness guarantees for high-stakes use cases
Collaborate with cross-functional teams to ship production-grade AI systems

6+ years of hands-on Python engineering
Proven production experience building OCR-driven document pipelines
Experience handling long-form PDFs (100+ pages)
Strong experience with: vLLM or Hugging Face TGI GPU-based LLM serving
Open-source LLMs (LLaMA, Qwen, Mistral, etc.)
Experience building deterministic validation systems (schema + rule enforcement)
Strong debugging and systems-level thinking
Ability to clearly articulate system trade-offs and business impact

Benefits

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools

PythonOCRdocument pipelineschunkingsegmentationmetadata trackingschema validationdeterministic validation systemsdebuggingsystems-level thinking

Soft Skills

collaborationcommunicationarticulation of system trade-offsbusiness impact analysis