Atmosera

Data Engineer, AI/ML Specialist

Atmosera

contract

Posted on:

Origin:  • 🇺🇸 United States • Oregon

Visit company website
AI Apply
Manual Apply

Job Level

Mid-LevelSenior

Tech Stack

AzurePythonSQL

Requirements

  • Data Engineering: Design, implement, and optimize OCR and data extraction pipelines for ingesting PDF referral forms and other healthcare documents.
  • Develop data transformation and mapping processes to store extracted data into an Azure SQL schema.
  • Build and maintain the medallion data architecture (Bronze/Silver/Gold) in Azure Data Lake Storage to support both application workflows and future analytics/model training.
  • Implement and optimize search indexing (Azure AI Search) for both structured and unstructured data.
  • AI/ML Development: Integrate Azure OpenAI or equivalent LLM services with retrieval-augmented generation (RAG) to support a natural language query feature.
  • Implement initial predictive analytics models (fall risk, infection risk, medication cost forecasting) using heuristics or light ML models, with a path to more advanced modeling in Phase 2.
  • Develop a compliance AI agent (MVP version) that runs scheduled checks on documentation and flags potential violations.
  • Collaborate with the Solution Architect to ensure AI components are secure, HIPAA-compliant, and cost-optimized.
  • Collaboration & Delivery: Work in Azure DevOps with sprint-based development, pairing with backend/API developers on AI integration points.
  • Participate in architecture/design reviews and provide input on data model decisions.
  • Document pipelines, AI integrations, and data flows for future handoff to the client’s internal team.
  • Support UAT for AI/data features, including tuning models and validating extracted data.
  • Required Skills & Experience: Technical Expertise
  • Proven experience with Azure Data Services (Azure SQL, Data Lake Storage, Azure AI Search, Azure Functions).
  • Hands-on experience with OCR (Azure Form Recognizer / Azure Document Intelligence) and unstructured data processing.
  • Strong Python skills for AI pipeline and data processing development.
  • Experience integrating LLMs (Azure OpenAI) with RAG patterns.
  • Knowledge of basic predictive modeling techniques (classification, regression) and healthcare data compliance considerations.
  • Proficiency with REST APIs for integrating external systems (e.g., pharmacy, insurance).
  • Healthcare & Compliance
  • Understanding of HIPAA compliance and secure handling of PHI.
  • Familiarity with healthcare data formats and workflows (HL7, FHIR, eMAR).
  • Collaboration & Delivery
  • Prior experience working in agile, cross-functional product teams.
  • Ability to deliver end-to-end—from data ingestion to model integration to API exposure.
  • Strong documentation and knowledge transfer skills.
  • Preferred Qualifications
  • Experience in healthcare software development (EHR, EMR, or related domains).
  • Familiarity with vector search and semantic search concepts.
  • Experience with Azure Machine Learning for model deployment.
  • Background in predictive analytics for healthcare outcomes.