Tech Stack
AWSAzureCloudDockerKubernetesPythonPyTorchTensorflowTerraform
About the role
- Research, design, and build production-grade Generative AI and intelligent automation solutions using LLMs, RAG pipelines, and vector databases.
- Lead full-cycle development: data preparation, model fine-tuning, evaluation, optimization, containerization, and secure cloud deployment (Azure/AWS).
- Implement MLOps/LLMOps pipelines for automated testing, monitoring, and rollback.
- Apply Responsible AI practices and compliance standards (e.g., NIST RMF, FedRAMP) throughout solution delivery.
- Translate business and mission requirements into technical designs; prototype and iterate quickly with stakeholders.
- Collaborate with engineers, data scientists, and domain experts across SaaS, cloud, and big data environments to translate research into production-ready solutions.
- Contribute to Azumo’s innovation roadmap by identifying research topics, publishing insights, and advancing the AI software development services portfolio.
Requirements
- Bachelor’s Degree in Computer Science, Data Science, or related field (Master’s is a plus).
- 3+ years of experience developing and deploying ML, NLP, or Generative AI systems.
- Expert-level skills in Python and software engineering fundamentals (data structures, testing, CI/CD, Git, containers).
- Hands-on experience with AI development tools: PyTorch, TensorFlow, LangChain, LangGraph, and vector databases (Pinecone, FAISS, Azure AI Search).
- Proven cloud deployment experience (Azure preferred; AWS acceptable).
- Familiarity with DevOps/Infrastructure as Code (GitHub Actions, Terraform/Bicep, Docker/Kubernetes).
- Strong written and verbal communication skills to explain technical concepts to diverse audiences.
- Preferred: Experience with compliance frameworks relevant to AI software development services (e.g., NIST, FedRAMP).
- Preferred: Contributions to research papers, open-source libraries, or AI communities.