Tech Stack
AWS, Azure, Cloud, Google Cloud Platform, Python, PyTorch, TensorFlow
About the role
- Participate in and contribute effectively to the design, development, and implementation of complex applications, often using new technologies.
- Design, develop, and deploy machine learning and deep learning models, particularly ones that leverage OpenAI APIs.
- Implement OCR pipelines using tools such as Tesseract, Adobe, AWS Textract, Google Vision, or Azure Form Recognizer to extract structured information from unstructured documents.
- Integrate OCR outputs with NLP/LLM models to build intelligent document understanding and knowledge extraction systems.
- Optimize model performance, accuracy, and scalability through data preprocessing, feature engineering, and hyperparameter tuning.
- Develop end-to-end ML pipelines from data ingestion and training to deployment and monitoring.
- Collaborate with product managers, data scientists, and engineers to translate business requirements into AI solutions.
- Stay up to date with the latest advancements in generative AI, computer vision, and OCR technologies, and evaluate their potential business applications.
- Ensure compliance with data privacy, security, and responsible AI practices in all deployed systems.
- Provide technical expertise and systems design for individual initiatives and work with SME consultants.
Requirements
- Bachelor’s or Master’s degree in Computer Science, AI/ML, Data Science, or a related field.
- 4+ years of experience as a Machine Learning Engineer, NLP Engineer, or AI Specialist.
- Strong expertise with OpenAI APIs, GPT models, LangChain, or similar LLM frameworks.
- Hands-on experience with OCR tools (Tesseract, EasyOCR, PaddleOCR, AWS Textract, Google Cloud Vision, or Azure OCR).
- Proficiency in Python and ML frameworks such as TensorFlow, PyTorch, or Hugging Face Transformers.
- Experience building and deploying ML solutions in cloud environments (AWS, Azure, GCP).