Tech Stack
AWSCloudCyber SecurityPythonSQL
About the role
- Build and maintain AWS-based ML infrastructure (SageMaker, S3, Lambda, Batch)
- Design and operate CI/CD pipelines for ML models, including versioning, containerization, and automated retraining
- Implement monitoring, logging, metrics, and alerting for deployed models
- Provide scalable training environments and runtime infra to support research-to-production workflows
- Collaborate with AI and Data Engineering teams to integrate models, datasets, and features
- Transition AI prototypes and PoCs into production-ready, scalable services
Requirements
- Hands-on experience with AWS ML infrastructure (SageMaker, S3, Lambda, etc.)
- Experience with containerization and automation
- Skilled at setting up workflows for model deployment, versioning, retraining, and monitoring
- Proficiency in Python and SQL for scripting, automation, and lightweight data handling
- Experience with logging, metrics, and alerting for ML models in production
- Comfortable collaborating with AI engineers and Data Engineers
- Ability to balance rapid prototyping with building scalable, maintainable systems