Salary
💰 $160,000 - $170,000 per year
Tech Stack
AWSCloudCyber SecurityDistributed SystemsDockerDynamoDBEC2MicroservicesPythonPyTorchScikit-LearnSMTPTensorflowTerraform
About the role
- Implement and optimize LLM-based features including prompt engineering, fine-tuning, and RAG systems
- Design, develop, and train machine learning models for various business applications (classification, regression, NLP)
- Conduct rigorous model evaluation using appropriate metrics and cross-validation strategies
- Develop evaluation pipelines for both traditional ML models and LLM applications
- Create reproducible experimentation frameworks for model iteration and improvement
- Build ensemble methods and optimize model architectures for performance/accuracy trade-offs
- Deploy ML infrastructure on AWS (SageMaker, Bedrock, EC2)
- Develop cost-effective solutions for both traditional models and LLMs
- Use Docker and Terraform to manage ML environments
- Build real-time and batch inference systems for ML models
- Create data pipelines and feature stores for model training
- Set up vector databases and RAG systems for LLM applications
- Implement monitoring and alerts for models in production
- Manage both traditional ML workloads and LLM API integrations
- Collaborate with engineering teams to integrate ML features into products
- Participate in code reviews ensuring high-quality, maintainable ML code
- Contribute to technical documentation and knowledge sharing
- Support other engineers in understanding and using ML systems
- Partner with product teams to scope and deliver ML features
Requirements
- 6+ years of experience in machine learning engineering or data science
- 3+ years of production Python development with experience in ML frameworks (PyTorch, TensorFlow, Scikit-learn, XGBoost)
- Experience in building RESTful APIs and microservices
- Experience with AWS services including: SageMaker, S3, DynamoDB
- Experience with Bedrock or similar LLM services
- Experience with Docker and containerization
- Experience with the full ML lifecycle: problem framing, data analysis, model development, evaluation, and deployment
- Strong software engineering fundamentals and design patterns
- Experience with version control (Git) and collaborative development
- Understanding of distributed systems and scalability
- Track record shipping ML products (bonus)
- PhD or other research experience (bonus)
- Deep experience in applicable ML areas (NLP, Deep learning, Bayesian methods, Reinforcement learning, clustering) (bonus)
- Experience with Infrastructure as Code using Terraform or similar (bonus)
- Hands-on experience with LLM applications (prompt engineering, fine-tuning, RAG) (bonus)