Tech Stack
ApacheAWSCloudDockerJavaJenkinsKubernetesMongoDBMySQLNumpyPandasPostgresPythonPyTorchScalaScikit-LearnSparkTensorflowTerraform
About the role
- Ensure that ML models can be effectively developed, deployed, managed, and monitored in production environments.
- Productionize ML models – integrate trained ML models with production systems.
- Build and manage ML pipelines – design, build, and maintain automated pipelines including data ingestion, data preprocessing, model training, validation, and deployment utilizing CI/CD practices.
- Infrastructure management – set up and manage infrastructure for ML workloads utilizing cloud platforms and containerization technologies.
- Monitoring and alerting – implement monitoring systems to track performance of ML models in production.
- Automation – automate various tasks within the ML workflow to improve efficiency and reproducibility.
- Performance optimization – identify ways to optimize the performance, efficiency, and scalability of ML models and their supporting infrastructure.
- Collaborate with teams and perform all other duties as assigned or directed.
- Work 100% onsite in Woodlawn, MD, 5 days a week.
Requirements
- Bachelor's Degree and 7+ years' experience in Computer Science, Mathematics, Engineering or a related field.
- Masters or Doctorate degree may substitute for required experience.
- Minimum 5 years of hands-on experience designing, developing, implementing and maintaining ML workflows and data pipelines.
- Must be able to obtain and maintain a Public Trust (Contract requirement).
- No Visa Sponsorship Provided; Green Card/EAD can also apply.
- Strong foundation in AI, ML and LLMs including concepts, algorithms, model training and frameworks (TensorFlow, PyTorch, scikit-learn).
- Strong programming skills, especially Python, and relevant libraries (scikit-Learn, TensorFlow, PyTorch, NumPy, Pandas).
- Experience with MLOps platforms and tools (AWS Sagemaker, MLflow, Kubeflow, DataRobot).
- Experience with CI/CD tools (Jenkins required).
- Experience with containerization (Docker) and orchestration (Kubernetes).
- Proficiency with cloud platforms (AWS preferred), Infrastructure as Code (CloudFormation, Terraform), compute and storage (S3, EFS), and networking.
- Knowledge of data engineering fundamentals including data storage (PostgreSQL, MySQL, MongoDB) and data processing frameworks (Apache Spark).
- Strong communication, collaboration, problem-solving, analytical, and critical thinking skills.
- Prior experience with federal or state government IT projects (desired).
- Familiarity with other programming languages such as Java and Scala (desired).
- Experience with Natural Language Processing (NLP) for text and language generation (desired).
- Availability to work 100% onsite in Woodlawn, MD, 5 days a week.