Noda

MLOps Engineer

Noda

full-time

Posted on:

Location Type: Hybrid

Location: Austin • Texas • 🇺🇸 United States

Visit company website
AI Apply
Apply

Job Level

Mid-LevelSenior

Tech Stack

AirflowApacheAWSAzureCloudDockerKubernetesPythonPyTorchTensorflow

About the role

  • Own the complete lifecycle of machine learning models for autonomous vehicle orchestration
  • Build infrastructure, pipelines, and monitoring systems to deploy, validate, and continuously improve AI agents and reasoning systems
  • Design and implement automated training pipelines for LLMs, agent frameworks, and reasoning models
  • Build model versioning, experiment tracking, and artifact management systems
  • Develop automated model validation and testing frameworks, including simulation-based evaluation of agent behaviors
  • Implement A/B testing infrastructure for comparing AI reasoning strategies
  • Create model monitoring and observability systems to track performance, drift, and reliability
  • Optimize model deployment for edge computing (quantization, pruning, inference acceleration)
  • Build automated retraining pipelines using operational feedback from field deployments
  • Implement model governance and compliance frameworks, including audit trails and safety validation
  • Design feature stores and data pipelines preparing operational sensor and mission data
  • Support rollback and canary deployment strategies for safe model updates
  • Ensure secure model deployment practices, including model encryption, access controls, and adversarial robustness validation

Requirements

  • 4+ years of experience in MLOps, ML platform engineering, or production machine learning systems
  • Strong proficiency in Python
  • Experience with ML frameworks (PyTorch, TensorFlow, Hugging Face, MLflow, or similar)
  • Experience with container orchestration for ML workloads (Docker, Kubernetes)
  • Knowledge of model serving frameworks (TorchServe, TensorFlow Serving, Triton, or cloud ML endpoints)
  • Understanding of ML experiment tracking, model versioning, and artifact management systems
  • Experience with cloud ML platforms (AWS SageMaker, Azure ML, or Google AI Platform)
  • Proficiency with data pipeline tools (Apache Airflow, Prefect, or similar)
  • Knowledge of model monitoring, performance tracking, and automated alerting systems
  • Understanding of CI/CD practices specifically applied to ML model deployment
  • U.S. Citizenship with ability to obtain a security clearance
Benefits
  • Hybrid work environment
  • Competitive pay
  • Flexible time off
  • Generous PTO policy
  • Federal holidays
  • Generous health, dental, and vision benefits
  • Free OneMedical membership

ATS Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
MLOpsmachine learningPythonPyTorchTensorFlowHugging FaceMLflowDockerKubernetesCI/CD
Simform

Senior Data Scientist

Simform
Seniorfull-time🇮🇳 India
Posted: 24 days agoSource: simformsolutions.freshteam.com
AirflowAWSAzureCloudDockerETLKubernetesPythonPyTorchTensorflow
League

Principal AI Engineer

League
Leadfull-time$215k–$269k / year🇺🇸 United States
Posted: 31 days agoSource: boards.greenhouse.io
AirflowApacheAWSAzureCloudDistributed SystemsDockerGoGoogle Cloud PlatformHadoopKubernetesNoSQL+6 more
Provectus

Staff/Principal Machine Learning Engineer, AI

Provectus
Leadfull-time🇺🇸 United States
Posted: 24 days agoSource: jobs.lever.co
AirflowAWSAzureCloudDockerGoogle Cloud PlatformKubernetesPythonPyTorchSparkTensorflow
Scismic

Senior Data Scientist, NLP/LLM Specialist

Scismic
Seniorfull-time$155k–$240k / yearCalifornia · 🇺🇸 United States
Posted: 23 days agoSource: jobs.lever.co
AirflowApacheAWSAzureCloudDockerGoogle Cloud PlatformKubernetesPythonPyTorchSparkSQL+1 more
Substack

Machine Learning Systems Engineer

Substack
Mid · Seniorfull-time$200k–$250k / yearCalifornia · 🇺🇸 United States
Posted: 23 days agoSource: jobs.ashbyhq.com
AirflowAWSCloudDistributed SystemsDockerJavaScriptKubernetesPythonPyTorchRayTensorflowTypeScript