Truelogic Software

Senior ML and DevOps Engineer – Advertising

full-time

Location Type: Remote

Location: Remote • 🇲🇽 Mexico

Job Level

Senior

Tech Stack

AWS, Azure, Cloud, Docker, ETL, Google Cloud Platform, Grafana, Prometheus, Python, Terraform

About the role

  • Design and implement end-to-end MLOps pipelines for training, deployment, and inference.
  • Build and maintain Python-based APIs (FastAPI) to support ML workflows and data ingestion.
  • Configure and manage cloud infrastructure (AWS, GCP, Azure) to support AI/ML workloads.
  • Automate deployment, scaling, and monitoring of models using Docker and CI/CD pipelines.
  • Implement observability, logging, and alerting with tools like CloudWatch, Prometheus, Grafana, and GCP logging.
  • Collaborate with cross-functional teams to define requirements and deliver scalable MLOps solutions.
  • Optimize model performance and system reliability while troubleshooting production issues.
  • Apply infrastructure-as-code practices (Terraform) for reproducible and maintainable deployments.
  • Stay ahead of industry best practices in MLOps, cloud-native engineering, and AI/ML integration.

Requirements

  • Proven experience designing and implementing end-to-end MLOps pipelines for model training and inference.
  • Advanced Python expertise, including multithreading, async/await, multiprocessing, and parallelism.
  • Experience building and deploying APIs with FastAPI for data ingestion and ML workflows.
  • Strong background in data cleaning, transformation, and ETL processes using Python.
  • Hands-on experience with Terraform (including production usage and state file management).
  • Expertise in Docker, including container packaging and entry point customization.
  • Experience monitoring and troubleshooting production ML models with S3, GCP logging, CloudWatch, Prometheus, or Grafana.
  • Proficiency with CI/CD pipelines (preferably GitHub Actions).

Benefits

  • 100% Remote Work: Enjoy the freedom to work from the location that helps you thrive.
  • Highly Competitive USD Pay: Earn excellent, market-leading compensation in USD.
  • Paid Time Off
  • Work with Autonomy: Freedom to manage your time as long as the work gets done.
  • Work with Top American Companies: Grow your expertise working with innovative U.S. companies.
  • A Culture That Values You: Prioritize well-being and work-life balance with engagement activities.
  • Diverse, Global Network: Connect with over 600 professionals in 25+ countries.
  • Team Up with Skilled Professionals: Collaborate with senior, experienced talent.
