BJAK

MLOps Engineer

BJAK

full-time

Posted on:

Location Type: Hybrid

Location: Hong Kong • 🇭🇰 Hong Kong

Visit company website
AI Apply
Apply

Job Level

Mid-LevelSenior

Tech Stack

KubernetesRay

About the role

  • Fine-tune state-of-the-art models, design evaluation frameworks, and bring AI features into production
  • Run and manage open-source models efficiently, optimizing for cost and reliability
  • Ensure high performance and stability across GPU, CPU, and memory resources
  • Monitor and troubleshoot model inference to maintain low latency and high throughput
  • Collaborate with product, engineering, operations, infrastructure and data teams to implement scalable, reliable model serving solutions
  • Build and scale inference endpoints and monitoring to meet traffic demands and cost targets

Requirements

  • Experience with model serving platforms such as vLLM or HuggingFace TGI
  • Proficiency in GPU orchestration using tools like Kubernetes, Ray, Modal, RunPod, LambdaLabs
  • Ability to monitor latency, costs, and scale systems efficiently with traffic demands
  • Experience setting up inference endpoints for backend engineers
  • Experience fine-tuning state-of-the-art language models and designing evaluation frameworks
  • Experience ensuring model safety, trustworthiness, and production readiness
  • Experience managing GPU, CPU, and memory resources for high performance and stability
Benefits
  • Flat structure & real ownership
  • Full involvement in direction and consensus decision making
  • Flexibility in work arrangement
  • High-impact role with visibility across product, data, and engineering
  • Top-of-market compensation and performance-based bonuses
  • Global exposure to product development
  • Lots of perks - housing rental subsidies, a quality company cafeteria, and overtime meals
  • Health, dental & vision insurance
  • Global travel insurance (for you & your dependents)
  • Unlimited, flexible time off

ATS Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
model fine-tuningevaluation frameworksmodel servingGPU orchestrationlatency monitoringinference endpointsmodel safetyproduction readinesscost optimizationhigh throughput
Soft skills
collaborationtroubleshootingproblem-solvingcommunicationorganizational skills
BJAK

ML Ops Engineer

BJAK
Mid · Seniorfull-time🇬🇧 United Kingdom
Posted: 1 hour agoSource: jobs.ashbyhq.com
KubernetesRay
BJAK

ML Ops Engineer

BJAK
Mid · Seniorfull-time🇺🇸 United States
Posted: 1 hour agoSource: jobs.ashbyhq.com
KubernetesRay
BJAK

ML Ops Engineer

BJAK
Mid · Seniorfull-time🇩🇪 Germany
Posted: 1 hour agoSource: jobs.ashbyhq.com
KubernetesRay
BJAK

MLOps 엔지니어, MLOps Engineer

BJAK
Mid · Seniorfull-time🇰🇷 South Korea
Posted: 13 hours agoSource: jobs.ashbyhq.com
KubernetesRay
BJAK

MLOps エンジニア, MLOps Engineer

BJAK
Mid · Seniorfull-time🇯🇵 Japan
Posted: 13 hours agoSource: jobs.ashbyhq.com
KubernetesRay