BJAK

MLOps 工程师

BJAK

full-time

Posted on:

Location Type: Hybrid

Location: 🇨🇳 China

Visit company website
AI Apply
Apply

Job Level

Mid-LevelSenior

Tech Stack

KubernetesRay

About the role

  • 高效运行并管理开源大模型,优化推理的成本与可靠性
  • 确保在 GPU、CPU 与内存资源之间的高性能与稳定性
  • 实时监控与排查推理性能问题,确保低延迟与高吞吐量
  • 与工程团队协作,实现可扩展、可靠的模型服务架构
  • Run and manage open-source models efficiently, optimizing for cost and reliability
  • Ensure high performance and stability across GPU, CPU, and memory resources
  • Monitor and troubleshoot model inference to maintain low latency and high throughput
  • Collaborate with engineers to implement scalable and reliable model serving solutions

Requirements

  • 有使用 vLLM、HuggingFace TGI 等模型推理平台的经验
  • 熟悉 GPU 调度与资源编排,掌握 Kubernetes、Ray、Modal、RunPod、LambdaLabs 等工具
  • 具备根据流量动态监控推理延迟、成本并高效扩展系统的能力
  • 熟悉为后端工程师设置推理 API 接口的流程与规范
  • 喜欢主导项目并独立推动落地
  • 相信“清晰来自行动” —— 原型、测试、迭代,而非等待完美计划
  • 在初创环境中依然冷静高效 —— 不惧从零开始或变化快速
  • 重视速度 —— 优先交付有价值的产品,而非追求完美版本
  • 视反馈与失败为成长的一部分 —— 持续进阶自己的技能
  • 拥有谦逊、进取心与执行力,并在协作中带动他人前进
  • Experience with model serving platforms such as vLLM or HuggingFace TGI
  • Proficiency in GPU orchestration using tools like Kubernetes, Ray, Modal, RunPod, LambdaLabs
  • Ability to monitor latency, costs, and scale systems efficiently with traffic demands
  • Experience setting up inference endpoints for backend engineers
Benefits
  • 扁平化团队结构与真实项目主导权
  • 全程参与产品方向与决策制定
  • 灵活办公制度
  • 高影响力角色,跨产品、数据与工程多团队协作
  • 顶尖市场薪酬 + 绩效奖金
  • 全球化产品开发机会
  • 丰厚福利:住房租赁补贴、优质公司食堂、加班餐补
  • 健康、牙科与视力保险
  • 全球差旅保险(适用于你与家属)
  • 无限期、弹性带薪休假
  • Flat structure & real ownership
  • Full involvement in direction and consensus decision making
  • Flexibility in work arrangement
  • Top-of-market compensation and performance-based bonuses
  • Global exposure to product development
  • Housing rental subsidies, a quality company cafeteria, and overtime meals
  • Health, dental & vision insurance
  • Global travel insurance (for you & your dependents)
  • Unlimited, flexible time off

ATS Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
model inferenceGPU orchestrationlatency monitoringcost optimizationscalable systemsinference API setup
Soft skills
project leadershipindependent executioncalm under pressurespeed prioritizationfeedback acceptancehumilityproactivenesscollaboration
BJAK

MLOps 엔지니어, MLOps Engineer

BJAK
Mid · Seniorfull-time🇰🇷 South Korea
Posted: 3 hours agoSource: jobs.ashbyhq.com
KubernetesRay
BJAK

MLOps エンジニア, MLOps Engineer

BJAK
Mid · Seniorfull-time🇯🇵 Japan
Posted: 3 hours agoSource: jobs.ashbyhq.com
KubernetesRay
Matroid, Inc.

Software Engineer, Systems and Infrastructure

Matroid, Inc.
Mid · Seniorfull-time$150k–$250k / yearCalifornia · 🇺🇸 United States
Posted: 24 days agoSource: matroid.breezy.hr
AWSDockerIoTKafkaKubernetesLinuxMongoDBNGINXPrometheusPythonRayRedis
Defense Unicorns

Infrastructure Engineer, Bare Metal Kubernetes

Defense Unicorns
Mid · Seniorfull-time$149k–$201k / year🇺🇸 United States
Posted: 17 days agoSource: boards.greenhouse.io
AWSAzureCloudGoGoogle Cloud PlatformGrafanaKubernetesLinuxNGINXOpen SourcePrometheusPython+2 more
Rokt

Security Engineer

Rokt
Mid · Seniorfull-time$170k–$185k / yearNew York · 🇺🇸 United States
Posted: 8 days agoSource: apply.workable.com
AWSCloudGoGoogle Cloud PlatformKubernetesPython