Baseten

Applied AI Inference Engineer

Baseten

full-time

Posted on:

Origin:  • 🇺🇸 United States • California

Visit company website
AI Apply
Manual Apply

Job Level

Junior

Tech Stack

DockerPythonSpark

About the role

  • Partner directly with customers to architect, build, and deploy high-scale production AI applications on Baseten’s platform.
  • Own the journey with customers from initial exploration to production deployment, translating ambiguous business goals into reliable, observable services with clear quality, latency, and cost outcomes.
  • Work across product, software development, performance engineering, and customer-facing implementations.
  • Hands-on coding and software development with product management, technical customer success, and pre-sales solution engineering aspects.
  • Part of Forward Deployed Engineering Team with emphasis on performance engineering.

Requirements

  • Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field.
  • 1+ years of professional work experience in a fast-paced, high-growth environment.
  • Demonstrated experience with one or more general-purpose programming languages in a production-level environment, with a strong preference for Python.
  • Familiarity with AI/ML pipelines and the lifecycle of ML model development and deployment.
  • Strong communication skills, particularly on complex technical topics.
  • Experience in building or optimizing AI/ML projects is highly valued.