Applied AI Inference Engineer

Baseten

full-time

Posted on: 9/7/2025

Location: California • 🇺🇸 United States

Junior

Docker, Python, Spark

About the role

Partner directly with customers to architect, build, and deploy high-scale production AI applications on Baseten’s platform.
Own the journey with customers from initial exploration to production deployment, translating ambiguous business goals into reliable, observable services with clear quality, latency, and cost outcomes.
Work across product, software development, performance engineering, and customer-facing implementations.
Combine hands-on coding and software development with elements of product management, technical customer success, and pre-sales solution engineering.
Join the Forward Deployed Engineering team, with an emphasis on performance engineering.

Requirements

Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field.
1+ years of professional work experience in a fast-paced, high-growth environment.
Demonstrated experience with one or more general-purpose programming languages in a production-level environment, with a strong preference for Python.
Familiarity with AI/ML pipelines and the lifecycle of ML model development and deployment.
Strong communication skills, particularly on complex technical topics.
Experience in building or optimizing AI/ML projects is highly valued.