Apply

Ready to go for it?

AI Apply speeds things up—apply directly if you prefer.

FREE ACCESS
5,000–10,000 jobs/day
JobTailor Logo

See all jobs on JobTailor

Search thousands of fresh jobs every day.

Discover
  • Fresh listings
  • Fast filters
  • No subscription required
Create a free account and start exploring right away.
Baseten

Solution Architect, AI/LLM Inference

Baseten

Solution Architect translating AI customer needs into technical solutions for dynamic AI companies. Leading demos and project executions with Sales and Engineering teams.

Posted 5/11/2026full-timeSan Francisco • California • 🇺🇸 United StatesMid-LevelSenior💰 $165,000 - $330,000 per yearWebsite

About the role

Key responsibilities & impact
  • Partner with Sales on customer discovery calls (most often second calls, occasionally first calls for large accounts).
  • Lead demos and technical scoping to align on success criteria, architecture, and deployment approach.
  • Own benchmarking and repeatable deployments, including:
  • - Handling standard deployment patterns and configurations across many modalities – LLMs, embeddings, image and video generation, VoiceAI, etc.
  • - Advising on tradeoffs like H100s vs B200s and latency-optimized vs throughput-optimized setups.
  • - Driving consistent “playbook” style deployments for common models and use cases.
  • Become a power user of different runtimes such as vllm, sglang, and TRT-LMM and all the common configurations and tradeoffs between them
  • Drive POC and project execution, including:
  • - Scoping POCs and keeping stakeholders aligned on timeline, deliverables, and next steps.
  • - Acting as the “ringleader” or project manager for POCs.
  • - Pulling in Forward Deployed Engineering (FDE) support when deeper or more complex technical work is needed.

Requirements

What you’ll need
  • AI/ML background and the ability to credibly discuss AI/ML topics with technical stakeholders.
  • Strong customer-facing communication skills, including the ability to run structured discovery and clarify ambiguous requirements.
  • Technical depth to scope solutions, without needing to write production code.
  • Ability to script and prototype as needed, including comfort “vibe coding” to move quickly in technical workflows.
  • Experience running or supporting benchmarks for ML inference deployments (Nice to have).
  • Familiarity with infrastructure tradeoffs relevant to inference performance and cost (for example GPU selection and latency versus throughput tuning)(Nice to have).
  • Experience serving as a cross-functional technical lead for customer POCs, including coordination across Sales and Engineering (Nice to have).

Benefits

Comp & perks
  • Competitive compensation, including meaningful equity.
  • 100% coverage of medical, dental, and vision insurance for employee and dependents
  • Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
  • Paid parental leave
  • Fertility and family-building stipend through Carrot
  • Company-facilitated 401(k)
  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities.

ATS Keywords

✓ Tailor your resume
Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard Skills & Tools
AI/MLbenchmarkingdeployment patternsLLMsembeddingsimage generationvideo generationVoiceAIvllmsglang
Soft Skills
customer-facing communicationstructured discoveryclarifying ambiguous requirementsproject managementstakeholder alignmenttechnical depthscriptingprototypingvibe codingcross-functional leadership