Fiddler AI

Staff Backend Engineer

Fiddler AI

full-time

Posted on:

Location Type: Hybrid

Location: Palo Alto • California • 🇺🇸 United States

Visit company website
AI Apply
Apply

Salary

💰 $190,000 - $300,000 per year

Job Level

Lead

Tech Stack

AWSCloudDistributed SystemsGoogle Cloud PlatformKafkaKubernetesMicroservicesPostgresPythonRabbitMQRayRedis

About the role

  • Design and build core services and components of a world-class cloud platform to help enterprises develop, monitor and improve their full suite of AI based applications (covering predictive models, LLMs, GenAI models and agentic applications)
  • Lead the design and implementation of distributed systems and microservices that compute, persist, and expose new ML + agentic observability metrics (e.g., response relevancy, hallucination scores) from raw trace data
  • Design enterprise-grade, scalable data infrastructure, services and APIs to support enterprise scale workloads and meet compliance needs and SLAs
  • Spearhead the development of new types of metrics and evaluation capabilities to satisfy evolving customer needs. Take part in conversations with customers around discovery and support
  • Define and evolve the operational maturity (reliability, latency, SLOs, observability) of core services, establish best practices and champion improvements to internal CI/CD processes, testing frameworks, error handling, efficiency and resiliency
  • Team & Culture Building: you will take an active role in building a world-class engineering team and actively participate in the talent acquisition process through interviewing, candidate evaluation and coaching

Requirements

  • Masters or Bachelors degree in Computer Science or related field, combined with 7+ years of industry experience, with demonstrated solid foundation in software development.
  • Deep proficiency with Python and a strong command of essential backend technologies like Postgres, Redis, Kafka, RabbitMQ, Ray. This includes the ability to design, build, and debug complex, large-scale systems.
  • Experience with deploying and working with ML/LLM models in production. The candidate should be comfortable with modern LLM frameworks (e.g., Langchain, HuggingFace, vLLM) and evaluation frameworks (e.g., Ragas, MLFlow) to ensure model performance and reliability.
  • Adaptability & Ownership: proven ability to thrive in ambiguity and a fast-paced environment. We need a self-motivated initiator who can take ownership of projects with a high degree of autonomy, confidently filling in the gaps when the full picture isn't available.
  • System Design & Optimization: A strong grasp of distributed systems and the capacity to troubleshoot production issues. A nice to have would be experience with cloud infrastructure (AWS/GCP, Kubernetes) and specialized databases (Clickhouse/Druid), indicating a deeper understanding of system architecture and performance optimization.
  • Technical Leadership & Collaboration: Demonstrated ability to plan, execute, and deliver projects by effectively breaking down complex problems into manageable tasks, and guiding a small team of engineers. Must be adept at cross-functional collaboration across a geographically distributed team, working closely with product managers, designers, frontend developers, and data scientists to ensure alignment and successful project outcomes
  • Coaching & Mentorship: you should be an excellent collaborator and a mentor to other team members, raising the technical bar for the entire team and regularly engage in code and design reviews.
  • Ability to work in our Palo Alto office 3 days a week
Benefits
  • equity
  • benefits 📊 Resume Score Upload your resume to see if it passes auto-rejection tools used by recruiters Check Resume Score

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
PythonPostgresRedisKafkaRabbitMQRayML frameworksLLM frameworkssystem designdistributed systems
Soft skills
adaptabilityownershiptechnical leadershipcollaborationcoachingmentorshipproblem-solvingcommunicationteam buildinginitiative
Certifications
Masters degreeBachelors degree
Resource Innovations

Lead Java Software Engineer

Resource Innovations
Seniorfull-time$135k–$160k / yearArizona, California, Colorado, Illinois, Wisconsin · 🇺🇸 United States
Posted: 4 hours agoSource: apply.workable.com
AngularApacheAWSCloudDockerDynamoDBETLHibernateJavaJavaScriptJUnitKubernetes+14 more
DTEX Systems

Senior Backend Full Stack Engineer

DTEX Systems
Seniorfull-time$170k–$220k / yearCalifornia · 🇺🇸 United States
Posted: 1 day agoSource: dtexsystems.applytojob.com
DjangoElasticSearchGraphQLPostgresPython
Point One Navigation

Staff Engineer – Network and Backend

Point One Navigation
Leadfull-timeCalifornia · 🇺🇸 United States
Posted: 1 day agoSource: jobs.ashbyhq.com
ApacheDistributed SystemsGoJavaKafkaMicroservicesRabbitMQ
Xero

Principal Backend Engineer, Payments Platform

Xero
Leadfull-time$224k–$302k / yearCalifornia · 🇺🇸 United States
Posted: 2 days agoSource: jobs.lever.co
Distributed SystemsGoJavaNoSQLSQL