Staff Backend Engineer, AI

Pulley

full-time

Posted on: 12/3/2025

Location Type: Remote

Location: Remote • 🇺🇸 United States

Visit company website

✨ AI Apply

Apply

Salary

💰 $200,000 - $240,000 per year

Job Level

Lead

Tech Stack

AWSCloudDistributed SystemsDockerGoGoogle Cloud PlatformKubernetesMicroservicesPostgresPythonRedis

About the role

Architect distributed systems that power AI agent orchestration, handling complex multi-step workflows with reliability and observability
Design and implement core backend services in Golang for high-performance, concurrent operations and Python for AI/ML integrations and data pipelines
Build robust APIs and service interfaces that abstract AI capabilities for consumption by multiple clients and internal systems
Own the technical roadmap for backend infrastructure supporting LLM integrations, vector search, and real-time data processing
Establish architectural patterns and standards for service design, data modeling, and inter-service communication
Drive system scalability and reliability, implementing caching strategies, database optimization, and async processing patterns
Build data pipelines that process equity data, embeddings, and user interactions to continuously improve AI model performance
Collaborate cross-functionally to translate business requirements into technical architecture and make pragmatic build vs. buy decisions
Mentor engineers on backend best practices, system design, and technical decision-making

Requirements

7+ years of backend engineering experience building production systems at scale
Expert-level proficiency in Python for AI/ML integrations, data processing, and scripting
Strong Python skills with deep understanding of concurrency patterns, performance optimization, and production Go services
Proven experience designing and building distributed systems, including microservices architecture, event-driven systems, and async workflows
Deep knowledge of API design (REST) and service-to-service communication patterns
Demonstrated experience building backend systems that integrate LLMs and AI capabilities into production applications
Hands-on work with agent frameworks, LLM orchestration, and AI service reliability
Familiarity with RAG architectures, vector databases (Pinecone, Weaviate, Qdrant), and embeddings pipelines
Experience building evaluation frameworks and observability for AI systems
Understanding of prompt engineering and LLM API optimization (latency, costs, rate limits)
Track record of architecting greenfield services from concept to production with scalability and maintainability in mind
Experience with database design and optimization (PostgreSQL, Redis, time-series databases)
Strong foundation in system observability, monitoring, and debugging distributed systems in production
Proven ability to lead technical initiatives and establish engineering best practices
Experience mentoring engineers and elevating team technical capabilities
Experience with Kubernetes, Docker, and cloud infrastructure (AWS/GCP) is a bonus
Background in FinTech or regulated industries is a bonus
Contributions to open-source projects or technical writing is a bonus
Familiarity with AI-powered developer tools like Cursor or Copilot is a bonus.

Benefits

Competitive salary and equity
Medical, Dental, and Vision insurance
Unlimited PTO + Winter holiday break
Parental leave
Generous stipends for WFH, learning, wellness, and AI tools
401(k) match (US) / Pension match (Canada)

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills

GolangPythonAPI designmicroservices architectureevent-driven systemsasync workflowsdatabase optimizationdata processingconcurrency patternsprompt engineering

Soft skills

mentoringcollaborationtechnical decision-makingleadershipcommunication