Tech Stack
AWSAzureCloudDockerGoogle Cloud PlatformKafkaMicroservicesPostgresPythonRabbitMQRedisTensorflow
About the role
- We are seeking a Senior Backend Engineer to build the infrastructure and systems that power our AI/ML features in production.
- In this role, you will design and implement scalable APIs, services, and monitoring systems that enable our ML team to deploy models reliably at scale.
- You will focus on the platform layer that makes AI workflows fast, reliable, and observable, working closely with our ML engineers to translate their models into production-ready services that serve thousands of users.
- Build scalable APIs and microservices that serve ML models in production with low latency and high availability.
- Work with the core engineering team to implement reliable integrations with LLM APIs (Claude, GPT, Gemini).
- Implement monitoring, logging, and observability systems for AI workflows to ensure reliability and performance.
- Build the integration points between the AI/ML Platform stack and the core application, supporting core product development and long term technological investment.
- Support data pipeline development and maintenance that enable ML model training and inference.
- Ideal Candidate: Strong systems thinking with experience designing scalable backend architectures.
- Experienced in SaaS environments and understands the operational challenges of serving ML models in production.
- Pragmatic engineer who prioritizes reliability and performance over experimenting with the latest frameworks.
- Collaborative mindset with strong communication skills.
- Required Experience: 5+ years building scalable backend systems (Python, NestJS, etc.).
- Experience integrating external APIs at scale, including rate limiting, retry logic, and error handling.
- Proven track record with databases (PostgreSQL, Redis) and message queues (RabbitMQ, Kafka, or similar).
- Experience with containerization (Docker) and cloud platforms (AWS, GCP, or Azure).
- Understanding of monitoring and observability tools (DataDog, New Relic, or similar).
Requirements
- 5+ years building scalable backend systems (Python, NestJS, etc.).
- Experience integrating external APIs at scale, including rate limiting, retry logic, and error handling.
- Proven track record with databases (PostgreSQL, Redis) and message queues (RabbitMQ, Kafka, or similar).
- Experience with containerization (Docker) and cloud platforms (AWS, GCP, or Azure).
- Understanding of monitoring and observability tools (DataDog, New Relic, or similar).
- U.S. citizenship with the ability to pass a Federal Background Check and Identity Verification.
- While formal education is not a strict requirement, a Bachelor's or Master’s degree in Computer Science, Engineering, or a related field is preferred.