Build robust systems for data storage, retrieval, and processing, all within our monorepo environment.
Own the reliability and observability stack that keeps our ML and analysis pipelines fast, secure, and reliable.
Optimize cloud infrastructure for cost efficiency, scalability, and performance across storage, compute, and networking layers.
Build monitoring systems that balance performance and stability under mission-critical workloads.
Collaborate with Security Engineers to enforce best practices for data isolation, access control, and auditability.
Support compliance alignment for RMF, SOC 2, and on-prem deployments without slowing down iteration speed and engineering agility.
Remove friction, speed up iteration, and reduce cognitive load for other engineers by simplifying local development, automating the boring stuff, adding tools to solve pain points, and communicating best practices to the rest of the company.
Help own code health throughout the company and engage proactively in efforts to regularly improve the codebase as a whole.
Design and manage end-to-end ML pipelines, from dataset ingestion and preprocessing to model training, validation, and deployment, ensuring the deployment of scalable and high-performance models.
Partner with AI and Security Research teams to ensure that model artifacts are versioned, reproducible, and production-ready.
Work closely with engineers to maintain the infrastructure that seamlessly integrates data feeds and manages the flow of meticulously labeled datasets across internal systems.
Requirements
A forward-thinking engineer eager to set new standards in technology who enjoys building scalable systems and solving complex operational puzzles.
Experienced with the latest in cloud technologies like Terraform, Kubernetes, AWS/GCP/Azure, and committed to sustainable, efficient solutions.
Experienced in DevOps, MLOps, or large-scale data infrastructure, ideally with exposure to distributed training or vector databases.
Thrive in a small, high-velocity team with autonomy and accountability.
Excited about applying AI to solve real business problems and improve developer productivity.
Cybersecurity experience is a bonus, but not required.
Benefits
Fully remote-first environment with in-person team offsites twice per year
Competitive compensation with equity
Comprehensive health benefits: medical, dental, and vision coverage; free One Medical annual membership
401(k) plan and Flexible Spending Account (FSA)
11 company holidays + unlimited PTO
Home office stipend to support your remote workspace setup
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.