Tech Stack
AndroidCloudDockerGoogle Cloud PlatformiOSKubernetesPython
About the role
- Work alongside machine learning researchers, engineers, and product managers to bring our AI Voices to their customers for a diverse range of use cases
- Deploy and operate the core ML inference workloads for our AI Voices serving pipeline
- Introduce new techniques, tools, and architecture that improve the performance, latency, throughput, and efficiency of our deployed models
- Build tools to give us visibility into our bottlenecks and sources of instability and then design and implement solutions to address the highest priority issues
- Make product decisions and contribute to building great user experiences that delight users
- Operate and maintain critical production services and improve system reliability and observability
Requirements
- Experience shipping Python-based services
- Experience being responsible for the successful operation of a critical production service
- Experience with public cloud environments, GCP preferred
- Experience with Infrastructure such as Code, Docker, and containerized deployments.
- Preferred: Experience deploying high-availability applications on Kubernetes.
- Preferred: Experience deploying ML models to production
- Experience building great user experiences that delight users
- Strong work ethic, solid communication skills, and obsession with winning
- Ability to work effectively in a 100% distributed/asynchronous work culture