Tech Stack
AWSCloudGoogle Cloud PlatformKubernetesPythonTerraform
About the role
- Monarch is a personal finance platform aiming to simplify finances for users.
- Design, build, and maintain infrastructure powering ML and AI workloads.
- Enable ML engineers to train, deploy, and monitor models at scale while ensuring security, cost-effectiveness, and performance.
- Implement automated infrastructure solutions using Terraform/OpenTofu.
- Introduce and integrate AI infrastructure capabilities including vector databases, observability, and GPUs.
- Own core cloud infrastructure, networking, secrets management, Kubernetes/GPU orchestration, and shared platform services.
- Collaborate with AI Engineering on LLM runtime, retrieval architecture, observability, and rollouts.
Requirements
- 4+ years of professional experience with cloud infrastructure (AWS or GCP preferred).
- 2+ years of professional experience deploying and managing ML workloads in the cloud.
- Proficiency in Python for automation, scripting, and tooling.
- Advanced hands-on experience with Infrastructure-as-Code tools (Terraform or OpenTofu) in production environments.
- Strong problem-solving skills, ability to work autonomously, and a collaborative mindset.
- Experience in cloud networking and security best practices for data-intensive workloads.
- Clear verbal and written communication, cross-functional collaboration, analytical thinking, ability to manage multiple priorities, self-motivated, and proactive.
- Experience working in a high-growth startup or fast-paced environment.