Tech Stack
Tools & technologies
- Cloud
- Distributed Systems
- Kubernetes
About the role
Key responsibilities & impact
- Contribute to the evolution of Canva’s unified training platform for AI training workloads
- Improve reliability, observability, debugging, and operational support for training systems
- Design and build platform capabilities for better scheduling at scale and resource management
- Collaborate with research scientists, ML engineers, product teams, and cloud/infrastructure teams
- Shape platform roadmap based on user pain points and long-term platform maturity
- Mentor engineers and share best practices in AI systems and infrastructure
Requirements
What you’ll need
- Strong experience in training pipelines, distributed systems, or large-scale AI infrastructure
- Strong experience working with Kubernetes and containerized workloads
- Familiar with modern cloud and infrastructure services for high-performance AI workloads
- A strong sense of ownership and enjoyment of working on complex problems that impact multiple teams
- Comfortable collaborating with engineers, applied scientists, and infrastructure partners
- Motivated to help build platform foundations that enable AI-powered creativity at scale
Benefits
Comp & perks
- Equity packages: we want our success to be yours too
- Inclusive parental leave policy that supports all parents & carers
- An annual Vibe & Thrive allowance to support your wellbeing, social connection, office setup & more
- Flexible leave options that empower you to be a force for good, take time to recharge, and support you personally
ATS Keywords
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
training pipelines, distributed systems, large-scale AI infrastructure, Kubernetes, containerized workloads, cloud services, infrastructure services, debugging, resource management, observability
Soft Skills
ownership, problem-solving, collaboration, mentoring, communication
