Canva

Senior Fullstack Engineer – Observability

Canva

full-time

Posted on:

Location Type: Remote

Location: Australia

Visit company website

Explore more

AI Apply
Apply

Job Level

About the role

  • Delivering features, taking ownership of the entire lifecycle from design doc through to implementation and release.
  • Building cross-telemetry applications that surface actionable insights by combining data from logs, metrics, traces, and profiles to help engineers understand system health and diagnose issues quickly.
  • Owning and operating the platform infrastructure that powers our observability applications, ensuring reliability, scalability, and performance as we grow.
  • Partnering directly with engineering customers to gather requirements, understand their incident response workflows, and translate those needs into practical tooling solutions.
  • Balancing best practices with user needs, applying your observability expertise to design solutions that follow industry standards while addressing Canva's specific requirements.
  • Participating in incident response, leveraging your SRE/observability background to understand real-world usage patterns and continuously improve our tools based on operational experience.
  • Being proactive in modernising our Observability tooling, and participating in brainstorming and developing proof of concept designs to determine how we can improve our existing systems.
  • Guiding and mentoring other team members, sharing your observability expertise and helping the team grow their skills in both building and operating production services.

Requirements

  • Strong Computer Engineering / Science knowledge around fundamentals and first principles relating to concurrency, multithreading, data structures, architecture, and various design patterns
  • Strong full-stack engineering experience with React for frontend development and Golang for backend services.
  • Deep observability domain knowledge from working as an SRE or on a dedicated observability team, with hands-on experience in incident response and understanding what makes effective observability tooling.
  • Experience building user-facing applications for technical audiences, with the ability to gather requirements from engineers and translate them into intuitive tools.
  • Proficiency with observability platforms and tools such as Grafana, Prometheus, Jaeger, Loki, or similar systems, with understanding of how to query and visualize telemetry data.
  • Understanding of distributed systems' observability, including experience working with logs, metrics, traces, and profiles across microservice architectures in cloud environments (ideally AWS).
  • Infrastructure and operations experience, including deploying and operating production services, with knowledge of infrastructure-as-code tools (we use Terraform).
  • Strong product thinking and customer empathy, with the ability to balance technical best practices with user requirements and advocate for the end-user experience.
  • Excellent communication skills, both written and verbal, as this role requires significant documentation, cross-team alignment, and direct collaboration with engineering customers.
  • Nice to have: Experience with multiple backend languages such as Python, Java, or others, which helps when working with diverse telemetry data sources and instrumentation libraries.
  • Experience with Grafana's Scenes UI framework, or building custom Grafana plugins and applications.
  • Familiarity with OpenTelemetry and modern observability standards and practices.
Benefits
  • Equity packages - we want our success to be yours too
  • Inclusive parental leave policy that supports all parents & carers
  • An annual Vibe & Thrive allowance to support your wellbeing, social connection, office setup & more
  • Flexible leave options that empower you to be a force for good, take time to recharge and supports you personally

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
full-stack engineeringReactGolangobservabilityincident responseinfrastructure-as-codeTerraformdistributed systemsdata structuresdesign patterns
Soft skills
product thinkingcustomer empathycommunication skillsmentoringcollaboration