Cloudflare

Distributed Systems Engineer, Analytical Database Platform

Full-time

Location Type: Hybrid

Location: Austin, Texas, United States

About the role

  • Design, develop, and maintain scalable and reliable distributed systems across the entire data lifecycle.
  • Build and optimise key components of our high-throughput data delivery platform to ensure data integrity and low-latency delivery.
  • Develop new and improve existing components for the Cloudflare Analytical Platform to extend functionality and performance.
  • Scale, monitor, and maintain the performance of our large-scale database clusters to accommodate the growing volume of data.
  • Develop and enhance our customer-facing GraphQL APIs, log delivery, and alerting solutions, focusing on performance, reliability, and user experience.
  • Identify and remove bottlenecks across our data platforms, from streamlining data ingestion to optimising query performance.
  • Collaborate with other teams across Cloudflare to understand their data needs and build solutions that empower them to make data-driven decisions.
  • Collaborate with the ClickHouse open-source community to add new features and contribute to the upstream codebase.
  • Participate in the development of the next generation of our data platforms, including researching and evaluating new technologies and approaches.

Requirements

  • 3+ years of experience working in software development covering distributed systems and databases.
  • Strong programming skills (Golang preferred) and a deep understanding of software development best practices and principles.
  • Hands-on experience with modern observability stacks, including Prometheus and Grafana, and a strong understanding of handling high-cardinality metrics at scale.
  • Strong knowledge of SQL and database internals, including experience with database design, optimisation, and performance tuning.
  • A solid foundation in computer science, including algorithms, data structures, distributed systems, and concurrency.
  • Strong analytical and problem-solving skills, with a willingness to debug, troubleshoot, and learn about complex problems at high scale.
  • Ability to work collaboratively in a team environment and communicate effectively with other teams across Cloudflare.
  • Experience with ClickHouse is a plus.
  • Experience with data streaming technologies (e.g., Kafka, Flink) is a plus.
  • Experience developing and scaling APIs, particularly GraphQL, is a plus.
  • Experience with Infrastructure as Code tools such as Salt or Terraform is a plus.
  • Experience with Linux container technologies, such as Docker and Kubernetes, is a plus.

Benefits

  • Health insurance
  • Paid time off
  • Flexible work arrangements
  • Professional development opportunities