
Distributed Systems Engineer, Analytical Database Platform
Cloudflare
Full-time
Location Type: Hybrid
Location: Austin • Texas • United States
About the role
- Design, develop, and maintain scalable and reliable distributed systems across the entire data lifecycle.
- Build and optimise key components of our high-throughput data delivery platform to ensure data integrity and low-latency delivery.
- Develop new and improve existing components for the Cloudflare Analytical Platform to extend functionality and performance.
- Scale, monitor, and maintain the performance of our large-scale database clusters to accommodate the growing volume of data.
- Develop and enhance our customer-facing GraphQL APIs, log delivery, and alerting solutions, focusing on performance, reliability, and user experience.
- Identify and remove bottlenecks across our data platforms, from streamlining data ingestion to optimising query performance.
- Collaborate with other teams across Cloudflare to understand their data needs and build solutions that empower them to make data-driven decisions.
- Collaborate with the ClickHouse open-source community to add new features and contribute to the upstream codebase.
- Participate in the development of the next generation of our data platforms, including researching and evaluating new technologies and approaches.
Requirements
- 3+ years of experience working in software development covering distributed systems and databases.
- Strong programming skills (Golang is preferable), as well as a deep understanding of software development best practices and principles.
- Hands-on experience with modern observability stacks, including Prometheus and Grafana, and a strong understanding of handling high-cardinality metrics at scale.
- Strong knowledge of SQL and database internals, including experience with database design, optimisation, and performance tuning.
- A solid foundation in computer science, including algorithms, data structures, distributed systems, and concurrency.
- Strong analytical and problem-solving skills, with a willingness to debug, troubleshoot, and learn about complex problems at high scale.
- Ability to work collaboratively in a team environment and communicate effectively with other teams across Cloudflare.
- Experience with ClickHouse is a plus.
- Experience with data streaming technologies (e.g., Kafka, Flink) is a plus.
- Experience developing and scaling APIs, particularly GraphQL, is a plus.
- Experience with Infrastructure as Code tools like Salt or Terraform is a plus.
- Experience with Linux container technologies, such as Docker and Kubernetes, is a plus.
Benefits
- Health insurance
- Paid time off
- Flexible work arrangements
- Professional development opportunities
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
Golang, SQL, database design, performance tuning, distributed systems, data structures, algorithms, data streaming, GraphQL, observability
Soft Skills
analytical skills, problem-solving, collaboration, communication