Salary
💰 $125,562 - $179,374 per year
Tech Stack
AirflowApacheAWSAzureCassandraDistributed SystemsDynamoDBETLGoogle Cloud PlatformGRPCHadoopJavaMySQLPandasPostgresPythonScalaSpringSpring BootSpringBootSQL
About the role
- Maintain and enhance Spotify’s reporting pipelines, ensuring they remain business-critical
- Build large-scale batch and real-time data pipelines to efficiently collect, process, and store reporting-related data
- Make enhancements to data models and ETL code to improve pipeline efficiency and data quality
- Design, develop, and maintain high-performance backend services and APIs to power Spotify’s advertising platform
- Work closely with stakeholders to understand and translate business requirements into technical solutions
- Write scalable, testable, maintainable, and high-quality code
- Collaborate with engineers, product managers, and other stakeholders, embracing learning and leadership opportunities daily
- Monitor daily execution, diagnose issues, and ensure SLAs are met by maintaining and fixing business-critical pipelines
Requirements
- Experience in data engineering with high-volume, heterogeneous data, ideally within distributed systems such as Hadoop, BigTable, Cassandra, GCP, AWS, or Azure
- Familiarity with data processing frameworks like Scio, GCP Dataflow, AWS EMR, and Apache Beam
- Familiarity with API design, microservice architecture, and frameworks like Spring Boot or gRPC.
- Programming skills in Java or Scala, and for analytics and data extraction (SQL, Python, Pandas, R)
- Commitment to agile software processes, data-driven development, reliability, and responsible experimentation
- Hands-on experience with orchestration tools like Airflow or similar
- Experience with database systems, both relational (e.g., PostgreSQL, MySQL) and non-relational (e.g., Cassandra, DynamoDB).
- Excellent problem-solving and communication skills