Salary
💰 PLN 289,000 - PLN 325,000 per year
Tech Stack
AirflowAmazon RedshiftApacheBigQueryDistributed SystemsETLKafkaNumpyPandasPulsarPythonSQL
About the role
- Architect and build the backbone of a next-generation multi-tenant influencer marketing analytics platform
- Design and implement ETL pipelines migrating from transactional databases to analytical data warehouses
- Create real-time data ingestion systems processing campaign data, user metrics, and business intelligence
- Build multi-tenant data models with proper partitioning strategies for enterprise-scale clients
- Develop data quality frameworks with comprehensive validation, monitoring, and alerting
- Implement Row-Level Security (RLS), Role-Based Access Control (RBAC), dynamic permission models, and session-based context management
- Create comprehensive audit trails and access logging for compliance requirements
- Design database schemas with advanced partitioning and indexing strategies and build materialized views and aggregated tables for real-time analytics
- Implement query optimization, data skipping, and compression techniques to handle high-concurrency embedded dashboard usage with sub-second query performance
- Build dashboard data sources with optimized SQL transformations, handle complex data structures and parsing, and create denormalized tables optimized for embedded analytics consumption
- Collaborate with cross-functional teams to deliver secure, scalable analytics solutions for enterprise partners
Requirements
- 5+ years of data engineering experience with production-scale systems
- Expert-level SQL skills with analytical databases (columnar databases preferred)
- Strong Python programming with data libraries: pandas, numpy, pyarrow
- Experience with ETL orchestration tools: Apache Airflow, Prefect, dbt, or similar
- Deep understanding of analytical databases, partitioning strategies, and OLAP optimization
- Experience building SaaS data platforms with tenant isolation requirements
- Knowledge of Row-Level Security (RLS) implementation in analytical databases
- Understanding of RBAC patterns and session-based access control
- Experience with authentication flows in data systems
- Familiarity with compliance requirements (SOC2, GDPR) for multi-tenant data
- Bonus: Experience with columnar databases (ClickHouse, BigQuery, Redshift, Snowflake)
- Bonus: Knowledge of streaming data platforms (Apache Kafka, Pulsar, Kinesis)
- Bonus: Understanding of distributed systems and database replication
- Bonus: Experience with embedded analytics platforms, semantic modeling, and data visualization
- Bonus: Experience with real-time data processing, data mesh/fabric architectures, ML pipeline integration, data lineage and catalog tools