Tech Stack
Amazon RedshiftAWSAzureCloudETLGoogle Cloud PlatformGraphQL
About the role
- Define and govern the data architecture powering Hubexo’s global content lifecycle.
- Design conceptual, logical, and physical data models and data contracts for new systems.
- Lead data migration and mapping initiatives, ensuring seamless and accurate transition from legacy systems.
- Create and enforce standards for data attributes and metadata; build a comprehensive data dictionary as single source of truth.
- Establish frameworks for data quality, data governance, and data lineage tracking to ensure provenance and trust.
- Architect data structures enabling advanced analytics, AI, and machine learning initiatives.
- Collaborate with Solution Architects and API development teams to define schemas for GraphQL and REST APIs.
- Implement data governance and security policies with security teams to ensure GDPR compliance.
- Guide API developers and data migration strategies to ensure data integrity and quality.
Requirements
- 8+ years of experience in data architecture, with extensive experience in data modeling (conceptual, logical, physical) for complex, large-scale systems.
- Proven experience designing high-throughput, low-latency data ingestion and processing pipelines (ETL/ELT) in a cloud-native environment.
- Deep practical experience with core cloud data services (e.g., AWS S3, Glue, Redshift, Kinesis; or equivalents in Azure/GCP).
- Strong proficiency in defining data contracts and schemas for APIs (e.g., REST, GraphQL) and event-driven architectures.
- Expertise in establishing data governance frameworks, including data quality, metadata management, data lineage, and data security policies.
- Demonstrated experience in planning and overseeing data migration strategies from legacy to modern data platforms.
- Excellent communication and stakeholder management skills, with the ability to articulate complex technical concepts to diverse audiences.
- Demonstrated experience in breaking down large-scale, global systems into manageable components.
- Knowledge of GDPR and implementing data governance/security in partnership with security teams.