Expand the Developer tools that capture business events, operational data and third party data into the Lakehouse.
Expand the self-serve and orchestration framework that developers use to build pipelines.
Improve the observability and SLAs for the above services and raw layer data in the Lakehouse.
Build and Improve the tools owned by the Capture team
Own the service/raw layer data observability and SLAs
Participate in planning and prioritization by collaborating with stakeholders across the various product verticals and functions (ML, Analytics, Finance) to ensure our architecture aligns with the overall business objectives.
Collaborate with stakeholders such as Software Engineering, Machine Learning, Machine Learning Platform and Analytics teams to ingest data into the Lakehouse and adopt the developer tools
Participate in code reviews and architecture discussions to exchange actionable feedback with peers.
Contribute to engineering best practices and mentor junior team members.
Help break down complex projects and requirements into sprints.
Continuously monitor and improve data platform performance, reliability, and security.
Stay up-to-date with emerging technologies and industry best practices in data engineering.
Requirements
A bachelor's degree in Computer Science, Data Science, Engineering, or a related field.
3+ years of experience in data engineering or related fields, with a strong focus on data quality, governance, and data infrastructure.
Proficiency in data engineering tech stack; Databricks / PostgreSQL / Python / Spark / Kafka/ Streaming / SQL / AWS / Airflow/ Airbyte/ Fivetran / DBT / EKS/ containers and orchestration (Docker, Kubernetes) and others.
Ability to approach problems with first principles thinking, embrace ambiguity, and enjoy collaborative work on complex solutions.
Benefits
target bonuses
equity compensation
generous benefits packages (including medical, dental, vision, and 401k)
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
data engineeringdata qualitydata governancedata infrastructurePythonSQLSparkKafkaDatabricksAWS