Tech Stack
AirflowAWSAzureBigQueryCloudDockerETLGoogle Cloud PlatformKafkaPythonSparkSQL
About the role
- Build and optimize data pipelines and AI systems
- Implement data pipelines using modern cloud platforms (Databricks, Snowflake, BigQuery) and streaming technologies (Kafka, Spark)
- Build and optimize ETL/ELT processes for large-scale data ingestion and transformation
- Develop RAG pipelines, vector search systems, and chatbot integrations using LangChain and OpenAI APIs
- Create ML pipelines with automated training, monitoring, and deployment using MLOps best practices
- Implement cloud-native solutions using AWS, Azure, and GCP services
- Build data quality monitoring and observability systems to ensure pipeline reliability
- Develop APIs and web interfaces for data access and analytics dashboards
- Optimize system performance, troubleshoot production issues, and implement cost-efficient solutions
- Contribute to code reviews, documentation, and engineering best practices
Requirements
- Bachelor's degree in Computer Science, Data Engineering, or related field; or equivalent industry experience
- 5+ years of hands-on experience building production data systems and pipelines
- Strong programming skills in Python and SQL
- Experience with at least one major cloud platform (AWS, Azure, GCP)
- Hands-on experience with data pipeline orchestration tools (Airflow, dbt, or similar)
- Knowledge of data warehousing concepts, data modeling, and performance optimization
- Experience with containerization (Docker) and CI/CD practices
- Understanding of API development and web service integration
- Strong problem-solving skills and ability to work independently on complex technical challenges
- Experience collaborating with cross-functional teams and communicating technical concepts clearly
- Health, dental, and vision insurance
- Paid Time Off and parental leave
- Flexible work arrangements
- Professional growth opportunities
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
data pipelinesAI systemsETLELTRAG pipelinesvector search systemschatbot integrationsML pipelinesprogramming in PythonSQL
Soft skills
problem-solvingindependent workcollaborationcommunication
Certifications
Bachelor's degree in Computer ScienceBachelor's degree in Data Engineering