Acelab

Senior Data Engineer

Acelab

full-time

Posted on:

Location Type: Remote

Location: Remote • 🇺🇸 United States

Visit company website
AI Apply
Apply

Job Level

Senior

Tech Stack

AirflowAWSAzureCloudDockerGoogle Cloud PlatformKafkaKubernetesNeo4jPostgresPythonSQL

About the role

  • Production Engineering: Transform experimental AI workflows into robust, automated production systems with comprehensive monitoring and quality assurance
  • System Architecture: Design scalable data processing pipelines, create reusable modular components, and establish engineering standards for team collaboration
  • Data Quality & Automation: Build evaluation frameworks to monitor pipeline quality, implement automated error handling, and reduce manual intervention requirements
  • Technical Leadership: Mentor team members, conduct thorough code reviews, and set best practices for AI system design

Requirements

  • Advanced Python programming with experience building production data processing systems
  • Proven experience productionalizing and scaling AI/ML systems using LangChain or similar LLM orchestration frameworks
  • Advanced SQL skills with PostgreSQL experience and familiarity with vector databases
  • Experience with Google Cloud Platform or another major cloud platform
  • Experience with containerization using Docker
  • Proficiency with workflow orchestration tools such as Airflow
  • Strong system design skills and experience with CI/CD pipelines
  • Strategic problem-solving with ability to choose appropriate AI versus deterministic approaches
  • Experience mentoring team members and setting technical standards
  • Experience conducting thorough code reviews with focus on quality, security, and performance
  • Self-motivated with proven ability to take ownership of complex technical initiatives
  • Excellent communication skills for working with other engineers, business subject matter experts, and product teams
  • Embrace learning of new technologies and sharing knowledge with colleagues
  • Data science experience, including evaluation and monitoring experience
  • MCP (Model Context Protocol) experience
  • Graph database (e.g. Neo4j) experience
  • Kubernetes experience
  • Experience with event-driven architectures (Kafka, GCP Pub/Sub, AWS SQS, or Azure Event Hubs)
  • Experience with web scraping
  • Interest in or experience with the AEC (Architecture, Engineering, Construction) industry
Benefits
  • Impact & Growth Opportunity
  • Drive Business Growth: Enable significant expansion of our data capabilities and platform reach
  • Shape Technical Direction: Influence system architecture decisions and establish patterns for platform evolution
  • Own Critical Infrastructure: Take ownership of core data workflows after onboarding
  • Work with Cutting-Edge Technology: Leverage the latest AI/LLM technology and supporting technologies in production

Applicant Tracking System Keywords

Tip: use these terms in your resume and cover letter to boost ATS matches.

Hard skills
PythonSQLPostgreSQLLangChainAI/ML systemsDockerAirflowCI/CD pipelinesdata processing systemsdata quality
Soft skills
technical leadershipmentoringproblem-solvingcommunicationownershipcollaborationcode reviewslearningknowledge sharingquality assurance
Certifications
MCP (Model Context Protocol)
Vālenz® Health

Senior Data Engineer

Vālenz® Health
Seniorfull-time🇺🇸 United States
Posted: 1 hour agoSource: valenz.applytojob.com
AzureETLNoSQLPySparkPythonRDBMS
Software Mind

MS SQL Server Data Engineer

Software Mind
Mid · SeniorcontractMississippi · 🇺🇸 United States
Posted: 1 hour agoSource: jobs.smartrecruiters.com
AWSAzureCloudETLITSMMS SQL ServerSQL
Circle

Data Engineer II / Senior Data Engineer

Circle
Seniorfull-time$148k–$195k / yearCalifornia, Pennsylvania, Texas · 🇺🇸 United States
Posted: 5 hours agoSource: circle.wd1.myworkdayjobs.com
AirflowAWSAzureBigQueryCloudGoogle Cloud PlatformJavaPythonScalaSQL
R1 RCM

Data Engineer II

R1 RCM
Junior · Midfull-time$80k–$110k / year🇺🇸 United States
Posted: 7 hours agoSource: r1rcm.wd1.myworkdayjobs.com
AWSAzureCloudHadoopJavaKafkaPythonScalaSDLCSpark