Seneca Holdings

Data Engineer

Seneca Holdings

full-time

Posted on:

Origin:  • 🇺🇸 United States

Visit company website
AI Apply
Apply

Job Level

Mid-LevelSenior

Tech Stack

ETLPythonSparkSQL

About the role

  • Design, build, and maintain scalable data pipelines and ETL/ELT processes to support analytics and business intelligence
  • Develop and optimize data models for both operational and analytical systems
  • Implement best practices for data ingestion, transformation, storage, and retrieval across environments
  • Collaborate with data scientists, analysts, and business stakeholders to define data requirements and deliver reliable solutions
  • Ensure data quality, consistency, and governance through monitoring, testing, and validation frameworks
  • Write and maintain efficient SQL queries, stored procedures, and scripts for data processing
  • Work with structured and unstructured data from multiple sources (databases, APIs, streaming platforms, third-party feeds)
  • Optimize performance and scalability of large datasets using distributed computing frameworks (e.g., Spark)
  • Apply security, compliance, and privacy standards to data solutions
  • Curate data for reusability and ensure normalization, deduplication, and metadata reconciliation
  • Troubleshoot data-related issues to ensure minimal downtime and reliable delivery of data products
  • Provide Data Quality measures and improvements for silver and gold data zones
  • Ensure data is discoverable, well-managed, and integrated with metadata and business rules using tools like Collibra
  • Coordinate across Advana lines of business to link mission critical data with other domain data
  • Auto-catalog data sources and integrate with data dictionaries and metadata to expose data descriptions and business rules

Requirements

  • Must have an active SECRET clearance
  • Bachelor of Science or Arts from an accredited university or college, scientific or technical discipline preferred
  • Must have a minimum of 5 years of experience supporting Federal clients with implementing data management practices
  • Must have a minimum of 3 years of experience with Python programming language
  • Must have a minimum of 3 years of experience with enterprise data tools like Databricks, Databricks SQL Editor, Erwin data modeler, etc.
  • Experience designing, building, and maintaining scalable data pipelines and ETL/ELT processes
  • Experience developing and optimizing data models for operational and analytical systems
  • Proficiency with SQL queries, stored procedures, and scripting for data processing
  • Experience with structured and unstructured data sources (databases, APIs, streaming platforms, third-party feeds)
  • Experience with distributed computing frameworks (e.g., Spark)
  • Knowledge of data quality, testing, validation frameworks, and metadata/catalog management (e.g., Collibra)
  • Familiarity with data governance, security, compliance, and privacy standards
  • Ability to normalize data, deduplicate, investigate mismatches, and reconcile metadata
  • Strong organizational, writing, and collaboration skills (preferred)
  • Competent writing skills and ability to work independently in Microsoft 365 (preferred)
  • Up to date knowledge of Collibra Data Governance Center functionalities (preferred)
  • Understanding of Procurement Lifecycle (preferred)
NVIDIA

Senior Data Engineer – Developer Programs

NVIDIA
Seniorfull-time$168k–$322k / yearCalifornia · 🇺🇸 United States
Posted: 5 days agoSource: nvidia.wd5.myworkdayjobs.com
AWSETLPythonSQL
Diabetes Youth Families

Data Engineer

Diabetes Youth Families
Junior · Midfull-time🇲🇽 Mexico
Posted: 6 days agoSource: insulet.wd5.myworkdayjobs.com
AWSAzureCloudETLJavaMongoDBPythonSparkSQL
Providence

Data Software Engineer I

Providence
Juniorfull-time$41–$64🇺🇸 United States
Posted: 14 hours agoSource: evac.fa.us2.oraclecloud.com
ETLJavaNoSQLPySparkPythonSQL
Medsuite Inc

Lead Snowflake Data Engineer

Medsuite Inc
Seniorfull-time🇮🇳 India
Posted: 3 days agoSource: globalcareers-ventrahealth.icims.com
ApacheAzureCloudETLKafkaMatillionPythonSparkSQL
Medsuite Inc

Lead Snowflake Data Engineer

Medsuite Inc
Seniorfull-time🇮🇳 India
Posted: 6 days agoSource: globalcareers-ventrahealth.icims.com
ApacheAzureCloudETLKafkaMatillionPythonSparkSQL