
Data Engineer
Sutter Health
full-time
Location Type: Hybrid
Location: Sacramento • California • United States
Salary
💰 $145,205 - $217,797 per year
About the role
- Responsible for developing Sutter Health’s research analytic data infrastructure
- This includes all aspects of how data is ingested, stored, collected, governed, cleansed, accessed, and used
- Ensures that the data used by the organization is of the highest quality and is made available as soon as possible in a format that allows the business (researchers) to make critical decisions based on the data
- Utilizes tools and infrastructure such as scalable data pipelines to manage high-volume, high-speed data storage and retrieval
- Works with all types of data including batch and streaming data, structured, semi-structured and unstructured data
- Creates and improves processes required by other data-dependent functions including analytics, strategic business intelligence, and data science
- Develops and tests new architectures that enable data extraction, automation, and modeling for predictive or prescriptive analytic purposes
- Sets the standard for high-value, high-quality datasets that are accurate, timely, secure, and well-suited to the research organization's strategic analytic purposes
- Works on IRB-approved research studies, providing accurate and timely curated data
- Works closely with principal investigators and statisticians
- Works in accordance with Research Privacy and HIPAA regulations and methods for safeguarding PHI and PII
- This position is part of a new, exciting strategic initiative within Sutter Health Research
Requirements
- Bachelor’s degree in Computer Science, Engineering, Information Management, or Healthcare Administration
- 8 years of recent, relevant experience
- Experience creating data pipelines on big data platforms and data integrations in databases and data lakes
- Experience leveraging scalable data platforms to build secure infrastructure
- Experience building batch or streaming data ingestion pipelines
- Ability to assess and profile raw data and reassemble raw data from multiple sources into a single, enterprise model
- Hands-on experience with data management tools (Cloudera, Spark, Python, Databricks, etc.)
- Fluency with SQL programming, scripting, and data architecture
- Extensive familiarity with relational database concepts / technologies (SQL, Oracle, etc.) including data design, table design, partitioning
- Experience ensuring data quality and implementing tools and frameworks for automating identification of data quality issues
- Strong understanding of data engineering and data traceability best practices and frameworks
- Ability to work in a consulting role, building technology and communicating with end-users and customers of varying levels of technical capability
- Strong knowledge of the development of Business Intelligence and Reporting solutions
- Strong problem solving, organization, and prioritization skills
- Detail-oriented, able to produce timely results and to work both independently with minimal supervision and as a member of a scrum/product team
- Familiar with healthcare provider data structures and sources; experienced with HIPAA regulations and methods for safeguarding PHI and PII through mitigation of data exposure risk
Benefits
- Yes
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
data pipelines, big data platforms, data integrations, data lakes, data management, SQL programming, data architecture, data quality, data engineering, Business Intelligence
Soft Skills
problem solving, organization, prioritization, detail-oriented, independent work, team collaboration, communication, consulting