
Data Engineer
Yahara Software
full-time
Posted on:
Location Type: Hybrid
Location: Madison • Wisconsin • United States
Visit company websiteExplore more
Tech Stack
About the role
- Design and maintain enterprise-scale pipelines using CDP and big data tooling.
- Build scalable ETL/ELT workflows for structured and unstructured data.
- Develop distributed processing jobs using big data framework components.
- Design data storage solutions balancing performance and cost.
- Collaborate with analysts, scientists, and developers to deliver data solutions.
- Develop technical documentation for pipelines and architectures.
Requirements
- 5–7 years in data engineering with big data or distributed systems.
- Experience with CDP, CDH, or similar enterprise big data platforms.
- Degree in CS, Data Science, Information Systems, or equivalent experience.
- Strong background in distributed data processing.
- Ability to obtain and maintain Public Trust clearance.
- Self-starter with a passion for data engineering.
- Strong analytical and problem-solving skills.
- Enthusiastic about big data technologies and performance optimization.
- Detail-oriented with a commitment to accuracy and reliability.
- Ability to translate business requirements into effective solutions.
- Collaborative, able to recognize blockers and leverage team strengths.
- Experience with Agile development environments.
- Proven experience designing and implementing production pipelines.
- Experience in biohealth, laboratory, or scientific data environments is a plus.
- Familiarity with HIPAA, FDA, or GxP preferred but not required.
- Cloudera ecosystem experience: CDP, HDFS, Hive/Impala, Spark.
- Programming: Python, Scala, or Java.
- Advanced SQL and distributed compute (Spark, MapReduce).
- Shell scripting and version control (Git).
- Data storage formats: Parquet, Avro, ORC.
- Workflow orchestration and scheduling.
- Cloud experience (Azure, AWS, or GCP) and understanding of hybrid patterns.
Benefits
- 20+ days of PTO accruable in the first year!
- Comprehensive health insurance (Medical, Dental, Vision) with HMO and PPO options
- Health Savings Account (HSA) with annual employer contributions
- 401(k) with guaranteed company match (Traditional and Roth options)
- 100% company-paid short-term and long-term disability
- 100% company-paid life insurance with option to increase coverage
- 100% company-paid identity theft protection
- On-site gym with basketball court
- Hybrid/remote schedule with home office stipend
- Fresh fruit, healthy snacks, and beverages provided daily
- Bonus certification program (Microsoft, AWS, PMP, IIBA, etc.)
- Employee Assistance Program (counseling, legal, financial services)
- Monthly and Quarterly Recognition Awards with spot bonuses
- Company-supported community outreach and volunteer opportunities
- Employee-run committee involvement opportunities
- Collaborative culture founded on realized values and incredible people
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard Skills & Tools
ETLELTbig datadistributed processingPythonScalaJavaSQLSparkMapReduce
Soft Skills
analytical skillsproblem-solving skillsdetail-orientedcollaborativeself-starterpassion for data engineeringcommitment to accuracyability to translate business requirementsenthusiastic about big data technologiesability to recognize blockers