Work with the data science team to analyze data and improve its quality
Collaborate across teams to gather insights and data for various projects
Present findings and recommendations to stakeholders
Adapt to project changes and contribute to solution-based thinking
Utilize Apache Spark, Python, SQL, and other tools to extract and process data
Requirements
A desire to grow as a data professional working with purpose to help others (animals, coworkers, veterinarians)
Experience with Apache Spark infrastructure, or a willingness to learn it quickly
Strong experience with Python and SQL; PySpark and dbt experience would be a strong plus and an excellent fit for this position
Familiarity with navigating, pulling, and working with data in Databricks, Snowflake, and AWS
2-3 years of experience focused on ensuring data quality with natural language processing (NLP) techniques
Reasonable comfort presenting to stakeholders and breaking down big-picture ideas for both technical and non-technical audiences
Adaptability to variable project timelines and open-ended problems and solutions
Openness to communicating across teams, including being the first to reach out or to schedule skip-level meetings
Basic software engineering practices and standards, documentation skills, and experience using Git (previous projects or work on GitHub)
Experience with object-oriented programming principles is a plus
Experience using ontologies and hierarchical taxonomies for normalization and machine learning applications
Interest not just in generative AI but also in machine learning at large, including NER, LLMs, fuzzy matching, and statistics
Problem-solving abilities in real-world applications (anything from outside-the-box thinking to simple basic principles; creativity is encouraged, but we want to make sure maintainability is considered)
Time management skills that can handle the real-world timeline of intake requests -> requirements discussion -> delegation -> preparing for presentation and tweaks -> delivery (teamwork is important and preparation is 85% or more of the success of a project!)
Benefits
401k savings & company match
Paid time off
Paid holidays
Maternity leave
Parental leave
Military leave
Other leaves of absence
Health, dental, and vision benefits
Health savings accounts
Flexible spending accounts
Life & disability benefits
Identity theft protection
Pet insurance
Applicant Tracking System Keywords
Tip: use these terms in your resume and cover letter to boost ATS matches.
Hard skills
Python
SQL
Apache Spark
PySpark
DBT
natural language processing
object oriented programming
machine learning
statistics
data quality